- Boxes: definitions
- Ellipses: theorems and lemmas
- Blue border: the statement of this result is ready to be formalized; all prerequisites are done
- Orange border: the statement of this result is not ready to be formalized; the blueprint needs more work
- Blue background: the proof of this result is ready to be formalized; all prerequisites are done
- Green border: the statement of this result is formalized
- Green background: the proof of this result is formalized
- Dark green background: the proof of this result and all its ancestors are formalized
- Dark green border: this is in Mathlib
The closed convex hull of a nonempty set \(S\subset \mathbb {R}^n\) is the intersection of all closed convex sets containing \(S\). It will be denoted by \(\overline{\operatorname {co}}\, S\).
Let \(x\in \operatorname {cl}C\) and \(x'\in \operatorname {ri}C\). Then the half-open segment
\[ [x',x[\; =\; \{ \alpha x'+(1-\alpha )x \, :\, \alpha \in \, ]0,1]\, \} \]
is contained in \(\operatorname {ri}C\).
For \(i=1,\ldots ,k\), let \(C_i\subset \mathbb {R}^{n_i}\) be convex sets. Then
A nonempty convex subset \(F\subset C\) is a face of \(C\) if it satisfies the following property: every segment of \(C\), having in its relative interior an element of \(F\), is entirely contained in \(F\). In other words,
The direction \(s\in \mathbb {R}^n\) is said to be normal to \(C\) at \(x\in C\) when
The set of all such directions is called the normal cone to \(C\) at \(x\), denoted by \(N_C(x)\).
For given \(x\in C\) and \(d\in \mathbb {R}^n\), there holds
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) [resp. \(\overline{\operatorname {Conv}}\mathbb {R}^n\)] and let \(g\in \operatorname {Conv}\mathbb {R}\) [resp. \(\overline{\operatorname {Conv}}\mathbb {R}\)] be increasing. Assume that there is \(x_0\in \mathbb {R}^n\) such that \(f(x_0)\in \operatorname {dom}g\), and set \(g(+\infty ):=+\infty \). Then the composite function \(g\circ f:\; x\mapsto g(f(x))\) is in \(\operatorname {Conv}\mathbb {R}^n\) [resp. in \(\overline{\operatorname {Conv}}\mathbb {R}^n\)].
Let \(f_1\) and \(f_2\) be two functions from \(\mathbb {R}^n\) to \(\mathbb {R}\cup \{ +\infty \} \). Their infimal convolution is the function from \(\mathbb {R}^n\) to \(\mathbb {R}\cup \{ \pm \infty \} \) defined by
Let \(f\in \operatorname {Conv}\mathbb {R}^n\). All the nonempty sublevel-sets of \(f\) have the same asymptotic cone, which is the sublevel-set of \(f'_\infty \) at the level \(0\):
In particular, the following statements are equivalent:
There is \(r\) for which \(S_r(f)\) is nonempty and compact;
all the sublevel-sets of \(f\) are compact;
\(f'_\infty (d){\gt}0\) for all nonzero \(d\in \mathbb {R}^n\).
Let \(C\subset \mathbb {R}^n\) be convex. The mapping \(F: C\to \mathbb {R}^n\) is said to be monotone [resp. strictly monotone, resp. strongly monotone with modulus \(c{\gt}0\)] on \(C\) when, for all \(x\) and \(x'\) in \(C\),
[resp. \(\langle F(x)-F(x'),\, x-x'\rangle {\gt} 0\) whenever \(x\neq x'\), resp. \(\langle F(x)-F(x'),\, x-x'\rangle \ge c\| x-x'\| ^2\)].
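As a hedged numerical sketch (the matrix \(A\), the modulus \(c\), and the random sampling below are illustrative assumptions, not from the text): the gradient map of a positive definite quadratic is strongly monotone with any modulus below the smallest eigenvalue.

```python
# Illustrative check (assumed data): for f(x) = 0.5 * x^T A x with A symmetric
# positive definite, the map F(x) = A x satisfies
#   <F(x) - F(x'), x - x'> = (x - x')^T A (x - x') >= c * ||x - x'||^2
# for any modulus c below the smallest eigenvalue of A.
import random

A = [[2.0, 0.5],
     [0.5, 1.0]]   # symmetric positive definite; lambda_min ~ 0.793
c = 0.7            # a valid modulus since c < lambda_min

def F(x):
    return [A[0][0] * x[0] + A[0][1] * x[1],
            A[1][0] * x[0] + A[1][1] * x[1]]

def inner(u, v):
    return u[0] * v[0] + u[1] * v[1]

random.seed(0)
strongly_monotone = True
for _ in range(1000):
    x = [random.uniform(-5, 5), random.uniform(-5, 5)]
    xp = [random.uniform(-5, 5), random.uniform(-5, 5)]
    d = [x[0] - xp[0], x[1] - xp[1]]
    Fx, Fxp = F(x), F(xp)
    Fd = [Fx[0] - Fxp[0], Fx[1] - Fxp[1]]
    if inner(Fd, d) < c * inner(d, d) - 1e-9:
        strongly_monotone = False
print(strongly_monotone)
```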
The conjugate of a function \(f\) satisfying (1.1.1) is the function \(f^{*}\) defined by
For simplicity, we may also let \(x\) run over the whole space instead of \(\operatorname {dom}f\).
The mapping \(f\mapsto f^{*}\) will often be called the conjugacy operation, or the Legendre–Fenchel transform.
A function \(f\) satisfying (1.1.1) is said to be coercive [resp. 1-coercive] when
Let \(g\in \operatorname {Conv}\mathbb {R}^m\) be such that \(0\in \text{dom }g\) and let \(A_0\) be linear from \(\mathbb {R}^n\) to \(\mathbb {R}^m\). Make the following assumption:
Then \((g\circ A_0)^* = A_0^* g^*\); for every \(s\in \text{dom }(g\circ A_0)^*\), the problem
has at least one optimal solution \(\bar p\) and there holds \((g\circ A_0)^*(s)=A_0^*g^*(s)=g^*(\bar p)\).
The projected subgradient descent algorithm with time-varying step size \((\eta _t)_{t\ge 1}\), that is, for \(t \ge 1\):
Let \(\mathcal X\) be a convex set with non-empty interior, and \(f\) a \(C^3\) convex function defined on \(\operatorname {int}(\mathcal X)\). Then \(f\) is self-concordant (with constant \(M\)) if for all \(x\in \operatorname {int}(\mathcal X)\), \(h\in \mathbb R^n\),
We say that \(f\) is standard self-concordant if \(f\) is self-concordant with constant \(M=2\).
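For instance (a sketch with the assumed one-dimensional instance \(f(x)=-\log x\), \(h=1\)), the negative logarithm satisfies the defining inequality with \(M=2\), in fact with equality:

```python
# Sketch (assumed instance): f(x) = -log(x) on (0, inf) has
# f''(x) = 1/x**2 and f'''(x) = -2/x**3, so |f'''(x)| = 2 * f''(x)**1.5
# exactly -- the standard self-concordance inequality holds with M = 2.
def f2(x):
    return 1.0 / x ** 2   # second derivative of -log

def f3(x):
    return -2.0 / x ** 3  # third derivative of -log

points = [0.1, 0.5, 1.0, 3.0, 10.0]
standard_sc = all(abs(f3(x)) <= 2.0 * f2(x) ** 1.5 * (1 + 1e-9) for x in points)
print(standard_sc)
```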
Let \(C_1,\, C_2\) be two nonempty closed convex sets with \(C_1\cap C_2=\varnothing \). If \(C_2\) is bounded, there exists \(s\in \mathbb {R}^n\) such that
The data \((s_j,r_j)\in \mathbb {R}^n\times \mathbb {R}\) for \(j\) in an arbitrary index set \(J\) is equivalent to the data of a closed convex set \(C\) via the relation
For a nonempty closed convex set \(S\) and a closed sublinear function \(\sigma \), the following are equivalent:
\(\sigma \) is the support function of \(S\).
\(S=\{ s:\ \langle s,d\rangle \le \sigma (d)\text{ for all }d\in X\} ,\) where the set \(X\) can be indifferently taken as: the whole of \(\mathbb {R}^n\), the unit ball \(B(0,1)\) or its boundary, or \(\operatorname {dom}\sigma \).
Let \((S_k)\) be a sequence of nonempty convex compact sets and \(S\) a nonempty convex compact set. When \(k \to +\infty \), the following are equivalent
\(S_k \to S\) in the Hausdorff sense, i.e. \(\Delta _H(S_k,S)\to 0\);
\(\sigma _{S_k}\to \sigma _S\) pointwise;
\(\sigma _{S_k}\to \sigma _S\) uniformly on each compact set of \(\mathbb {R}^n\).
If the convex \(f\) is (Gâteaux) differentiable at \(x\), its only subgradient at \(x\) is its gradient \(\nabla f(x)\). Conversely, if \(\partial f(x)\) contains only one element \(s\), then \(f\) is (Fréchet) differentiable at \(x\), with \(\nabla f(x)=s\).
For some compact set \(Y\subset \mathbb {R}^p\), let \(g:\mathbb {R}^n\times Y\to \mathbb {R}\) be a function satisfying the following properties:
for each \(x\in \mathbb {R}^n\), \(g(x,\cdot )\) is upper semi-continuous;
for each \(y\in Y\), \(g(\cdot ,y)\) is convex and differentiable;
the function \(f:=\sup _{y\in Y} g(\cdot ,y)\) is finite-valued on \(\mathbb {R}^n\);
at some \(x\in \mathbb {R}^n\), \(g(x,\cdot )\) is maximized at a unique \(y(x)\in Y\).
Then \(f\) is differentiable at this \(x\), and its gradient is
(where \(\nabla _x g(x,y)\) denotes the gradient of the function \(g(\cdot ,y)\) at \(x\)).
Suppose that the subdifferential of \(g\) in (4.5.4) is associated with a scalar product \(\langle \! \langle \cdot ,\cdot \rangle \! \rangle \) preserving the structure of a product space:
At a given \(x\in \mathbb {R}^n\), take an arbitrary \(y\) solving (4.5.4). Then
Let \(f_1\) and \(f_2:\mathbb {R}^n\to \mathbb {R}\) be two convex functions minorized by a common affine function. For given \(x\), let \((y_1,y_2)\) be such that the inf-convolution is exact at \(x=y_1+y_2\), i.e.: \((f_1\mathbin {\square }f_2)(x)=f_1(y_1)+f_2(y_2)\). Then
With \(g:\mathbb {R}^n\times \mathbb {R}^p=\mathbb {R}^m\to \mathbb {R}\cup \{ +\infty \} \) not identically \(+\infty \), let \(g^*\) be associated with a scalar product preserving the structure of \(\mathbb {R}^m\) as a product space: \(\langle \cdot ,\cdot \rangle _m=\langle \cdot ,\cdot \rangle _n+\langle \cdot ,\cdot \rangle _p\). If there exists \(s_0\in \mathbb {R}^n\) such that \((s_0,0)\in \operatorname {dom} g^*\), then the conjugate of \(f\) defined by (2.1.2) is
Let \(f_1\) and \(f_2\) be two functions from \(\mathbb {R}^n\) to \(\mathbb {R}\cup \{ +\infty \} \), not identically \(+\infty \), and satisfying \(\operatorname {dom} f_1^*\cap \operatorname {dom} f_2^*\neq \varnothing \). Then their inf-convolution satisfies (1.1.1), and \((f_1\mathbin {\square }f_2)^*=f_1^*+f_2^*\).
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be strictly convex, differentiable and 1-coercive. Then
\(f^{*}\) is also finite-valued on \(\mathbb {R}^n\), strictly convex, differentiable and 1-coercive;
the continuous mapping \(\nabla f\) is one-to-one from \(\mathbb {R}^n\) onto \(\mathbb {R}^n\), and its inverse is continuous;
Let \(f\) be a standard self-concordant function on \(\mathcal{X}\). For \(x\in \operatorname {int}(\mathcal{X})\), we say that \(\lambda _f(x)=\| \nabla f(x)\| _{x}^*\) is the Newton decrement of \(f\) at \(x\).
\(F\) is a \(\nu \)-self-concordant barrier if it is a standard self-concordant function, and it is \(\tfrac {1}{\nu }\)-exp-concave.
\(F:\operatorname {int}(\mathcal{X})\to \mathbb {R}\) is a barrier for \(\mathcal{X}\) if
In mirror descent the projection is done via the Bregman divergence associated with \(\Phi \). Precisely, one defines
We consider the following simple iterative algorithm: let \(S_1=\mathcal{X}\), and for \(t\ge 1\) do the following:
Compute
\begin{equation} c_t=\frac{1}{\operatorname {vol}(S_t)}\int _{x\in S_t} x\, dx. \label{eq:2.1} \end{equation}
Query the first order oracle at \(c_t\) and obtain \(w_t\in \partial f(c_t)\). Let
\[ S_{t+1}=S_t\cap \{ x\in \mathbb {R}^n:(x-c_t)^\top w_t\le 0\} . \]
If stopped after \(t\) queries to the first order oracle then we use \(t\) queries to a zeroth order oracle to output
This procedure is known as the center of gravity method; it was discovered independently on both sides of the Wall by Levin [1965] and Newman [1965].
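A one-dimensional sketch (the instance \(\mathcal X=[0,1]\), \(f(x)=(x-0.3)^2\) is an assumption for illustration): in dimension one the center of gravity of an interval is its midpoint, so each cut halves the localizing set \(S_t\).

```python
# 1-D sketch of the center of gravity method (assumed instance).
lo, hi = 0.0, 1.0          # S_1 = X = [0, 1]

def grad(x):               # first order oracle for f(x) = (x - 0.3)**2
    return 2.0 * (x - 0.3)

queried = []
for t in range(30):
    c = 0.5 * (lo + hi)    # center of gravity of the interval S_t
    queried.append(c)
    w = grad(c)
    if w > 0:              # keep S_t intersect {x : (x - c) * w <= 0}
        hi = c
    elif w < 0:
        lo = c
    else:
        break

best = min(queried, key=lambda x: (x - 0.3) ** 2)  # zeroth order oracle output
err = abs(best - 0.3)
print(err)
```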
Let \(x_1 = y_1\) be an arbitrary initial point, and
The conditional gradient descent, introduced in Frank and Wolfe [1956], performs the following update for \(t\ge 1\), where \((\gamma _s)_{s\ge 1}\) is a fixed sequence,
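A hedged sketch of the method (everything concrete here is an assumption: the standard update \(x_{t+1}=(1-\gamma _t)x_t+\gamma _t\,\operatorname*{arg\,min}_{y\in \mathcal X}\langle \nabla f(x_t),y\rangle \), the instance, and the step sizes \(\gamma _t=2/(t+1)\)):

```python
# Sketch of conditional gradient (assumed instance: X = probability simplex
# in R^3, f(x) = ||x - p||^2 with p in X, gamma_t = 2/(t + 1)).  The linear
# minimization step argmin_{y in X} <grad f(x_t), y> is attained at a vertex.
p = [0.2, 0.3, 0.5]
x = [1.0, 0.0, 0.0]                                # x_1: a vertex of X
for t in range(1, 2001):
    g = [2.0 * (x[i] - p[i]) for i in range(3)]    # gradient at x_t
    i_star = min(range(3), key=lambda i: g[i])     # best vertex e_{i_star}
    gamma = 2.0 / (t + 1)
    x = [(1.0 - gamma) * x[i] for i in range(3)]   # x_{t+1} = (1-gamma) x_t
    x[i_star] += gamma                             #         + gamma e_{i_star}
gap = sum((x[i] - p[i]) ** 2 for i in range(3))    # f(x) - min f  (min f = 0)
print(gap)
```

Note that every iterate is a convex combination of simplex vertices, so no projection is ever needed; this is the main appeal of the method.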
The mirror descent strategy based on a mirror map \(\Phi \) is as follows: let \(x_1\in \operatorname*{arg\, min}_{x\in \mathcal{X}\cap \mathcal{D}}\Phi (x)\). Then for \(t\ge 1\), let \(y_{t+1}\in \mathcal{D}\) such that
and
Let \(\mathcal{D}\subset \mathbb {R}^n\) be a convex open set such that \(\mathcal{X}\) is included in its closure, that is \(\mathcal{X}\subset \overline{\mathcal{D}}\), and \(\mathcal{X}\cap \mathcal{D}\neq \varnothing \). We say that \(\Phi :\mathcal{D}\to \mathbb {R}\) is a mirror map if it satisfies the following properties:
\(\Phi \) is strictly convex and differentiable.
The gradient of \(\Phi \) takes all possible values, that is \(\nabla \Phi (\mathcal{D})=\mathbb {R}^n\).
The gradient of \(\Phi \) diverges on the boundary of \(\mathcal{D}\), that is
\[ \lim _{x\to \partial \mathcal{D}}\| \nabla \Phi (x)\| =+\infty . \]
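A classical example of a mirror map satisfying these properties is the negative entropy \(\Phi (x)=\sum _i x_i\log x_i\) on \(\mathcal D=(0,+\infty )^n\): its gradient \((1+\log x_i)_i\) is onto and blows up at the boundary. The sketch below (the linear objective, step size, and iteration count are illustrative assumptions) shows that mirror descent over the simplex then reduces to a multiplicative-weights update.

```python
import math
# Sketch: with the negative-entropy mirror map Phi(x) = sum_i x_i*log(x_i)
# on D = (0, inf)^n, mirror descent over the simplex becomes the
# multiplicative-weights rule (assumed instance: f(x) = <l, x> with
# l = (0.9, 0.1, 0.5), eta = 0.1).
l = [0.9, 0.1, 0.5]
eta = 0.1
x = [1.0 / 3.0] * 3                 # argmin of Phi over the simplex
for _ in range(500):
    w = [x[i] * math.exp(-eta * l[i]) for i in range(3)]  # dual gradient step
    s = sum(w)
    x = [wi / s for wi in w]        # Bregman projection back onto the simplex
print(x)
```

The iterates concentrate on the coordinate with the smallest loss, as expected for a linear objective over the simplex.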
Mirror prox is described by the following equations:
Nesterov’s accelerated gradient descent can be described as follows: Start at an arbitrary initial point \(x_1=y_1\) and then iterate the following equations for \(t\ge 1\),
Nesterov’s accelerated gradient descent for the case \(\alpha = 0\). First we define the following sequences:
(Note that \(\gamma _t \le 0\).) Now the algorithm is simply defined by the following equations, with \(x_1 = y_1\) an arbitrary initial point,
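A runnable sketch of this scheme (the quadratic instance and the sequences below are assumptions; the sequences are the standard choice \(\lambda _0=0\), \(\lambda _t=(1+\sqrt{1+4\lambda _{t-1}^2})/2\), \(\gamma _t=(1-\lambda _t)/\lambda _{t+1}\), consistent with \(\gamma _t\le 0\)):

```python
import math
# Sketch of the alpha = 0 scheme (assumed instance f(x) = x1^2 + 10*x2^2,
# beta = 20).  Assumed sequences:
#   lambda_0 = 0,  lambda_t = (1 + sqrt(1 + 4*lambda_{t-1}^2)) / 2,
#   gamma_t = (1 - lambda_t) / lambda_{t+1}   (note gamma_t <= 0).
beta = 20.0

def grad(x):
    return [2.0 * x[0], 20.0 * x[1]]

lam_prev = 0.0
x = [10.0, 10.0]          # x_1 = y_1, arbitrary
y = [10.0, 10.0]
for t in range(1, 1001):
    lam = (1.0 + math.sqrt(1.0 + 4.0 * lam_prev ** 2)) / 2.0
    lam_next = (1.0 + math.sqrt(1.0 + 4.0 * lam ** 2)) / 2.0
    gamma = (1.0 - lam) / lam_next
    g = grad(x)
    y_next = [x[i] - g[i] / beta for i in range(2)]              # gradient step
    x = [(1.0 - gamma) * y_next[i] + gamma * y[i] for i in range(2)]  # momentum
    y = y_next
    lam_prev = lam
val = y[0] ** 2 + 10.0 * y[1] ** 2        # f(y_t) - f*, since f* = 0
print(val)
```

Since \(\gamma _t\le 0\), the momentum step is an extrapolation past \(y_{t+1}\) in the direction \(y_{t+1}-y_t\).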
The set \(C \subset \mathbb {R}^n\) is said to be convex if \( \alpha x + (1-\alpha )x'\) is in \(C\) whenever \(x\) and \(x'\) are in \(C\), and \(\alpha \in ]0,1[\) (or equivalently \(\alpha \in [0,1]\)).
A convex combination of elements \(x_1,\ldots ,x_k\) in \(\mathbb {R}^n\) is an element of the form
where
A conical combination of elements \(x_1,\dots ,x_k\) is an element of the form \(\sum _{i=1}^k \alpha _i x_i\), where the coefficients \(\alpha _i\) are nonnegative.
The set of all conical combinations from a given nonempty \(S\subset \mathbb {R}^n\) is the conical hull of \(S\). It is denoted by \(\operatorname {cone} S\).
The closed conical hull (or rather closed convex conical hull) of a nonempty set \(S\subset \mathbb {R}^n\) is
The relative interior \(\operatorname {ri} C\) (or \(\operatorname {relint} C\)) of a convex set \(C \subset \mathbb {R}^n\) is the interior of \(C\) for the topology relative to the affine hull of \(C\). In other words: \(x \in \operatorname {ri} C\) if and only if
The dimension of a convex set \(C\) is the dimension of its affine hull, that is to say the dimension of the subspace parallel to \(\operatorname {aff} C\).
The asymptotic cone, or recession cone of the closed convex set \(C\) is the closed convex cone \(C_{\infty }\) defined by (2.2.1) or (2.2.2), in which Proposition 2.2.1 is exploited.
We say that \(x\in C\) is an extreme point of \(C\) if there are no two different points \(x_1\) and \(x_2\) in \(C\) such that \(x=\tfrac {1}{2}(x_1+x_2)\).
An affine hyperplane \(H_{s,r}\) is said to support the set \(C\) when \(C\) is entirely contained in one of the two closed half-spaces delimited by \(H_{s,r}\): say
It is said to support \(C\) at \(x\in C\) when, in addition, \(x\in H_{s,r}\); (2.4.1) holds, as well as
The set \(F\subset C\) is an exposed face of \(C\) if there is a supporting hyperplane \(H_{s,r}\) of \(C\) such that \(F=C\cap H_{s,r}\).
An exposed point, or vertex, is a \(0\)-dimensional exposed face, i.e. a point \(x\in C\) at which there is a supporting hyperplane \(H_{s,r}\) of \(C\) such that \(H_{s,r}\cap C\) reduces to \(\{ x\} \).
Let \(K\) be a convex cone, as defined in Example 1.1.4. The polar cone of \(K\) (called negative polar cone by some authors) is:
A closed convex polyhedron is an intersection of finitely many half-spaces. Take \((s_1,r_1),\ldots ,(s_m,r_m)\) in \(\mathbb {R}^n\times \mathbb {R}\), with \(s_i\neq 0\) for \(i=1,\ldots ,m\); then define
or in matrix notation (assuming the dot-product for \(\langle \cdot ,\cdot \rangle \)),
if \(A\) is the matrix whose rows are the \(s_i\) and \(b\in \mathbb {R}^m\) has coordinates \(r_i\).
A closed convex polyhedral cone is the special case where \(b=0\).
Let \(S\subset \mathbb {R}^n\) be nonempty. We say that \(d\in \mathbb {R}^n\) is a direction tangent to \(S\) at \(x\in S\) when there exists a sequence \(\{ x_k\} \subset S\) and a sequence \(\{ t_k\} \) such that, when \(k\to +\infty \),
The set of all such directions is called the tangent cone (also called the contingent cone, or Bouligand’s cone) to \(S\) at \(x\in S\), denoted by \(T_S(x)\).
Let \(C\) be a nonempty convex set in \(\mathbb {R}^n\). A function \(f : C \to \mathbb {R}\) is said to be convex on \(C\) when, for all pairs \((x,x')\in C\times C\) and all \(\alpha \in [0,1]\), there holds
(The Set \(\operatorname {Conv}\mathbb {R}^n\)) A function \(f:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), not identically \(+\infty \), is said to be convex when, for all \((x,x')\in \mathbb {R}^n\times \mathbb {R}^n\) and all \(\alpha \in [0,1]\), there holds
considered as an inequality in \(\mathbb {R}\cup \{ +\infty \} \).
The class of such functions is denoted by \(\operatorname {Conv}\mathbb {R}^n\).
(Domain of a Function) The domain (or also effective domain) of \(f\in \text{Conv }\mathbb {R}^n\) is the nonempty set
(Epigraph of a Function) Given \(f:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), not identically equal to \(+\infty \), the epigraph of \(f\) is the nonempty set
Its strict epigraph \(\operatorname {epi}_s f\) is defined likewise, with “\(\ge \)” replaced by “\({\gt}\)” (beware that the word “strict” here has nothing to do with strict convexity).
(Closed Functions) The function \(f:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \) is said to be closed if it is lower semi-continuous everywhere or, equivalently, if its epigraph is closed or, equivalently, if all its sublevel-sets are closed.
The closure (or lower semi-continuous hull) of a function \(f\) is the function \(\operatorname {cl}f:\mathbb {R}^n\to \mathbb {R}\cup \{ \pm \infty \} \) defined by:
or equivalently by
(Image Function) Let \(A:\mathbb {R}^m\to \mathbb {R}^n\) be a linear operator and let \(g:\mathbb {R}^m\to \mathbb {R}\cup \{ +\infty \} \). The image of \(g\) under \(A\) is the function \(Ag:\mathbb {R}^n\to \mathbb {R}\cup \{ \pm \infty \} \) defined by
(here as always, \(\inf \varnothing =+\infty \)).
(Marginal Function) Let \(g\in \operatorname {Conv}(\mathbb {R}^n\times \mathbb {R}^m)\). Then
is the marginal function of \(g\).
Let \(g:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), not identically \(+\infty \), be minorized by an affine function. The common function \(f_1=f_2=f_3\) of Proposition 2.5.1 is called the convex hull of \(g\), denoted by \(\operatorname {co}g\). The closed convex hull of \(g\) is any of the functions described by Proposition 2.5.2; it is denoted by \(\overline{\operatorname {co}}\, g\) or \(\operatorname {cl}\, \operatorname {co}g\).
(Asymptotic function) The function \(f'_{\infty }\) of Proposition 3.2.1 is called the asymptotic function, or recession function, of \(f\).
(Coercivity) The functions \(f\in \operatorname {Conv}\mathbb {R}^n\) satisfying (i), (ii) or (iii) are called \(0\)-coercive. Equivalently, the \(0\)-coercive functions are those that “increase at infinity”:
and closed convex \(0\)-coercive functions achieve their minimum over \(\mathbb {R}^n\).
An important particular case is when \(f'_\infty (d)=+\infty \) for all \(d\neq 0\), i.e. when \(f'_\infty = \iota _{\{ 0\} }\). It can be seen that this means precisely
(to establish this equivalence, extract a cluster point of \((x_k/\| x_k\| )\), and use the lower semi-continuity of \(f'_\infty \)). In words: at infinity, \(f\) increases to infinity faster than any affine function; such functions are called \(1\)-coercive, or sometimes just coercive.
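As a quick worked example (illustrative, not from the text), \(f=\tfrac12\| \cdot \| ^2\) is \(1\)-coercive:

```latex
% Illustrative check: f(x) = (1/2)||x||^2 is 1-coercive.
\[
  \frac{f(x)}{\| x\| }=\frac{\| x\| }{2}\;\longrightarrow\;+\infty
  \quad \text{as }\| x\| \to +\infty ,
\]
% while any affine function a(x) = <s,x> + b has a(x)/||x|| bounded above
% by ||s|| + |b|/||x||; hence f grows faster at infinity than any affine
% function.
```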
A function \(\sigma :\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \) is said to be sublinear if it is convex and positively homogeneous (of degree 1): \(\sigma \in \mathrm{Conv}\, \mathbb {R}^n\) and
(Gauge) Let \(C\) be a closed convex set containing the origin. The function \(\gamma _C\) defined by
is called the gauge of \(C\). As usual, we set \(\gamma _C(x):=+\infty \) if \(x\in \lambda C\) for no \(\lambda {\gt}0\).
(Support Function) Let \(S\) be a nonempty set in \(\mathbb {R}^n\). The function
defined by
is called the support function of \(S\).
(Breadth of a Set) The breadth of a nonempty set \(S\) along \(x\neq 0\) is
a number in \([0,+\infty ]\). It is \(0\) if and only if \(S\) lies entirely in some affine hyperplane orthogonal to \(x\); such a hyperplane is expressed as
which in particular contains \(S\). The intersection of all these hyperplanes is just the affine hull of \(S\).
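A small sketch (the polytope below is an assumed example): for \(S=\operatorname {co}\{ v_1,\dots ,v_k\} \) the support function is \(\sigma _S(x)=\max _i\langle v_i,x\rangle \), and the breadth of \(S\) along \(x\) is \(\sigma _S(x)+\sigma _S(-x)\).

```python
# Sketch: support function and breadth of the unit square [0,1]^2
# (assumed example), computed from its vertices.
V = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]

def sigma(x):
    # support function of co(V): max over vertices of <v, x>
    return max(v[0] * x[0] + v[1] * x[1] for v in V)

breadth_e1 = sigma((1.0, 0.0)) + sigma((-1.0, 0.0))     # breadth along e1
breadth_diag = sigma((1.0, 1.0)) + sigma((-1.0, -1.0))  # breadth along (1,1)
print(breadth_e1, breadth_diag)
```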
Let \(C\) be a nonempty closed convex set, with support function \(\sigma \). For given \(d\neq 0\), the set
is called the exposed face of \(C\) associated with \(d\), or the face exposed by \(d\).
(Directional Derivative) The directional derivative of \(f\) at \(x\) in the direction \(d\) is
The subdifferential \(\partial f(x)\) of \(f\) at \(x\) is the nonempty compact convex set of \(\mathbb {R}^n\) whose support function is \(f'(x,\cdot )\), i.e.
A vector \(s\in \partial f(x)\) is called a subgradient of \(f\) at \(x\).
(Subdifferential II) The subdifferential of \(f\) at \(x\) is the set of vectors \(s\in \mathbb {R}^n\) satisfying
A point \(x\) at which \(\partial f(x)\) has more than one element — i.e. at which \(f\) is not differentiable — is called a kink (or corner-point) of \(f\).
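The simplest kink is \(f(x)=|x|\) at \(x=0\): the subgradient inequality \(f(y)\ge f(x)+\langle s,y-x\rangle \) there reads \(|y|\ge sy\), which holds exactly for \(s\in [-1,1]\). A sketch (the sampled points are an assumption):

```python
# Sketch: f(x) = |x| has the subdifferential [-1, 1] at its kink x = 0.
# s is a subgradient at 0 iff |y| >= s * y for all y.
def is_subgradient_at_zero(s, ys):
    return all(abs(y) >= s * y - 1e-12 for y in ys)

ys = [k / 10.0 for k in range(-50, 51)]
inside = all(is_subgradient_at_zero(s, ys) for s in [-1.0, -0.3, 0.0, 0.8, 1.0])
outside = is_subgradient_at_zero(1.5, ys)  # fails, e.g. at y = 1: 1 >= 1.5 is false
print(inside, outside)
```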
(Grünbaum [1960]). Let \(\mathcal{K}\) be a centered convex set, i.e., \(\int _{x\in \mathcal{K}} x\, dx = 0\), then for any \(w\in \mathbb {R}^n\), \(w\neq 0\), one has
Let \(\mathcal{E}_0=\{ x\in \mathbb {R}^n:(x-c_0)^\top H_0^{-1}(x-c_0)\le 1\} \). For any \(w\in \mathbb {R}^n\), \(w\neq 0\), there exists an ellipsoid \(\mathcal{E}\) such that
and
Furthermore for \(n\ge 2\) one can take \(\mathcal{E}=\{ x\in \mathbb {R}^n:(x-c)^\top H^{-1}(x-c)\le 1\} \) where
Let \(f\) be \(\beta \)-smooth and \(\alpha \)-strongly convex on \(\mathbb {R}^n\). Then for all \(x,y\in \mathbb {R}^n\), one has
Let \(f\) be a \(\beta \)-smooth function on \(\mathbb {R}^n\). Then for any \(x,y\in \mathbb {R}^n\), one has
Let \(f\) be such that (3.4) holds true. Then for any \(x,y\in \mathbb {R}^n\), one has
Let \(x,y\in \mathcal{X}\), \(x^{+}=\Pi _{\mathcal{X}}\bigl(x-\tfrac {1}{\beta }\nabla f(x)\bigr)\), and \(g_{\mathcal{X}}(x)=\beta (x-x^{+})\). Then the following holds true:
Let \(x\in \mathcal{X}\cap \mathcal{D}\) and \(y\in \mathcal{D}\), then
which also implies
Let \(b,s_1,\dots ,s_m\) be given in \(\mathbb {R}^n\). The set
is contained in the set
if and only if (see Definition 1.4.5 of a conical hull)
Let \(b,s_1,\dots ,s_m\) be given in \(\mathbb {R}^n\). Then exactly one of the following statements is true.
The system \(b=\sum _{j=1}^m \alpha _j s_j\) has a solution \(\alpha =(\alpha _1,\dots ,\alpha _m)\) with \(\alpha _j\ge 0\) for all \(j\).
\[ \left\{ \begin{aligned} & \langle b,x\rangle {\gt}0,\\ & \langle s_j,x\rangle \le 0\quad \text{for }j=1,\dots ,m \end{aligned} \right. \]
has a solution \(x\in \mathbb {R}^n\).
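A hedged numerical illustration of the alternative (the vectors are hypothetical data, not from the text):

```python
# Illustration of the Farkas alternative in R^2 with s1 = (1,0), s2 = (0,1).
# For b = (2, 3): b = 2*s1 + 3*s2 is a conical combination (first statement).
# For b = (-1, 0): x = (-1, 0) gives <b,x> = 1 > 0 while <s1,x> <= 0 and
# <s2,x> <= 0 (second statement).
def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

s1, s2 = (1.0, 0.0), (0.0, 1.0)

b1 = (2.0, 3.0)
first_holds = b1 == (2 * s1[0] + 3 * s2[0], 2 * s1[1] + 3 * s2[1])

b2, x = (-1.0, 0.0), (-1.0, 0.0)
second_holds = dot(b2, x) > 0 and dot(s1, x) <= 0 and dot(s2, x) <= 0
print(first_holds, second_holds)
```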
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) and suppose there are \(x_0,\ \delta ,\ m\) and \(M\) such that
Then \(f\) is Lipschitzian on \(B(x_0,\delta )\); more precisely: for all \(y\) and \(y'\) in \(B(x_0,\delta )\),
For \(i=1,\dots ,k\), let \(C_i\subset \mathbb {R}^{n_i}\) be convex sets. Then \(C_1\times \cdots \times C_k\) is a convex set of \(\mathbb {R}^{n_1}\times \cdots \times \mathbb {R}^{n_k}\).
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be an affine mapping and \(C\) a convex set of \(\mathbb {R}^n\). The image \(A(C)\) of \(C\) under \(A\) is convex in \(\mathbb {R}^m\).
If \(D\) is a convex set of \(\mathbb {R}^m\), the inverse image
is convex in \(\mathbb {R}^n\).
The convex hull can also be described as the set of all convex combinations:
The closed convex hull \(\overline{\operatorname {co}}S\) of Definition 1.4.1 is the closure \(\operatorname {cl}(\operatorname {co}S)\) of the convex hull of \(S\).
Let \(S\) be a nonempty compact set such that \(0\notin \operatorname {co}S\). Then
Let the two convex sets \(C_1\) and \(C_2\) satisfy \(\operatorname {ri}C_1\cap \operatorname {ri}C_2\neq \varnothing \). Then
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be an affine mapping and \(C\) a convex set of \(\mathbb {R}^n\). Then
If \(D\) is a convex set of \(\mathbb {R}^m\) satisfying \(A^{-1}(\operatorname {ri}D)\neq \varnothing \), then
If \(\{ C_j\} _{j\in J}\) is a family of closed convex sets having a point in common, then
\[ \left(\bigcap _{j\in J} C_j\right)_\infty = \bigcap _{j\in J}(C_j)_\infty . \]
If, for \(j=1,\dots ,m\), \(C_j\) are closed convex sets in \(\mathbb {R}^{n_j}\), then
\[ (C_1\times \cdots \times C_m)_\infty = (C_1)_\infty \times \cdots \times (C_m)_\infty . \]
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be an affine mapping. If \(C\) is closed convex in \(\mathbb {R}^n\) and \(A(C)\) is closed, then
\[ A(C_\infty )\subset [A(C)]_\infty . \]
If \(D\) is closed convex in \(\mathbb {R}^m\) with nonempty inverse image, then
\[ \bigl[\; A^{-1}(D)\; \bigr]_\infty = A^{-1}(D_\infty ). \]
The tangent cone to \(C\) at \(x\) is the closure of the cone generated by \(C-\{ x\} \):
Here, the \(C\)’s are nonempty closed convex sets.
For \(x\in C_1\cap C_2\), there holds
\[ T_{C_1\cap C_2}(x)\subset T_{C_1}(x)\cap T_{C_2}(x) \qquad \text{and}\qquad N_{C_1\cap C_2}(x)\supset N_{C_1}(x)+N_{C_2}(x). \]
With \(C_i\subset \mathbb R^{n_i}\), \(i=1,2\) and \((x_1,x_2)\in C_1\times C_2\),
\[ T_{C_1\times C_2}(x_1,x_2)=T_{C_1}(x_1)\times T_{C_2}(x_2), \qquad N_{C_1\times C_2}(x_1,x_2)=N_{C_1}(x_1)\times N_{C_2}(x_2). \]
With an affine mapping \(A(x)=y_0+A_0x\) (\(A_0\) linear) and \(x\in C\),
\[ T_{A(C)}[A(x)]=\operatorname {cl}[A_0T_C(x)] \qquad \text{and}\qquad N_{A(C)}[A(x)]=A_0^{-*}[N_C(x)]. \]
In particular (start from (ii), (iii) and proceed as when proving (1.2.2)):
\[ T_{C_1+C_2}(x_1+x_2)=\operatorname {cl}[T_{C_1}(x_1)+T_{C_2}(x_2)], \qquad N_{C_1+C_2}(x_1+x_2)=N_{C_1}(x_1)\cap N_{C_2}(x_2). \]
Let \(f:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \) be not identically equal to \(+\infty \). The three properties below are equivalent:
\(f\) is convex in the sense of Definition 1.1.3;
its epigraph is a convex set in \(\mathbb {R}^n\times \mathbb {R}\);
its strict epigraph is a convex set in \(\mathbb {R}^n\times \mathbb {R}\).
Let \(f\in \text{Conv }\mathbb {R}^n\). The relative interior of \(\operatorname {epi}f\) is the union over \(x\in \operatorname {ri}\, \operatorname {dom}f\) of the open half-lines with bottom endpoints at \(f(x)\):
For \(f:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), the following three properties are equivalent:
\(f\) is lower semi-continuous on \(\mathbb {R}^n\);
\(\operatorname {epi} f\) is a closed set in \(\mathbb {R}^n\times \mathbb {R}\);
the sublevel-sets \(S_r(f)\) are closed (possibly empty) for all \(r\in \mathbb {R}\).
Let \(\{ f_j\} _{j\in J}\) be an arbitrary family of convex [resp. closed convex] functions. If there exists \(x_0\) such that \(\sup _{j} f_j(x_0) {\lt} +\infty \), then their pointwise supremum \(f := \sup \{ f_j : j\in J\} \) is in \(\mathrm{Conv}\, \mathbb {R}^n\) [resp. in \(\mathrm{Conv}_\mathrm {cl}\, \mathbb {R}^n\)].
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) [resp. \(\overline{\operatorname {Conv}}\mathbb {R}^n\)] and let \(A\) be an affine mapping from \(\mathbb {R}^m\) to \(\mathbb {R}^n\) such that \(\operatorname {Im}A\cap \operatorname {dom}f\neq \varnothing \). Then the function
is in \(\operatorname {Conv}\mathbb {R}^m\) [resp. \(\overline{\operatorname {Conv}}\mathbb {R}^m\)].
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) and let \(x'\in \operatorname {ri}\text{dom }f\). Then the closure \(\overline{f}\) of its perspective is given as follows:
Let the functions \(f_1\) and \(f_2\) be in \(\operatorname {Conv}\mathbb {R}^n\). Suppose that they have a common affine minorant: for some \((s,b)\in \mathbb {R}^n\times \mathbb {R}\),
Then their infimal convolution is also in \(\operatorname {Conv}\mathbb {R}^n\).
Let \(g:\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), not identically \(+\infty \), be minorized by some affine function: for some \((s,b)\in \mathbb {R}^n\times \mathbb {R}\),
Then, the following three functions \(f_1, f_2\) and \(f_3\) are convex and coincide on \(\mathbb {R}^n\):
Let \(g\) satisfy the hypotheses of Proposition 2.5.1. Then the three functions below
are closed, convex, and coincide on \(\mathbb {R}^n\) with the closure of the function constructed in Proposition 2.5.1.
For \(f\in \overline{\operatorname {Conv}}\mathbb {R}^n\), the asymptotic cone of \(\operatorname {epi} f\) is the epigraph of the function \(f'_{\infty }\in \overline{\operatorname {Conv}}\mathbb {R}^n\) defined by
where \(x_0\) is arbitrary in \(\operatorname {dom} f\).
A function \(f\in \operatorname {Conv}\mathbb {R}^n\) is Lipschitzian on the whole of \(\mathbb {R}^n\) if and only if \(f'_\infty \) is finite on the whole of \(\mathbb {R}^n\). The best Lipschitz constant for \(f\) is then
For \(f\in \operatorname {Conv}\mathbb {R}^n\) and \(x\in \operatorname {int}\text{dom }f\), the three statements below are equivalent:
The function
\[ \mathbb {R}^n\ni d\mapsto \lim _{t\downarrow 0}\frac{f(x+td)-f(x)}{t} \]
is linear in \(d\);
for some basis of \(\mathbb {R}^n\) in which \(x=(\xi ^1,\dots ,\xi ^n)\), the partial derivatives \(\dfrac {\partial f}{\partial \xi ^i}(x)\) exist at \(x\), for \(i=1,\dots ,n\);
\(f\) is differentiable at \(x\).
A function \(\sigma :\mathbb {R}^n\to \mathbb {R}\cup \{ +\infty \} \), not identically equal to \(+\infty \), is sublinear if and only if one of the following two properties holds:
or
Let \(\{ \sigma _j\} _{j\in J}\) be a family of sublinear functions, all minorized by some linear function. Then, if \(J=\{ 1,\dots ,m\} \) is a finite set, we obtain the infimal convolution
For a nonempty closed convex set \(C\), it holds
where \(X\) can be indifferently taken as: \(\mathbb {R}^n\setminus \{ 0\} \), the unit sphere \(\widetilde{B}\), or \(\operatorname {dom}\sigma _C\setminus \{ 0\} \).
Let \(B\) and \(B^*\) be defined by (3.2.1) and (3.2.2), where \(\| \cdot \| \) is a norm on \(\mathbb {R}^n\). The support function of \(B\) and the gauge of \(B^*\) are the same function \(\| \cdot \| _* \) defined by
Furthermore, \(\| \cdot \| _*\) is a norm on \(\mathbb {R}^n\). The support function of its unit ball \(B^*\) and the gauge of its supported set \(B\) are the same function \(\| \cdot \| \): there holds
Let \(C\) be a closed convex set containing the origin. Its gauge \(\gamma _C\) is the support function of a closed convex set containing the origin, namely
which defines the polar (set) of \(C\).
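A concrete sketch (the example set is an assumption): for \(C=[-1,1]^2\), the sup-norm unit ball, the polar \(C^\circ \) is the 1-norm unit ball, and indeed \(\gamma _C=\sigma _{C^\circ }\).

```python
# Sketch: gauge of C = [-1,1]^2 equals the support function of its polar,
# the 1-norm unit ball C° = co{(±1,0), (0,±1)} (assumed example).
Vpolar = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0)]  # vertices of C°

def gauge_C(x):
    # gauge of the sup-norm ball is the sup-norm itself
    return max(abs(x[0]), abs(x[1]))

def sigma_polar(x):
    # support function of the polytope C°: max over its vertices
    return max(v[0] * x[0] + v[1] * x[1] for v in Vpolar)

pts = [(0.3, -2.0), (1.5, 1.5), (-0.7, 0.2), (0.0, 0.0)]
match = all(abs(gauge_C(p) - sigma_polar(p)) < 1e-12 for p in pts)
print(match)
```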
Let \(C\) be a nonempty compact convex set having \(0\) in its interior, so that \(C^\circ \) enjoys the same properties. Then, for all \(d\) and \(s\) in \(\mathbb {R}^n\), the following statements are equivalent (the notation (3.2.9) is used)
\(H(s)\) is a supporting hyperplane to \(C\) at \(d\);
\(H(d)\) is a supporting hyperplane to \(C^\circ \) at \(s\);
\(d\in \operatorname {bd}C,\ s\in \operatorname {bd}C^\circ \text{ and }\langle s,d\rangle =1\);
\(d\in C,\ s\in C^\circ \text{ and }\langle s,d\rangle =1\).
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be a linear operator, with adjoint \(A^*\) (for some scalar product \(\langle \cdot ,\cdot \rangle \) in \(\mathbb {R}^m\)). For \(S\subset \mathbb {R}^n\) nonempty, we have
Let \(A:\mathbb {R}^m\to \mathbb {R}^n\) be a linear operator, with adjoint \(A^*\) (for some scalar product \(\langle \cdot ,\cdot \rangle \) in \(\mathbb {R}^m\)). Let \(\sigma \) be the support function of a nonempty closed convex set \(S\subset \mathbb {R}^m\). If \(\sigma \) is minorized on the inverse image
of each \(d\in \mathbb {R}^n\), then the support function of the set \((A^{-1})^*(S)\) is the closure of the image-function \(A\sigma \).
A convex-compact-valued and locally bounded multifunction \(F:\mathbb {R}^n\longrightarrow 2^{\mathbb {R}^n}\) is outer [resp. inner] semi-continuous at \(x_0\in \operatorname {int}\text{dom }F\) if and only if its support function \(x\mapsto \sigma _{F(x)}(d)\) is upper [resp. lower] semi-continuous at \(x_0\) for all \(d\) of norm \(1\).
The finite sublinear function \(d\mapsto \sigma (d):=f'(x,d)\) satisfies
A vector \(s\in \mathbb {R}^n\) is a subgradient of \(f\) at \(x\) if and only if \((s,-1)\in \mathbb {R}^n\times \mathbb {R}\) is normal to \(\operatorname {epi}f\) at \((x,f(x))\). In other words:
\[ N_{\operatorname {epi}f}(x,f(x))=\{ (\lambda s,-\lambda ): s\in \partial f(x),\ \lambda \ge 0\} . \]
The tangent cone to the set \(\operatorname {epi}f\) at \((x,f(x))\) is the epigraph of the directional-derivative function \(d\mapsto f'(x,d)\):
\[ T_{\operatorname {epi}f}(x,f(x))=\{ (d,r): r\ge f'(x,d)\} . \]
Let \(g:\mathbb {R}^n\to \mathbb {R}\) be convex and suppose that \(g(x_0){\lt}0\) for some \(x_0\in \mathbb {R}^n\). Then
It follows
A necessary and sufficient condition for a convex function \(f:\mathbb {R}^n\to \mathbb {R}\) to be strictly convex on a convex set \(C\) is: for all \(x_1,x_2\in C\) with \(x_2\neq x_1\),
or equivalently
Let \(x\) and \(d\neq 0\) be given in \(\mathbb {R}^n\). For any sequence \(\{ (t_k,s_k,d_k)\} \subset \mathbb {R}_+^*\times \mathbb {R}^n\times \mathbb {R}^n\) satisfying
and any cluster point \(s\) of \(\{ s_k\} \), there holds
There holds for all \(x\in \mathbb {R}^n\)
It follows that the support function of \(\mathrm{epi}\, f\) has the expression
The functions \(f_j\) appearing below are assumed to satisfy (1.1.1). “Convexity” of the conjugation: if \(\mathrm{dom}\, f_{1}\cap \mathrm{dom}\, f_{2}\neq \varnothing \) and \(\alpha \in ]0,1[\),
Let \(f\) satisfy (1.1.1), let \(H\) be a subspace of \(\mathbb {R}^n\), and call \(p_H\) the operator of orthogonal projection onto \(H\). Suppose that there is a point in \(H\) where \(f\) is finite. Then \(f+\iota _H\) (with \(\iota _H\) the indicator function of \(H\)) satisfies (1.1.1) and its conjugate is
For \(f\) satisfying (1.1.1), let a subspace \(V\) contain the subspace parallel to \(\operatorname {aff\, dom}f\) and set \(U:=V^{\perp }\). For any \(z\in \operatorname {aff\, dom}f\) and any \(s\in \mathbb {R}^{n}\) decomposed as \(s=s_{U}+s_{V}\), there holds
For \(f\) satisfying (1.1.1), the following properties hold:
The function \(g\) of (3.2.1) has the conjugate
where \(p_{H}\) is the operator of orthogonal projection onto \(H\) and \((\cdot )^{-}\) is the Moore–Penrose pseudo-inverse.
At each \(s\in \operatorname {co}\{ s_1,\ldots ,s_k\} =\operatorname {dom}f^*\), the conjugate of \(f\) has the value (\(\Delta _k\) is the unit simplex)
Let \(f\) be \(\beta \)-smooth and \(\alpha \)-strongly convex on \(\mathbb {R}^n\). Then gradient descent with \(\eta =\frac{2}{\alpha +\beta }\) satisfies
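A numerical sketch of this step size (the quadratic instance below is an assumption): on a separable quadratic, each coordinate with curvature \(\mu \) contracts by \(|1-\eta \mu |\le \frac{\beta -\alpha }{\beta +\alpha }\) per iteration, giving linear convergence.

```python
# Sketch (assumed instance): f(x) = 0.5*(x1**2 + 9*x2**2), so alpha = 1,
# beta = 9 and eta = 2/(alpha + beta) = 0.2.  Coordinates contract by
# |1 - eta*1| = 0.8 and |1 - eta*9| = 0.8 per step.
alpha, beta = 1.0, 9.0
eta = 2.0 / (alpha + beta)
x = [4.0, 4.0]
for _ in range(100):
    g = [x[0], 9.0 * x[1]]                       # gradient of f
    x = [x[0] - eta * g[0], x[1] - eta * g[1]]   # gradient step
dist = (x[0] ** 2 + x[1] ** 2) ** 0.5            # distance to the minimizer 0
print(dist)
```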
Let \(\Phi \) be a mirror map \(\rho \)-strongly convex on \(\mathcal{X}\cap \mathcal{D}\) w.r.t. \(\| \cdot \| \). Let \(R^2=\sup _{x\in \mathcal{X}\cap \mathcal{D}}\Phi (x)-\Phi (x_1)\), and \(f\) be convex and L-Lipschitz w.r.t. \(\| \cdot \| \). Then mirror descent with \(\eta =\dfrac {R}{L}\sqrt{\dfrac {2\rho }{t}}\) satisfies
Let \(\Phi \) be a mirror map \(\rho \)-strongly convex on \(\mathcal{X}\cap \mathcal{D}\) w.r.t. \(\| \cdot \| \). Let \(R^2=\sup _{x\in \mathcal{X}\cap \mathcal{D}}\Phi (x)-\Phi (x_1)\), and \(f\) be convex and \(L\)-Lipschitz w.r.t. \(\| \cdot \| \). Then dual averaging with \(\eta =\dfrac {R}{L}\sqrt{\dfrac {\rho }{2t}}\) satisfies
Let \(\Phi \) be a mirror map \(\rho \)-strongly convex on \(\mathcal{X}\cap \mathcal{D}\) w.r.t. \(\| \cdot \| \). Let \(R^{2}=\sup _{x\in \mathcal{X}\cap \mathcal{D}}\Phi (x)-\Phi (x_{1})\), and \(f\) be convex and \(\beta \)-smooth w.r.t. \(\| \cdot \| \). Then mirror prox with \(\eta =\frac{\rho }{\beta }\) satisfies
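For a concrete (made-up) instance of the mirror-descent family: with the negative-entropy mirror map, which is \(1\)-strongly convex on the simplex w.r.t. \(\| \cdot \| _1\), mirror descent becomes the exponentiated-gradient (multiplicative-weights) update. A sketch on a linear objective:

```python
import numpy as np

# Hypothetical data: minimize f(x) = <l, x> over the unit simplex.
# The minimum value is l.min(), attained at the vertex of the smallest loss.
l = np.array([0.3, 0.1, 0.6])
x = np.ones(3) / 3                 # uniform start = argmin of the entropy map
eta = 0.1
T = 5000
avg = np.zeros(3)
for _ in range(T):
    x = x * np.exp(-eta * l)       # exponentiated-gradient update
    x /= x.sum()                   # Bregman projection onto the simplex
    avg += x / T                   # averaged iterate, as in the guarantee
gap = l @ avg - l.min()            # suboptimality of the averaged iterate
```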
Assume that \(f\) has a Lipschitz Hessian, that is
Let \(x^*\) be a local minimum of \(f\) with strictly positive Hessian, that is \(\nabla ^2 f(x^*)\succeq \mu I_n\), \(\mu {\gt}0\). Suppose that the initial starting point \(x_0\) of Newton’s method is such that
Then Newton’s method is well-defined and converges to \(x^*\) at a quadratic rate:
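A one-dimensional sketch of this local behaviour, on the made-up example \(f=\cosh \), whose minimizer \(x^*=0\) has \(f''(0)=1{\gt}0\):

```python
import math

# Newton's method x_{k+1} = x_k - f'(x_k)/f''(x_k) for f = cosh:
# f'(x) = sinh(x), f''(x) = cosh(x), so the step is x - tanh(x).
x = 1.0
errors = [abs(x)]
for _ in range(5):
    x = x - math.sinh(x) / math.cosh(x)
    errors.append(abs(x))
# The error is (at least) squared at each step once x is close to 0.
```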
The function \(f\) is convex and \(\beta \)-smooth, and \(g\) is convex. The rate of convergence of FISTA on \(f+g\) is similar to the one of Nesterov’s accelerated gradient descent on \(f\), more precisely:
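A sketch of FISTA on a small made-up lasso instance, with \(f(x)=\tfrac 12\| Ax-b\| ^2\) (\(\beta \)-smooth, \(\beta =\| A\| _2^2\)) and \(g(x)=\lambda \| x\| _1\), whose proximal map is soft-thresholding:

```python
import numpy as np

# Hypothetical problem data.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 5))
b = rng.standard_normal(20)
lam = 0.1
beta = np.linalg.norm(A, 2) ** 2      # Lipschitz constant of grad f
soft = lambda v, t: np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

x = np.zeros(5)
y = x
t = 1.0
for _ in range(500):
    x_new = soft(y - A.T @ (A @ y - b) / beta, lam / beta)  # prox-gradient step
    t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
    y = x_new + ((t - 1.0) / t_new) * (x_new - x)           # momentum extrapolation
    x, t = x_new, t_new
# At a minimizer, x is a fixed point of the prox-gradient map.
residual = np.max(np.abs(x - soft(x - A.T @ (A @ x - b) / beta, lam / beta)))
```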
(W. Fenchel and L. Bunt) If \(S\subset \mathbb {R}^n\) has no more than \(n\) connected components (in particular, if \(S\) is connected), then any \(x\in \operatorname {co}S\) can be expressed as a convex combination of \(n\) elements of \(S\).
Let \(K\) be a closed convex cone. For the three elements \(x,x_1\) and \(x_2\) in \(\mathbb {R}^n\), the properties below are equivalent:
\(x=x_1+x_2\) with \(x_1\in K\), \(x_2\in K^\circ \) and \(\langle x_1,x_2\rangle =0\);
\(x_1=\mathrm{p}_K(x)\) and \(x_2=\mathrm{p}_{K^\circ }(x)\).
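This is Moreau's decomposition. A quick check for \(K=\mathbb {R}^n_+\), whose polar cone is \(K^\circ =\mathbb {R}^n_-\) (example vector made up):

```python
import numpy as np

# For K the nonnegative orthant, p_K clips negative entries to 0 and
# p_{K°} clips positive entries to 0: the two projections are the
# positive and negative parts of x.
x = np.array([1.5, -2.0, 0.0, 3.0])
x1 = np.maximum(x, 0.0)            # p_K(x)
x2 = np.minimum(x, 0.0)            # p_{K°}(x)
```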
Let \((b,r)\) and \((s_j,\rho _j)\) be given in \(\mathbb {R}^n\times \mathbb {R}\), where \(j\) varies in an (arbitrary) index set \(J\). Suppose that the system of inequalities
has a solution \(x\in \mathbb {R}^n\) (the system is consistent). Then the following two properties are equivalent:
(i) \(\langle b,x\rangle \le r\) for all \(x\) satisfying (4.3.6);
(ii) \((b,r)\) is in the closed convex conical hull of \(S:=\{ (0,1)\} \cup \{ (s_j,\rho _j)\} _{j\in J}\).
(Inequality of Jensen) Let \(f\in \operatorname {Conv}\mathbb {R}^n\). Then, for all collections \(\{ x_1,\dots ,x_k\} \) of points in \(\text{dom }f\) and all \(\alpha =(\alpha _1,\dots ,\alpha _k)\) in the unit simplex of \(\mathbb {R}^k\), there holds (inequality of Jensen in summation form)
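In summation form the inequality reads \(f\bigl(\sum _{i=1}^k \alpha _i x_i\bigr)\le \sum _{i=1}^k \alpha _i f(x_i)\). A numeric spot-check with \(f=\exp \) and random data (an illustration only):

```python
import numpy as np

rng = np.random.default_rng(1)
xs = rng.standard_normal(6)         # arbitrary points in dom f = R
alpha = rng.random(6)
alpha /= alpha.sum()                # alpha now lies in the unit simplex
lhs = np.exp(alpha @ xs)            # f of the convex combination
rhs = alpha @ np.exp(xs)            # convex combination of the values
```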
Any \(f\in \operatorname {Conv}\mathbb {R}^n\) is minorized by some affine function. More precisely: for any \(x_0\in \operatorname {ri}\operatorname {dom}f\), there is \(s\) in the subspace parallel to \(\operatorname {aff}\operatorname {dom}f\) such that
The closure of \(f\in \text{Conv }\mathbb {R}^n\) is the supremum of all affine functions minorizing \(f\):
Let \(C\) be a nonempty subset of \(\mathbb {R}^n\times \mathbb {R}\) satisfying ??, and let its lower-bound function \(\ell _C\) be defined by ??.
If \(C\) is convex, then \(\ell _C\in \operatorname {Conv}\mathbb {R}^n\);
If \(C\) is closed convex, then \(\ell _C\in \overline{\operatorname {Conv}}\mathbb {R}^n\).
Let \(f_1,\dots ,f_m\) be in \(\text{Conv }\mathbb {R}^n\) [resp. in \(\overline{\text{Conv }}\mathbb {R}^n\)], let \(t_1,\dots ,t_m\) be positive numbers, and assume that there is a point where all the \(f_j\)’s are finite. Then the function \(f:=\sum _{j=1}^m t_j f_j\) is in \(\text{Conv }\mathbb {R}^n\) [resp. in \(\overline{\text{Conv }}\mathbb {R}^n\)].
Let (2.4.1) have the following form: \(U=\mathbb {R}^p\); \(\varphi \in \text{Conv }\mathbb {R}^p\); \(X=\mathbb {R}^n\) is equipped with the canonical basis; the mapping \(c\) has its components \(c_j\in \text{Conv }\mathbb {R}^p\) for \(j=1,\dots ,n\). Suppose also that the optimal value is \({\gt}-\infty \) for all \(x\in \mathbb {R}^n\), and that
Then the value function
lies in \(\text{Conv }\mathbb {R}^n\).
Let \(g_1,\dots ,g_m\) be in \(\operatorname {Conv}\mathbb {R}^n\), all minorized by the same affine function. Then the convex hull of their infimum is the function
With \(f\in \operatorname {Conv}\mathbb {R}^n\), let \(S\) be a convex compact subset of \(\operatorname {ri}\text{dom }f\). Then there exists \(L=L(S)\ge 0\) such that
Let the convex functions \(f_k:\mathbb {R}^n\to \mathbb {R}\) converge pointwise for \(k\to +\infty \) to \(f:\mathbb {R}^n\to \mathbb {R}\). Then \(f\) is convex and, for each compact set \(S\), the convergence of \(f_k\) to \(f\) is uniform on \(S\).
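Illustration (example family made up): \(f_k(x)=\sqrt{x^2+1/k}\) is convex and converges pointwise to \(f(x)=|x|\); on the compact set \([-1,1]\) the sup-error is \(1/\sqrt{k}\), attained at \(x=0\), so the convergence is uniform there.

```python
import numpy as np

grid = np.linspace(-1.0, 1.0, 1001)   # a grid on the compact set [-1, 1]

def sup_err(k):
    # sup-norm distance between f_k and f = |.| on the grid
    return float(np.max(np.abs(np.sqrt(grid**2 + 1.0 / k) - np.abs(grid))))
```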
Let \(f_1,\dots ,f_m\) be \(m\) functions of \(\text{Conv }\mathbb {R}^n\), and \(t_1,\dots ,t_m\) be positive numbers. Assume that there is \(x_0\) at which each \(f_j\) is finite. Then,
\[ \text{for } f:=\sum _{j=1}^m t_j f_j,\qquad \text{we have } f'_\infty =\sum _{j=1}^m t_j(f_j)'_\infty . \]
Let \(\{ f_j\} _{j\in J}\) be a family of functions in \(\text{Conv }\mathbb {R}^n\). Assume that there is \(x_0\) at which \(\sup _{j\in J} f_j(x_0){\lt}+\infty \). Then,
\[ \text{for } f:=\sup _{j\in J} f_j,\qquad \text{we have } f'_\infty =\sup _{j\in J}(f_j)'_\infty . \]
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be affine with linear part \(A_0\), and let \(f\in \text{Conv }\mathbb {R}^m\). Assume that \(A(\mathbb {R}^n)\cap \text{dom }f\neq \emptyset \). Then \((f\circ A)'_\infty =f'_\infty \circ A_0\).
Let \(f\) be a function differentiable on an open set \(\Omega \subset \mathbb {R}^n\), and let \(C\) be a convex subset of \(\Omega \). Then
\(f\) is convex on \(C\) if and only if
\[ f(x)\ge f(x_0)+\langle \nabla f(x_0),\, x-x_0\rangle \quad \text{for all }(x_0,x)\in C\times C; \tag {4.1.1} \]
\(f\) is strictly convex on \(C\) if and only if strict inequality holds in (4.1.1) whenever \(x\neq x_0\);
\(f\) is strongly convex with modulus \(c\) on \(C\) if and only if, for all \((x_0,x)\in C\times C\),
\[ f(x)\ge f(x_0)+\langle \nabla f(x_0),x-x_0\rangle +\tfrac {1}{2}c\| x-x_0\| ^2. \tag {4.1.2} \]
Let \(f\) be a function differentiable on an open set \(\Omega \subset \mathbb {R}^n\), and let \(C\) be a convex subset of \(\Omega \). Then, \(f\) is convex [resp. strictly convex, resp. strongly convex with modulus \(c\)] on \(C\) if and only if its gradient \(\nabla f\) is monotone [resp. strictly monotone, resp. strongly monotone with modulus \(c\)] on \(C\).
Let \(f\in \operatorname {Conv}\mathbb {R}^n\). The subset of \(\operatorname {int}\operatorname {dom}f\) where \(f\) fails to be differentiable is of zero (Lebesgue) measure.
Let \(f\) be twice differentiable on an open convex set \(\Omega \subset \mathbb {R}^n\). Then
\(f\) is convex on \(\Omega \) if and only if \(\nabla ^2 f(x_0)\) is positive semi-definite for all \(x_0\in \Omega \);
if \(\nabla ^2 f(x_0)\) is positive definite for all \(x_0\in \Omega \), then \(f\) is strictly convex on \(\Omega \);
\(f\) is strongly convex with modulus \(c\) on \(\Omega \) if and only if the smallest eigenvalue of \(\nabla ^2 f(\cdot )\) is minorized by \(c\) on \(\Omega \): for all \(x_0\in \Omega \) and all \(d\in \mathbb {R}^n\),
\[ \langle \nabla ^2 f(x_0)d,d\rangle \ge c\| d\| ^2. \]
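For a quadratic \(f(x)=\tfrac 12 x^\top Qx\) the Hessian is \(Q\) everywhere, so the strong-convexity modulus is \(\lambda _{\min }(Q)\); a numeric check of the gradient inequality (4.1.2) on made-up data:

```python
import numpy as np

Q = np.array([[3.0, 1.0], [1.0, 2.0]])
c = float(np.linalg.eigvalsh(Q)[0])   # smallest Hessian eigenvalue
f = lambda x: 0.5 * x @ Q @ x
grad = lambda x: Q @ x

rng = np.random.default_rng(2)
# Check f(x) >= f(x0) + <grad f(x0), x-x0> + (c/2)||x-x0||^2 at random pairs.
ok = all(
    f(x) >= f(x0) + grad(x0) @ (x - x0) + 0.5 * c * (x - x0) @ (x - x0) - 1e-9
    for x0, x in (rng.standard_normal((2, 2)) for _ in range(100))
)
```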
Let \(C\) be a closed convex set containing the origin. Then
its gauge \(\gamma _C\) is a nonnegative closed sublinear function;
\(\gamma _C\) is finite everywhere if and only if \(0\) lies in the interior of \(C\);
\(C_\infty \) being the asymptotic cone of \(C\),
\[ \{ x\in \mathbb {R}^n:\ \gamma _C(x)\le r\} =rC\quad \text{for all }r{\gt}0, \qquad \{ x\in \mathbb {R}^n:\ \gamma _C(x)=0\} =C_\infty . \]
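Illustration on a made-up set: for \(C=[-1,1]^n\), a closed convex set with \(0\) in its interior, the gauge is \(\gamma _C(x)=\| x\| _\infty \), which is finite everywhere; \(\{ \gamma _C\le r\} =rC\), and \(\{ \gamma _C=0\} =\{ 0\} =C_\infty \) since \(C\) is bounded.

```python
import numpy as np

# Gauge of the unit cube C = [-1, 1]^n: the sup-norm.
gamma = lambda x: float(np.max(np.abs(x)))
x = np.array([0.5, -2.0, 1.0])
```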
For \(\sigma _1\) and \(\sigma _2\) in the set \(\Phi \) of sublinear functions that are finite everywhere, define
Then \(\Delta \) is a distance on \(\Phi \).
Let \((\sigma _k)\) be a sequence of finite sublinear functions and let \(\sigma \) be a finite function. Then the following are equivalent when \(k\to +\infty \):
\((\sigma _k)\) converges pointwise to \(\sigma \);
\((\sigma _k)\) converges to \(\sigma \) uniformly on each compact set of \(\mathbb {R}^n\);
\(\Delta (\sigma _k,\sigma )\to 0\).
For the nonempty \(S\subset \mathbb {R}^n\) and its support function \(\sigma _S\), there holds
where the set \(X\) can equally be taken as: the whole of \(\mathbb {R}^n\), the unit ball \(B(0,1)\) or its boundary the unit sphere \(\tilde{B}\), or \(\operatorname {dom}\sigma _S\).
Let \(S\) be a nonempty closed convex set in \(\mathbb {R}^n\). Then
\(s\in \operatorname {aff}S\) if and only if
\[ \langle s,d\rangle =\sigma _S(d)\quad \text{for all }d\text{ with }\sigma _S(d)+\sigma _S(-d)=0; \tag {2.2.3} \]
\(s\in \operatorname {ri}S\) if and only if
\[ \langle s,d\rangle {\lt}\sigma _S(d)\quad \text{for all }d\text{ with }\sigma _S(d)+\sigma _S(-d){\gt}0; \tag {2.2.4} \]
in particular, \(s\in \operatorname {int}S\) if and only if
\[ \langle s,d\rangle {\lt}\sigma _S(d)\quad \text{for all }d\neq 0. \tag {2.2.5} \]
Let \(\sigma \) be a closed sublinear function; then there is a linear function minorizing \(\sigma \). In fact, \(\sigma \) is the supremum of the linear functions minorizing it. In other words, \(\sigma \) is the support function of the nonempty closed convex set
Let \(S\) and \(S'\) be two nonempty compact convex sets of \(\mathbb {R}^n\). Then
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be convex and suppose \(0\notin \partial f(x)\). Then, \(S_{f}(x)\) being the sublevel-set (1.3.1),
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be convex and suppose \(0\notin \partial f(x)\). Then a direction \(d\) is normal to \(S_f(x)\) at \(x\) if and only if there exist \(t\ge 0\) and \(s\in \partial f(x)\) such that \(d=ts\):
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be convex. Given two points \(x\neq y\) in \(\mathbb {R}^n\), there exist \(t\in ]0,1[\) and \(s\in \partial f(x_t)\) such that
In other words,
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be convex. For \(x,y\in \mathbb {R}^n\),
Let \(A:\mathbb {R}^n\to \mathbb {R}^m\) be an affine mapping ( \(Ax = A_0x + b\), with \(A_0\) linear and \(b\in \mathbb {R}^m\)) and let \(g\) be a finite convex function on \(\mathbb {R}^m\). Then
Let \(f_1,\dots ,f_m\) be \(m\) convex functions from \(\mathbb {R}^n\) to \(\mathbb {R}\) and define
Denoting by \(I(x) := \{ i : f_i(x) = f(x)\} \) the active index-set, we have
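The standard formula here is \(\partial f(x)=\operatorname {co}\{ \nabla f_i(x): i\in I(x)\} \), so that \(f'(x;d)=\max _{i\in I(x)}\langle \nabla f_i(x),d\rangle \). A finite-difference spot-check with two linear pieces (made-up data):

```python
import numpy as np

a1 = np.array([1.0, 0.0])
a2 = np.array([0.0, 1.0])
f = lambda x: max(a1 @ x, a2 @ x)   # max of two linear (hence convex) pieces

x = np.array([2.0, 2.0])            # both pieces active at x: I(x) = {1, 2}
d = np.array([0.3, -0.7])
t = 1e-6
fd = (f(x + t * d) - f(x)) / t      # one-sided difference quotient ~ f'(x; d)
formula = max(a1 @ d, a2 @ d)       # max over the active gradients
```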
With the notation (4.4.1), (4.4.2), assume that \(J\) is a compact set (in some metric space), on which the functions \(j\mapsto f_j(x)\) are upper semi-continuous for each \(x\in \mathbb {R}^n\). Then
With the notation (4.5.1), (4.5.2), assume \(A\) is surjective. Let \(x\) be such that \(Y(x)\) is nonempty. Then, for arbitrary \(y\in Y(x)\),
(and this set is thus independent of the particular optimal \(y\)).
With the notations (5.3.1), (5.3.2), suppose \(\varphi _{0}\notin H\). A necessary and sufficient condition for \(\bar{x}=(\bar{\xi }^{1},\dots ,\bar{\xi }^{n})\in \mathbb {R}^{n}\) to minimize \(f\) of (5.3.1) is that, for some positive integer \(p\le n+1\), there exist \(p\) points \(t_{1},\dots ,t_{p}\) in \(T\), \(p\) integers \(\varepsilon _{1},\dots ,\varepsilon _{p}\) in \(\{ -1,+1\} \) and \(p\) positive numbers \(\alpha _{1},\dots ,\alpha _{p}\) such that
(or equivalently: \(\displaystyle \sum _{k=1}^{p}\alpha _k\varepsilon _k\psi (t_k)=0\quad \text{for all }\psi \in H\)). \(\square \)
A necessary and sufficient condition for a convex function \(f:\mathbb {R}^n\to \mathbb {R}\) to be strongly convex with modulus \(c{\gt}0\) on a convex set \(C\) is: for all \(x_1,x_2\) in \(C\),
or equivalently
The subdifferential mapping of a convex function \(f:\mathbb {R}^n\to \mathbb {R}\) is outer semi-continuous at any \(x\in \mathbb {R}^n\), i.e.
Let \((f_k)\) be a sequence of (finite) convex functions converging pointwise to \(f:\mathbb {R}^n\to \mathbb {R}\) and let \((x_k)\) converge to \(x\in \mathbb {R}^n\). For any \(\varepsilon {\gt}0\),
For \(f\) satisfying (1.1.1), the function \(f^{**}\) of (1.3.2) is the pointwise supremum of all the affine functions on \(\mathbb {R}^n\) majorized by \(f\). In other words
Take \(g\in \text{Conv }\mathbb {R}^m\), \(A_0\) linear from \(\mathbb {R}^n\) to \(\mathbb {R}^m\) and consider the affine operator \(A(x):=A_0x+y_0\in \mathbb {R}^m\). Suppose that \(A(\mathbb {R}^n)\cap \text{dom }g\neq \emptyset \). Then \(g\circ A\in \text{Conv }\mathbb {R}^n\) and its conjugate is the closure of the convex function
Take \(g\in \operatorname {Conv}\mathbb {R}^m\), \(A_0\) linear from \(\mathbb {R}^n\) to \(\mathbb {R}^m\) and consider the affine operator \(A(x):=A_0x+y_0\in \mathbb {R}^m\). Make the following assumption:
Then, for every \(s\in \operatorname {dom}(g\circ A_0)^*\), the problem
has at least one optimal solution \(\bar p\) and there holds \((g\circ A)^*(s)=g^*(\bar p)-\langle \bar p,y_0\rangle \).
Let \(g_1,g_2\) be in \(\operatorname {Conv}\mathbb {R}^n\) and assume that \(\operatorname {dom}g_1\cap \operatorname {dom}g_2\neq \varnothing \). The conjugate \((g_1+g_2)^*\) of their sum is the closure of the convex function \(g_1^*\mathbin {\square }g_2^*\).
Let \(g_1,g_2\) be in \(\text{Conv }\mathbb {R}^n\) and assume that
Then \((g_1+g_2)^* = g_1^*\mathbin {\square }g_2^*\) and, for every \(s\in \text{dom }(g_1+g_2)^*\), the problem
has at least one optimal solution \((\bar p,\bar q)\), which therefore satisfies
Let \(\{ f_j\} _{j\in J}\) be a collection of functions satisfying (1.1.1) and having a common affine minorant: \(\sup _{j\in J} f_j^*(s) {\lt} +\infty \) for some \(s\in \mathbb {R}^n\). Then their infimum \(f := \inf _{j\in J} f_j\) satisfies (1.1.1), and its conjugate is the supremum of the \(f_j^*\)’s:
Let \(\{ g_j\} _{j\in J}\) be a collection of functions in \(\text{Conv }\mathbb {R}^n\). If their supremum \(g:=\sup _{j\in J} g_j\) is not identically \(+\infty \), it is in \(\text{Conv }\mathbb {R}^n\), and its conjugate is the closed convex hull of the \(g_j^*\)’s:
With \(f\) and \(g\) defined as above, assume that \(f(\mathbb R^n)\cap \operatorname {int}\operatorname {dom}g\neq \varnothing \). For all \(s\in \operatorname {dom}(g\circ f)^*\), define the function \(\psi _s\in \operatorname {Conv}\mathbb R\) by
Then \((g\circ f)^*(s)=\min _{\alpha \in \mathbb R}\psi _s(\alpha )\).
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) be strictly convex. Then \(\operatorname {int}\text{dom }f^*\neq \varnothing \) and \(f^*\) is continuously differentiable on \(\operatorname {int}\text{dom }f^*\).
Let \(f\in \operatorname {Conv}\mathbb {R}^n\) be differentiable on the set \(\Omega :=\operatorname {int}\operatorname {dom}f\). Then \(f^*\) is strictly convex on each convex subset \(C\subset \nabla f(\Omega )\).
Assume that \(f:\mathbb {R}^n\to \mathbb {R}\) is strongly convex with modulus \(c{\gt}0\) on \(\mathbb {R}^n\): for all \((x_1,x_2)\in \mathbb {R}^n\times \mathbb {R}^n\) and \(\alpha \in ]0,1[\),
Then \(\operatorname {dom} f^*=\mathbb {R}^n\) and \(\nabla f^*\) is Lipschitzian with constant \(1/c\) on \(\mathbb {R}^n\):
Let \(f:\mathbb {R}^n\to \mathbb {R}\) be convex and have a gradient-mapping Lipschitzian with constant \(L{\gt}0\) on \(\mathbb {R}^n\): for all \((x_1,x_2)\in \mathbb {R}^n\times \mathbb {R}^n\),
Then \(f^*\) is strongly convex with modulus \(1/L\) on each convex subset \(C\subset \operatorname {dom}\partial f^*\). In particular, there holds for all \((x_1,x_2)\in \mathbb {R}^n\times \mathbb {R}^n\)
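A numeric look at the Lipschitz constant \(1/c\) of \(\nabla f^*\) in the strong-convexity statement above (one-dimensional, made-up example): \(f(x)=\tfrac{c}{2}x^2+|x|\) is \(c\)-strongly convex, and \((f^*)'(s)=\operatorname {argmax}_x\, (sx-f(x))\), which we approximate on a fine grid.

```python
import numpy as np

c = 2.0
xs = np.linspace(-10.0, 10.0, 200001)
fx = 0.5 * c * xs**2 + np.abs(xs)                         # f on the grid
grad_conj = lambda s: float(xs[np.argmax(s * xs - fx)])   # ~ (f*)'(s)

s1, s2 = 0.5, 3.0
lip_ratio = abs(grad_conj(s1) - grad_conj(s2)) / abs(s1 - s2)
# lip_ratio should not exceed 1/c (up to grid resolution).
```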