Subdifferentials of convex functions
1 Introduction
Whenever we speak about a vector space in this note we mean a vector space over . If is a topological vector space then we denote by the set of all continuous linear maps . is called the dual space of , and is itself a vector space.11 1 In this note, we are following the presentation of some results in Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., chapter 7. Three other sources for material on subdifferentials are: Jean-Paul Penot, Calulus Without Derivatives, chapter 3; Viorel Barbu and Teodor Precupanu, Convexity and Optimization in Banach Spaces, fourth ed., §2.2, pp. 82–125; and Jean-Pierre Aubin, Optima and Equilibria: An Introduction to Nonlinear Analysis, second ed., chapter 4, pp. 57–73.
2 Definition of subdifferential
If is a topological vector space, is a function, , and , then we say that is a subgradient of at if22 2 , , and is nonsense; if , then and .
The subdifferential of at is the set of all subgradients of at and is denoted by . Thus is a function from to the power set of , i.e. . If , we say that is subdifferentiable at .
It is immediate that if there is some such that , then
Thus, little is lost if we prove statements about subdifferentials of functions that do not take the value .
Theorem 1.
If is a topological vector space, is a function and , then is a convex subset of .
Proof.
If and , then of course . For any we have
showing that and thus that is convex. ∎
To say that is equivalent to saying that for all and so . This can be said in the following way.
Lemma 2.
If is a topological vector space and is a function, then is a minimizer of if and only if .
3 Convex functions
If is a set and is a function, then the epigraph of is the set
and the effective domain of is the set
To say that is equivalent to saying that there is some such that . We say that is finite if for all .
If is a vector space and is a function, then we say that is convex if is a convex subset of the vector space .
If is a set and is a function, we say that is proper if it does not take only the value and never takes the value . It is unusual to talk merely about proper functions rather than proper convex functions; we do so to make clear how convexity is used in the results we prove.
4 Weak-* topology
Let be a topological vector space and for define by . The weak-* topology on is the initial topology for the set of functions , that is, the coarsest topology on such that for each , the function is continuous.
Lemma 3.
If is a topological vector space, is the weak-* topology on , and is the subspace topology on inherited from with the product topology, then .
Proof.
Let converge in to . For each , the function is continuous, so , i.e. . But for to converge to means that for each , we have . Thus converges to in . This shows that .
Let , and let converge in to . We then have ; since was an arbitrary net that converges in , this shows that is continuous. Thus, we have shown that for each , the function is continuous. But is the coarsest topology for which is continuous for all , so we obtain . ∎
In other words, the weak-* topology on is the topology of pointwise convergence. We now prove that at each point in the effective domain of a proper function on a topological vector space, the subdifferential is a weak-* closed subset of the dual space.33 3 cf. Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., p. 265, Theorem 7.13.
Theorem 4.
If is a topological vector space, is a proper function, and , then is a weak-* closed subset of .
Proof.
If , then for all we have
so, for any , using ,
or,
this makes sense because is finite. On the other hand, let . If for all , then , i.e. , and so . Therefore
(1) |
Defining for by , for each we have
Because is continuous, this inverse image is a closed subset of . Therefore, each of the sets in the intersection (1) is a closed subset of , and so is a closed subset of . ∎
5 Support points
If is set, is a subset of , and is a function, we say that is a minimizer of over if
and that is a maximizer of over if
If is a nonempty subset of a topological vector space and , we say that is a support point of if there is some nonzero for which is a minimizer or a maximizer of over . Moreover, is a minimizer of over if and only if is a maximizer of over . Thus, if we know that is a support point of a set , then we have at our disposal both that is a minimizer of some nonzero element of over and that is a maximizer of some nonzero element of over .
If is a support point of and is not contained in the hyperplane , we say that is properly supported at . To say that is not contained in the set is equivalent to saying that there is some such that .
In the following lemma, we show that the support points of a set are contained in the boundary of the set.
Lemma 5.
If is a topological vector space, is a subset of , and is a support point of , then .
Proof.
Because is a support point of there is some nonzero for which is a maximizer of over :
As is nonzero there is some with . For any ,
hence if then . But as and , showing that . ∎
The following lemma gives conditions under which a boundary point of a set is a proper support point of the set.44 4 Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., p. 259, Lemma 7.7.
Lemma 6.
If is a topological vector space, is a convex subset of that has nonempty interior, and , then is properly supported at .
Proof.
The Hahn-Banach separation theorem55 5 Gert K. Pedersen, Analysis Now, revised printing, p. 65, Theorem 2.4.7. tells us that if and are disjoint nonempty convex subsets of and is open then there is some and some such that
Check that the interior of a convex set in a topological vector space is convex, and hence that we can apply the Hahn-Banach separation theorem to and : as belongs to the boundary of it does not belong to the interior of , so and are disjoint nonempty convex sets. Thus, there is some and some such that for all , from which it follows that for all , and because of the strict inequality for the interior. As , this means that is a maximizer of over , and as this means that is a support point of . But is nonempty and if then , hence is a proper support point of . ∎
6 Subdifferentials of convex functions
If is a proper function then there is some for which , and for to have a subgradient at demands that , and hence that . Therefore, if is a proper function then the set of at which is subdifferentiable is a subset of .
We now prove conditions under which a function is subdifferentiable at a point, i.e., under which the subdifferential at that point is nonempty.66 6 Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., p. 265, Theorem 7.12.
Theorem 7.
If is a topological vector space, is a proper convex function, is an interior point of , and is continuous at , then has a subgradient at .
Proof.
Because is convex, the set is convex, and the interior of a convex set in a topological vector space is convex so is convex. is proper so it does not take the value , and on it does not take the value , hence is finite on . But for a finite convex function on an open convex set in a topological vector space, being continuous at a point is equivalent to being continuous on the set, and is also equivalent to being bounded above on an open neighborhood of the point.77 7 Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., p. 188, Theorem 5.43. Therefore, is continuous on and is bounded above on some open neighborhood of contained in , say for all . is an open subset of , and is contained in . This shows that has nonempty interior. Since , if then , and since we have , and therefore . We can now apply Lemma 6: is a convex subset of the topological vector space with nonempty interior and , so is properly supported at . That is, Lemma 6 shows that there is some such that
and there is some for which . Now, there is some and some such that for all . Thus, there is some nonzero and some such that
If then the right-hand side would be while the left-hand side is constant and , so . Suppose by contradiction that . Then for all , and as this means that is a support point of , and then by Lemma 5 we have that , contradicting . Hence , so
i.e.,
Furthermore, if then , for which the above inequality is true. Therefore, for all , showing that is a subgradient of at . ∎
7 Directional derivatives
Lemma 8.
If is a vector space, is a proper convex function, , , and , then
Proof.
We have
and because is convex this gives
i.e.
Dividing by ,
∎
If is a proper convex function, , and , then the above lemma shows that
is an increasing function , and therefore that
exists; it belongs to , and if there is at least one for which then the limit will be . We define the one-sided directional derivative of at to be the function defined by88 8 We are following the notation of Charalambos D. Aliprantis and Kim C. Border, Infinite Dimensional Analysis: A Hitchhiker’s Guide, third ed., p. 266.
Lemma 9.
If is a topological vector space, is a proper convex function, , is continuous at , and , then .
Proof.
Because , there is some for which and hence for which . This implies that .
Let . By Theorem 7, the subdifferential is nonempty, i.e. there is some for which for all . Thus, for all we have, with ,
i.e.,
Since this difference quotient is bounded below by , its limit as is , and therefore . ∎