Rigorously, a subderivative of a convex function f : I → R {\displaystyle f:I\to \mathbb {R} } at a point Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle x_0} in the open interval I {\displaystyle I} is a real number c {\displaystyle c} such that f ( x ) − f ( x 0 ) ≥ c ( x − x 0 ) {\displaystyle f(x)-f(x_{0})\geq c(x-x_{0})} for all x ∈ I {\displaystyle x\in I} . By the converse of the mean value theorem, the set of subderivatives at x 0 {\displaystyle x_{0}} for a convex function is a nonempty closed interval [ a , b ] {\displaystyle [a,b]} , where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle a} and b {\displaystyle b} are the one-sided limits a = lim x → x 0 − f ( x ) − f ( x 0 ) x − x 0 , {\displaystyle a=\lim _{x\to x_{0}^{-}}{\frac {f(x)-f(x_{0})}{x-x_{0}}},} b = lim x → x 0 + f ( x ) − f ( x 0 ) x − x 0 . {\displaystyle b=\lim _{x\to x_{0}^{+}}{\frac {f(x)-f(x_{0})}{x-x_{0}}}.} The interval [ a , b ] {\displaystyle [a,b]} of all subderivatives is called the subdifferential of the function f {\displaystyle f} at x 0 {\displaystyle x_{0}} , denoted by ∂ f ( x 0 ) {\displaystyle \partial f(x_{0})} . If f {\displaystyle f} is convex, then its subdifferential at any point is non-empty. Moreover, if its subdifferential at x 0 {\displaystyle x_{0}} contains exactly one subderivative, then f {\displaystyle f} is differentiable at x 0 {\displaystyle x_{0}} and ∂ f ( x 0 ) = { f ′ ( x 0 ) } {\displaystyle \partial f(x_{0})=\{f'(x_{0})\}} .2
Consider the function f ( x ) = | x | {\displaystyle f(x)=|x|} which is convex. Then, the subdifferential at the origin is the interval [ − 1 , 1 ] {\displaystyle [-1,1]} . The subdifferential at any point x 0 < 0 {\displaystyle x_{0}<0} is the singleton set { − 1 } {\displaystyle \{-1\}} , while the subdifferential at any point x 0 > 0 {\displaystyle x_{0}>0} is the singleton set { 1 } {\displaystyle \{1\}} . This is similar to the sign function, but is not single-valued at 0 {\displaystyle 0} , instead including all possible subderivatives.
The concepts of subderivative and subdifferential can be generalized to functions of several variables. If Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle f:U\to\mathbb{R}} is a real-valued convex function defined on a convex open set in the Euclidean space R n {\displaystyle \mathbb {R} ^{n}} , a vector v {\displaystyle v} in that space is called a subgradient at x 0 ∈ U {\displaystyle x_{0}\in U} if for any x ∈ U {\displaystyle x\in U} one has that
where the dot denotes the dot product. The set of all subgradients at Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle x_0} is called the subdifferential at x 0 {\displaystyle x_{0}} and is denoted ∂ f ( x 0 ) {\displaystyle \partial f(x_{0})} . The subdifferential is always a nonempty convex compact set.
These concepts generalize further to convex functions f : U → R {\displaystyle f:U\to \mathbb {R} } on a convex set in a locally convex space V {\displaystyle V} . A functional v ∗ {\displaystyle v^{*}} in the dual space V ∗ {\displaystyle V^{*}} is called a subgradient at x 0 {\displaystyle x_{0}} in U {\displaystyle U} if for all Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle x\in U} ,
The set of all subgradients at x 0 {\displaystyle x_{0}} is called the subdifferential at Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle x_0} and is again denoted ∂ f ( x 0 ) {\displaystyle \partial f(x_{0})} . The subdifferential is always a convex closed set. It can be an empty set; consider for example an unbounded operator, which is convex, but has no subgradient. If Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "http://localhost:6011/en.wikipedia.org/v1/":): {\displaystyle f} is continuous, the subdifferential is nonempty.
The subdifferential on convex functions was introduced by Jean Jacques Moreau and R. Tyrrell Rockafellar in the early 1960s. The generalized subdifferential for nonconvex functions was introduced by Francis H. Clarke and R. Tyrrell Rockafellar in the early 1980s.4
Bubeck, S. (2014). Theory of Convex Optimization for Machine Learning. ArXiv, abs/1405.4980. ↩
Rockafellar, R. T. (1970). Convex Analysis. Princeton University Press. p. 242 [Theorem 25.1]. ISBN 0-691-08069-0. 0-691-08069-0 ↩
Lemaréchal, Claude; Hiriart-Urruty, Jean-Baptiste (2001). Fundamentals of Convex Analysis. Springer-Verlag Berlin Heidelberg. p. 183. ISBN 978-3-642-56468-0. 978-3-642-56468-0 ↩
Clarke, Frank H. (1983). Optimization and nonsmooth analysis. New York: John Wiley & Sons. pp. xiii+308. ISBN 0-471-87504-X. MR 0709590. 0-471-87504-X ↩