Convex Functions, Partial Orderings, and Statistical Applications

In Mathematics in Science and Engineering, 1992

1.25 Remark

Let U be an open convex set in L, x 0U, yL, and define the function g(t) = f(x 0 + ty) where t ∈ (a, b) such that x 0 + tyU for all t. The function f: U is said to be convex if g(t) is a convex function on (a, b) (Roberts and Varberg, 1973, p. 91).

1.26 Theorem

(a) Let f be convex on an open set UL. If f is bounded from above in a neighborhood of a point in U, then f is locally Lipschitz in U, hence Lipschitz on any compact subset of U, and f is continuous on U.

(b) If f is convex on the open set U n , then f is Lipschitz on every compact subset of U and continuous on U.

Proof

See Roberts and Varberg (1973, pp. 93–94).

1.27 Theorem

(a) Assume that f is defined on the open convex set UL. If f is convex on U and Fréchet differentiable at x 0, then, for xU,

(1.26) f ( x ) f ( x 0 ) f ( x 0 ) ( x x 0 )

holds. If f is differentiable throughout U, then f is convex iff (1.26) holds for all x, x 0U. Furthermore, f is strictly convex iff the inequality is strict.

(b) Let f: U be continuous and Fréchet differentiable on the open set UL. Then f is (strictly) convex if f′ is (strictly) increasing on U.

(c) Let f be continuously differentiable and suppose that the second derivative exists throughout an open set UL. Then f is convex on U iff f″(x) is nonnegative definite for every xU. Furthermore, if f″(x) is positive definite on U, then f is strictly convex.

Proof

See Roberts and Varberg (1973, pp. 98–101).

1.28 Remarks

(a) We shall say that f′ is increasing on U if for x, yU we have

( f ( x ) f ( y ) ) ( x y ) 0 ,

and that f′ is strictly increasing on U if this inequality is strict for all xy.

(b) A function f: UM (UL; L, M are normed linear vector spaces) is Fréchet differentiable at x 0 (x 0U) if there exists a linear transformation T: LM such that

lim x x 0 f ( x ) f ( x 0 ) T ( x x 0 ) x x 0 = 0 ,

which is equivalent to

f ( x ) = f ( x 0 ) + T ( x x 0 ) + o ( x x 0 )

as xx 0. The linear transformation T is called the Fréchet derivative and is denoted by f′(x 0).

(c) A similar derivative of the Fréchet derivative is called the second Fréchet derivative. This derivative is a symmetric bilinear transformation defined on L x L, i.e., f k, h (x) = f h, k (x) (h, kL). Note that if f: U is continuously differentiable on the open convex set UL and f″(x) exists throughout U, then for any x, x 0U there is an s ∈ (0, 1) such that

(1.27) f ( x ) = f ( x 0 ) + f h ' ( x 0 ) + 1 2 f h , h " ( x 0 + s h ) ,

where h = x − x 0.

(d) A symmetric bilinear transformation B(h, k) defined on L x L is positive (nonnegative) definite if for every hL (h ≠ 0), we have

B ( h , h ) > 0 ( B ( h , h ) 0 ) .

(e) The following definition is also valid: A (continuous real-valued) function f is operator convex on (λ, v) if fa + βb) ≤ αf(a) + βf(b) for positive reals α, β such that α + β = 1 and operators a, b with their spectra in (λ, v). (See Davis, 1957, for a brief survey of operator functions and Ando, 1978, for further comments on classes of operator functions.)

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S0076539208628131

Computational Theory of Iterative Methods

In Studies in Computational Mathematics, 2007

Definition 8.2.4

Let S⊆ ℝ n be an open convex set, and let G:S→ ℝ n be a differentiable operator on S. Let x 0S, and assume:

(8.2.14) G ( x 0 ) = I ( t h e i d e n t i t y m a t r i x )

there exists η ≥ 0 such that

(8.2.15) | | G ( x 0 ) | | η ;

there exists an0≥ 0 such that

(8.2.16) | | G ( x ) G ( x 0 ) | | 0 | | x x 0 | | for all x S .

Define:

(8.2.17) h 0 = 2 0 η .

We say that G satisfies the weak center-Kantorovich conditions in x 0 if

(8.2.18) h 0 1.

We also say that G satisfies the strong center-Kantorovich conditions in x 0 if

(8.2.19) h 0 c 2 2 .

Moreover define

(8.2.20) r 1 = c c 2 h 0 0 ,

(8.2.21) r 2 = c + c 2 h 0 0

and

(8.2.22) R = [ r 1 , r 2 ] for 0 0.

Furthermore if0= 0,define

(8.2.23) r 1 = η c

and

(8.2.24) R = [ r 1 , ) .

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S1570579X07800280

THE FIXED-POINT PRINCIPLE

L.V. KANTOROVICH , G.P. AKILOV , in Functional Analysis (Second Edition), 1982

THEOREM 1.

Let X be a Bspace, K a non-empty compact convex set in X, and let f: K → 2 K be a many-valued mapping satisfying the conditions:

1)

for each x K , the set f (x) is a non-empty convex subset of K ;

2)

f is a closed mapping.

Then f has a fixed point.

Proof. *As K is compact, Hausdorff's Theorem shows that for each ɛ > 0 there exists a finite ɛ-net for K, consisting of x ɛ1, x ɛ2,…, x ɛ n (ɛ), say. For every xK, we set

φ ε i ( x ) = max ( ε | | x x ε i | | , 0 )    ( i = 1 , n ( ε ) ) .

Obviously, the ϕɛ i are continuous non-negative functions on K. As {x ɛ i } n (ɛ) i = 1is an ɛ-net, for each xK there is at least one i with || xx ɛ i ||<ɛ. Therefore we have ϕɛ i (x) > 0 for this i. This argument shows that the following definition is meaningful:

w ε i ( x ) = φ ε i ( x ) / Σ j = 1 n ( ε ) φ ε j ( x )     ( i = 1 , , n ( ε ) ) .

Now we fix an arbitrary point y ɛ i f (x ɛ i ) (i = 1,…, n (ɛ)) and define a continuous single-valued mapping f ɛ: KK, which acts according to the formula

f ε ( x ) = Σ i = 1 n ( ε ) w ε i ( x ) y ε i .

From the conditions y ɛ i K, w ɛ i (x) ⩾ 0, ∑ w ɛ i (x) = 1, and the convexity of K it follows that f ɛ(x) ∈ K. Hence, for any ɛ > 0, we have a continuous single-valued mapping f ɛ: KK. By Schauder's Theorem (3.1), this mapping has a fixed point x ɛ: thus f ɛ(x ɛ) = x ɛ.

As K is compact, there exist a sequence {ɛ n } of positive numbers and a point x *ɛ K such that the following conditions are satisfied:

a ) lim n ε n = 0 ; b ) x ε n x * ; c ) f ε n ( x ε n ) = x n .

We show that x *is the fixed point of f we are seeking. Set U δ= f (x *) + B δ, where B δ= {y:|| y ||<δ}, δ > 0. We shall show that x *U δ for every δ > 0, and so, as f (x *) is closed, we obtain the required result, that x *f (x *) (the fact that f (x *) is closed follows from the fact that f is a closed mapping).

It is clear that U δ is an open convex set and that f (x *) ⊂ U δ. Given x *and U δ, we can choose, by Lemma 1, a ball K ɛ= {x: || xx * ||<ɛ} such that f (K ɛ) ⊂ U δ. By conditions a) and b), there exists a number N such that ɛ n < ɛ/2 and x ɛ n K ɛ/2whenever nN. If w ɛ n i (x ɛ n ) > 0, then

| | x ε n i x ε n | | < ε n < ε / 2

and

| | x ε n i x * | | | | x ε n i x ε n | | + | | x ε n x * | | < ε / 2 + ε /2 = ε .

Hence, for n > N we have x ɛ n i K ɛ, for all i such that w ɛ n i (x ɛ n ) > 0. For these i, we have

(1) y ε n i f ( x ε n i ) f ( K ε ) U δ .

By condition c), we deduce that

(2) x ε n = ω ε n i ( x ε n ) y ε n i .

From (1) and (2) we see that, for nN, the point x ɛ n is a convex combination of just those points y ɛ n i that lie in U δ, and so, as U δ is convex, we have x ɛ n U δ. Taking limits as n → ∞, we see that x * U ¯ δU . As we have already remarked, it follows from this that x *f (x *). This completes the proof of Kakutani's Theorem.

5.3. Now we present an application of Kakutani's Theorem to the theory of games.

We consider a game between two people (player I and player II) having zero sum: this means that one player wins by whatever amount the other loses. A player's strategy is a complete enumeration of all moves that this player will make in each of the possible positions in the course of the game. We denote the set of all strategies (or the space of strategies) of player I by X, and the set of strategies of player II by Y.

A pay-off function is a real-valued function K (x, y) defined on X × Y, the number K (x, y) being interpreted as the winnings of player I if player I chooses strategy x and player II strategy y. The number – K (x, y) is interpreted as the winnings of player II in the same situation. A triple (X, Y, K) where X and Y are sets and K (x, y) is a function on X × Y, is called a game.

When player I chooses his strategy xX, he must assume, knowing nothing about player II's strategy, that his opponent will choose the worst strategy from player I's point of view—that is, that player I's winnings will be inf y Y K ( x , y ) . Hence it is natural for player I to seek a strategy x 0X such that

(3) inf y Y K ( x 0 , y ) = max x X inf y Y K ( x , y ) .

Bearing in mind that player II's winnings are – K (x, y), we see from (3) that player II must look for a strategy y 0Y such that

inf y X { K ( x , y 0 ) } = max y Y inf x X [ K ( x , y ) ] ,

or, what is the same thing,

(4) sup x X K ( x , y 0 ) = min y Y sup x X K ( x , y ) .

Comparison of (3) and (4) shows that a necessary and sufficient condition for the existence of strategies x 0X and y 0Y satisfying (3) and (4) is that for all xX, yY the following inequality should hold:

(5) K ( x , y 0 ) K ( x 0 , y 0 ) K ( x 0 , y ) ,

and a necessary and sufficient condition for this is that

(6) min y Y sup x X K ( x , y ) = max x X inf y Y K ( x , y ) .

The common value of the two sides of (6) is called the value of the game. Each pair of strategies x 0, y 0satisfying (5) is called a solution of the game, and the strategies x 0, y 0themselves are called optimal strategies. * To prove the existence of a solution for a game one is required to prove a minimax theorem, which consists in verifying equation (6) for the given game. With the help of Kakutani's Theorem, we now prove one very general theorem of this type. First we state the following lemma, whose proof is left to the reader.

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/B9780080230368500229

Convex Functions, Partial Orderings, and Statistical Applications

In Mathematics in Science and Engineering, 1992

2.39 Theorem

Let X be a normed space and DX be an open convex set. Let the discrete functional AM + 1(D, ) be given by

A ( f ) = k = 1 n p k f ( x k ) , P n = 1 , p k 0 , k = 1 , , n .

If there exists a point aD with the property

k = 1 n p k a * ( x k , α x k ) 0 ,

then f(α) ≥ A(f).

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S0076539208628143

Convex Functions, Partial Orderings, and Statistical Applications

In Mathematics in Science and Engineering, 1992

13.1 Theorem

For n ≥ 1 let ϕ: A → ℝ be a continuous convex function where A ⊂ ℝ n is an open convex set such that P[XA] = 1. Then μ ∈ A and

(13.1) E ϕ ( X ) = A ϕ ( x ) d F ( x ) ϕ ( μ ) .

Proof

The fact that μA is easy to verify. To show that (13.1) holds we use the proof in Marshall and Olkin (1979, p. 454).

For zA and i = 1, …, n, let

ϕ ( i ) + ( z ) = lim 0 1 [ ϕ ( z 1 , , z i 1 , z i + , z i + 1 , , z n ) ϕ ( z 1 , , z n ) ] ,

and denote

+ ϕ ( x ) = ( ϕ ( 1 ) + ( x ) , , ϕ ( n ) + ( x ) ) , x A .

Then ∇+ϕ(x) is Borel-measurable and, more importantly,

ϕ ( x ) ϕ ( z ) [ + ϕ ( z ) ] [ x z ) , x , z A

holds (Marshall and Olkin, 1979, p. 451). Choosing z = μ, we have

(13.2) ϕ ( x ) ϕ ( μ ) [ + ϕ ( μ ) ] ( x μ ) a .s .

The proof follows by taking expectations on both sides of (13.2).

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S0076539208628258

Analysis of Combinatorial Neural Codes: An Algebraic Approach

Carina Curto , ... Nora Youngs , in Algebraic and Combinatorial Computational Biology, 2019

7.4.1 Convex Realizability

One of the main questions motivating the construction of the neural ideal is to determine which codes have realizations using convex sets, and which do not. Many types of neurons with receptive fields, including both 2D and 3D place cells, have natural convex receptive fields. Structural features of the code are known which either guarantee or prohibit the existence of a realization with convex sets, and the neural ideal and canonical form can often be used to detect these features. We will refer to a code which has a realization using open convex receptive fields in R d for some d as a convex code.

As Exercise 7.8 shows, the simplicial complex alone is insufficient to characterize a code. In addition to the fact that different codes may have the same simplicial complex, codes with the same simplicial complex may have very different properties with respect to convex realizability.

Exercise 7.37

Let C 1 , C 2 , and C 3 be the three codes from Exercise 7.8. Show that C 1 has a realization with convex open sets in R , that C 2 does not have such a realization, but can be realized with convex open sets in R 2 , and that C 3 cannot be realized with convex open sets in R d for any dimension d.

As seen in Exercise 7.37 and previously in Exercise 7.5, there exist codes which are not convex. The obstructions to convexity in these exercises are topological in nature, and are specific examples of a broad class of obstructions known as local obstructions, defined in [13, 14]. These obstructions follow from an application of the Nerve lemma.

Lemma 7.1 (Nerve Lemma)

If the sets U = { U 1 , U n } are convex, then the homotopy type of i = 1 n U i is equal to the homotopy type of the nerve N ( U ) . In particular, i = 1 n U i and N ( U ) have exactly the same homology groups.

The Nerve lemma is a consequence of [15, Corollary 4G.3]. Local obstructions arise in instances where features of the code dictate that any realization of the code using convex sets would violate the Nerve lemma; thus, a code which can be realized with convex sets must have no local obstructions.

Determining if a code C has local obstructions to convexity can be reduced to the question of determining whether C contains a particular minimal code, as Theorem 7.2 will show. However, we first require a definition.

Definition 7.11

Let Δ be a simplicial complex. For any σΔ, the link of σ in Δ is

Lk σ ( Δ ) = def { ω Δ | σ ω = and σ ω Δ } .

Example 7.15

Consider the simplicial complex

Δ = { , { 1 } , { 2 } , { 3 } , { 4 } , { 1 , 2 } , { 1 , 3 } , { 1 , 4 } , { 2 , 3 } , { 3 , 4 } , { 1 , 2 , 3 } } .

Then, Lk{1}(Δ) = {, {2}, {3}, {4}, {2, 3}}, whereas Lk{2}(Δ) = {, {1}, {3}, {1, 3}} and Lk{1, 3}(Δ) = {, {2}} (See Fig. 7.4).

Fig. 7.4

Fig. 7.4. The simplicial complex Δ from Example 7.15 (at left) and the link Lk{1}(Δ) (at right). Here, Lk{1}(Δ) is disconnected, and thus not contractible.

The necessary condition for a code to have no local obstructions will involve determining whether links are contractible. Informally, a topological space (such as a simplicial complex) is contractible if it is connected and has no "holes"; more formally, a space is contractible if it is homotopy-equivalent to a single point. In the previous example, Lk{1}(Δ) is disconnected, and thus is not contractible, but the other two links are both contractible.

Theorem 7.2

(Theorem 1.3 from [13])

For each simplicial complex Δ, there is a unique minimal code C min ( Δ ) with the following properties:

1.

the simplicial complex of C min ( Δ ) is Δ, and

2.

any code C satisfying Δ ( C ) = Δ has no local obstructions if and only if C C min ( Δ ) .

Determining this minimal code depends only on the simplicial complex Δ ( C ) :

C min ( Δ ) = { σ Δ | Lk σ ( Δ ) is noncontractible } { } .

Exercise 7.38

Recall code C = C 3 from Exercises 7.8 and 7.37. With Δ = Δ ( C ) as computed in that exercise, show that Lk{1, 2}(Δ) is not contractible. Conclude that 1100 C min , and that since C does not contain 1100, it cannot be convex.

Exercise 7.39

Recall C = { 000 , 010 , 001 , 110 , 101 } , the code from Exercise 7.5. Find Δ ( C ) . Then, identify a set σ such that Lk σ (Δ) is not contractible, and the associated codeword c with supp(c) = σ is not in C . Conclude that C is not convex.

While it can be difficult in general to detect these local obstructions from the canonical form, it is sometimes possible. Exercise 7.38 exhibits a simple example of such a situation.

Example 7.16

The code C 3 from Exercises 7.8 and 7.38 has canonical form

C F ( J C 3 ) = { x 3 x 4 , x 1 x 2 ( 1 x 3 ) ( 1 x 4 ) , x 3 ( 1 x 1 ) ( 1 x 2 ) , x 4 ( 1 x 2 ) , x 4 ( 1 x 1 ) } .

The first two polynomials together tell us that ( U 1 U 2 ) ( U 3 U 4 ) and U 3 U 4 = . This gives us a clue that we may have an obstruction using σ = {1, 2}, since in any convex realization, U 1 U 2 should also be convex and thus connected. However, we just found that it is contained in the union of two disjoint sets, so it cannot be connected.

We can show that examples such as this one provide a pattern for similar local obstructions.

Exercise 7.40

Suppose C F 2 n . Show that if there is some set σ [ n ] such that both of the following conditions hold:

1.

x σ ( 1 x i ) ( 1 x j ) C F ( J C )

2.

x σ x i x j J C

then C is not convex, as we can use σ to find a local obstruction.

Demonstrating that a code has a local obstruction proves immediately that the code cannot be convex. However, the converse does not hold; there are codes which have no local obstructions but still have no open convex realization, as in the following example.

Example 7.17

(Lienkaemper et al. [16])

The following code on five neurons has no local obstructions, but has no open convex realization:

C = { 00000 , 00100 , 00010 , 10100 , 10010 , 01100 , 00110 , 00011 , 11100 , 10110 , 10011 , 01111 } .

The obstruction to convexity in this case is more geometric in its flavor—one can use convexity show that the straight line which connects points in two different regions has only a limited set of possible regions it can pass through, and only in certain orders, and this can be shown to be impossible in all cases.

Thus, local obstructions can only prove nonconvexity; we cannot assume that a code without local obstructions is necessarily convex. That said, several large classes of codes are known to have convex realizations. We will now show some examples of these classes, and give algebraic signatures from the canonical form which can tell us immediately if a code fits into one of these classes. The first such class is simplicial complexes.

Theorem 7.3

(Curto et al. [13])

If C is a simplicial complex, then C has a convex realization.

Proposition 7.2

Let C be a neural code. Then C is a simplicial complex if and only if C F ( J C ) consists only of monomials.

Exercise 7.41

Prove Proposition 7.2.

Theorem 7.3 and Proposition 7.2 combined show that by computing the canonical form C F ( J C ) , we can immediately detect if C is a simplicial complex, and if so, conclude that C is convex. However, it is quite specific to require that a code be a simplicial complex, and many codes which are quite clearly realizable (including any code where one receptive field is covered by others, as in Example 7.1) are not simplicial complexes. Fortunately, there is a relaxation of the simplicial complex condition which is also known to be convex.

Definition 7.12

A code C is intersection-complete if for any c , d C , there is a codeword v C such that supp ( v ) = supp ( c ) supp ( d ) .

Exercise 7.42

Show that codes which are simplicial complexes are intersection-complete. Give an example to show that the converse does not hold.

The larger class of intersection-complete codes is also known to be convex realizable.

Theorem 7.4

(Cruz et al. [17])

If C is intersection-complete, then C has a convex realization.

Intersection-complete codes are more general than simplicial complex codes, but we can use the canonical form to detect them as well, as the following result indicates.

Theorem 7.5

(Curto et al. [18])

A code C containing the all-zero word 0 is intersection-complete if and only if C F ( J C ) consists only of monomials, and mixed monomials of the form

i σ x i ( 1 x j ) .

From the previous two results, we see that codes whose canonical forms are entirely monomial (as with simplicial complexes) or allow only the most basic mixed monomials (intersection-complete codes) are convex. This might lead us to suspect that having mostly monomial-like relationships is somehow necessary for convexity. However, this is not the case, as codes with no monomials at all in their canonical form are also convex.

Theorem 7.6

(Curto et al. [13])

If C is a neural code and C F ( J C ) contains no monomials, then C has a convex realization.

Furthermore, codes with no monomial relationships are also known to have convex realizations in very low dimensions (1 or 2); see Fig. 7.5 and related discussion. This property also has a simple characterization in terms of the canonical form, as seen in the following exercise.

Fig. 7.5

Fig. 7.5. Constructing a 2D realization of C = { 000 , 100 , 001 , 101 , 110 , 111 } . The codeword 111 is placed at the center of a polygon inscribed within a circle; the remaining codewords are assigned to the spaces created around the edges. We then take U i to be the union of the regions labeled by codewords with neuron i firing.

Exercise 7.43

Show that C F ( J C ) contains no monomials if and only if C contains the codeword 1 = 111…1.

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/B9780128140666000076

Introduction to Ulam stability theory

In Ulam Stability of Operators, 2018

4 Other functional equations and inequalities in several variables

Now, we present examples of stability results that have been obtained for various functional equations and inequalities in several variables.

Let us start with a result of D.H. Hyers and S.M. Ulam [78] for convex functions.

Theorem 14

Let D n be an open convex set with non-empty interior and a function f : D satisfy

f tx + 1 t y tf x + 1 t f y + ε , x , y D , 0 t 1 .

Let B be a closed bounded convex subset of D. Then there exists a convex function g : B such that

f x g x k n ε , x B ,

where

k n = n 2 + 3 n 4 n + 4 .

Next, let us recall a result of Z. Kominek [93] for the Jensen equation.

Theorem 15

Let D n be a bounded set with a nonempty interior and Y be a Banach space. Assume that there exists x 0D such that D 0 := Dx 0 satisfies the condition

1 2 D 0 D 0 .

Then, for each function f : DY such that

sup x , y D f x + y 2 f x + f y 2 < ,

there exists g : n Y such that

g x + y 2 = g x + g y 2 , x , y n ,

and

sup x D f x g x < .

The first author treating Hyers-Ulam stability of the quadratic equation was F. Skof [138], who proved the following.

Theorem 16

Let X be a normed vector space and Y a Banach space. If a function f : XY fulfills

f x + y + f x y 2 f x 2 f y δ , x , y X ,

for some δ > 0, then for every xX the limit

g x = lim n f 2 n x 2 2 n

exists and g is the unique solution of the functional equation

g x + y + g x y = 2 g x + 2 g y , x , y X ,

with

f x g x δ 2 , x X .

Since then, the stability problem for the quadratic equation has been extensively investigated by a number of mathematicians in, e.g., [56, 80, 124, 127].

The Hyers-Ulam-Rassias stability of the pexiderized versions of the additive, Jensen, and quadratic equations, i.e., of the following three equations

f x + y = g x + h y , f x + y 2 = g x + h y 2 , f x + y + g x y = h x + k y ,

has been studied in [79, 95, 96]. In particular the subsequent theorem has been proved in [96].

Theorem 17

Let X be a normed space, Y be a Banach space, and let f, g, h : XY be mappings. Assume that K ≥ 0 and p ≠ 1 are real numbers with

f x + y g x h y K x p + y p , x , y X \ 0 .

Then there exists a unique additive mapping T : XY such that

f x T x f 0 4 K 3 + 3 p 2 p 3 3 p x p , x X \ 0 .

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/B9780128098295500015

Information Geometry

Jun Zhang , Ting-Kam Leonard Wong , in Handbook of Statistics, 2021

3.2 λ-Deformed Legendre duality

Legendre duality has been generalized to c-duality which is used extensively in the theory of optimal transport (Villani, 2003, 2008). In our context, the bilinear pairing xy in (26) is replaced by c(xy), where c is monotone but non-linear in general. In particular, we take c(t) = κ λ (t). By Theorem 1, the requirement of convexity of f(x) is replaced by the requirement of convexity of g(x) = γ λ (f(x)) on a convex set Ω ∋ x, and the λ-conjugate variable is given by the λ-deformed Legendre transformation. Following Wong and Zhang (2021), we made the following definition.

Definition 5

λ-conjugate function

Fix λ R , λ 0 . Given a function f : Ω R on some Ω R d , we define f (λ) on Ω R d by

(30) f ( λ ) ( u ) = sup x Ω κ λ ( x u ) f ( x ) , u R d .

We call f (λ) the λ-conjugate of f.

The following result connects the notion of λ-convexity and the deformed duality under the function κ λ .

Theorem 1 λ-duality

Let λ ≠ 0 and let f be a smooth and λ-convex function on some open convex set Ω R d . Further assume 1 − λ∇f(x) ⋅ x > 0 on Ω. c

(i)

Consider f (λ) as a function on some other open convex set Ω′. Then the operation of λ-conjugation is involutive on Ω: (f (λ))(λ) = f;

(ii)

For x ∈ Ω, define the λ-gradient ∇ λ by

(31) λ f ( x ) : = 1 1 λ f ( x ) x f ( x ) .

Then u = ∇ λ f(x) is a C 1 diffeomorphism from Ω onto its range Ω′ which is an open set;
(iii)

For x ∈ Ω and u = ∇ λ f(x) we have 1 + λxu > 0 and the following identity holds

(32) f ( x ) + f ( λ ) ( u ) 1 λ log ( 1 + λ x u ) = κ λ ( x u ) .

In reminiscent of Legendre duality, we call the above set of relations the λ -deformed Legendre duality, or λ -duality in short.

Proof

The proof of this theorem is given in scattered parts in Wong (2018, Section 3.3). Here we just rephrased the results in terms of the λ-duality.

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S0169716121000249

Barrels and Other Features of TVS's

Eric Schechter , in Handbook of Analysis and Its Foundations, 1997

BARRELS AND ULTRABARRELS

27.20.

Remarks Ultrabarrels are a generalization of barrels. Barrels are simpler to define, but they are mainly useful in locally convex spaces; ultrabarrels can be useful in the more general setting of topological vector spaces. The theories of barrels in LCS and ultrabarrels in TVS are closely analogous; the analogy will be developed in the sections below.

The definitions of barrels and ultrabarrels involve absorbing sets (defined in 12.8). In a TVS, absorbing sets may be viewed as "generalized neighborhoods of 0" — any neighborhood of 0 is absorbing, but not every absorbing set is a neighborhood of 0. For instance, sketch a graph of {(x, y) ∈ ℝ2 : |y| ≥ x 2 or y = 0}; show that this set is absorbing but is not a neighborhood of (0, 0) when ℝ2 is equipped with its usual topology.

27.21.

Definition Let X be a topological vector space. A barrel in X is a subset of X that is closed, convex, balanced, and absorbing. (Those terms are defined in 5.13, 12.3, and 12.8.)

27.22.

Basic properties of barrels Let X be a topological vector space.

a.

If X is a locally convex space, then X has a neighborhood base at 0 consisting of barrels.

b.

If ρ is a continuous seminorm on X and k > 0, then {xX : ρ(x) ≤ k} is a barrel.

c.

If B is a barrel in X, then its Minkowski functional μ B is a seminorm on X. Moreover, μ B is continuous if and only if B is a neighborhood of 0.

27.23.

Definition Let X be a vector space. A string in X is a sequence of sets (S n : n ∈ ℕ) that are balanced, absorbing, and satisfy S n S n+1 + S n+1 for all n. The S n 's are then called the knots of the string.

In a topological vector space, a closed string is a string whose knots are closed sets; those closed sets are called ultrabarrels. We still have a string if we discard the first few knots of a string; hence every ultrabarrel may also be viewed as the first knot of a closed string.

In a topological vector space, a neighborhood string is a string, all of whose knots are neighborhoods of 0. (Some mathematicians call this a topological string.)

27.24.

Basic properties of strings and ultrabarrels

a.

If B is a barrel in a TVS X, then B is also an ultrabarrel — it is a knot of the closed convex string (S n ) defined by S n = 2n B.

b.

If (S n ) is a string in a vector space X, then {S n } forms a neighborhood base at 0 for a TVS topology on X.

Conversely, if X is a TVS, then X has a neighborhood base at 0 consisting of ultrabarrels.

c.

If ρ is an F-seminorm on a vector space X and k is a positive constant, then the sets S n = {xX : ρ(x) ≤ 2n } form a string.

If ρ is a continuous F-seminorm on a TVS X, then the sequence (S n ) defined as above is a closed string; thus its members are ultrabarrels.

d.

If (S n ) is a string in a vector space X, then there exist an F-seminorm ρ on X and positive numbers a n , b n decreasing to 0 that satisfy

{ x : ρ ( x ) a n } S n { x X : ρ ( x ) b n }

for all n. (Hint: The sets V n = {(x, y) ∈ X × X : x − yS 2n } satisfy the hypotheses of 4.44.)

Suppose, moreover, that X is a TVS. Then ρ is continuous if and only if (S n ) is a neighborhood string. (Hint: An F-seminorm is continuous if and only if it is continuous at 0.)

e.

If (S n ) and (T n ) are strings and S n + T n = U n , then (U n ) is a string.

27.25.

Proposition Suppose X is a complete metric space or, more generally, a Baire space. Then:

(i)

If X is a TVS, then X is ultrabarrelled, as defined in 27.26(U1).

(ii)

If X is an LCS, then X is barrelled, as defined in 27.27(B1).

Proof It suffices to prove (i), since that result implies (ii) by 27.24.a. Let (S n ) be a closed string; we wish to show that S 1 is a neighborhood of 0. Since S 2 is absorbing, X = k = 1 k S 2 . By 20.15(B), some kS 2 has nonempty interior. Hence S 2 has nonempty interior. Say x 0 ∈ int(S 2). Thus x 0 + GS 2 where G is some neighborhood of 0. Since S 2 is symmetric, we have 0 + (G − G) = (x 0 + G) - (x 0 + G) ⊆ S 2S 2 = S 2 + S 2S 1.

27.26.

Theorem and definition Let (X, J) be a real or complex topological vector space. Then the following conditions on (X, J) are equivalent. If any, hence all, of these conditions are satisfied, we say (X, J) is ultrabarrelled.

(U1)

Every ultrabarrel in X is a neighborhood of 0.

(U2)

(F-Seminorms Property.) Each lower semicontinuous F-seminorm on X is continuous.

(U3)

(Banach's Closed Graph Property.) Let Y be an F-space. Let f : XY be linear and have closed graph. Then f is continuous.

(U4)

(Neumann's Nonlinear Closed Graph Property.) Let Y be a locally full F-space. Let Ω ⊆ X be an open convex set. Suppose f : Ω → Y is a convex operator whose graph is a closed subset of Ω − Y. Then f is continuous.

(U5)

(Banach-Steinhaus Uniform Boundedness Property.) Let Y be a topological vector space. Let Φ be a collection of continuous linear maps from X into Y that is toplinearly bounded pointwise. Then Φ is equicontinuous.

(U6)

(Neumann's Nonlinear Uniform Boundedness Property.) Let Y be an ordered topological vector space that is locally full. Let Ω ⊆ X be an open convex set. Let Φ be a collection of continuous convex maps from Ω into Y. Suppose Φ is toplinearly bounded pointwise. Then Φ is equicontinuous.

Proof of this theorem begins in Section 27.31.

27.27.

Theorem and Definition Let (X, J) be a real or complex locally convex space. Then the following conditions on (X, J) are equivalent. If any, hence all, of these conditions are satisfied, we say (X, J) is barrelled.

(B1)

Every barrel in X is a neighborhood of 0.

(B2)

(Seminorms Property.) Each lower semicontinuous seminorm on X is continuous.

(B3)

(Closed Graph Property.) Let Y be a Fréchet space. Let f : XY be linear and have closed graph. Then f is continuous.

(B4)

(Neumann's Nonlinear Closed Graph Property.) Let Y be a locally full Fréchet space. Let Ω ⊆ X be an open convex set. Suppose f : Ω → Y is a convex function, whose graph is a closed subset of Ω x Y. Then f is continuous.

(B5)

(Uniform Boundedness Property.) Let Y be a locally convex space. Let Φ be a collection of continuous linear maps from X into Y that is toplinearly bounded pointwise. Then Φ is equicontinuous.

(B6)

(Neumann's Nonlinear Uniform Boundedness Property.) Let Y be a locally full, locally convex space. Let Ω ⊆ X be an open convex set. Let Φ be a collection of continuous convex maps from Ω into Y that is toplinearly bounded pointwise. Then Φ is equicontinuous.

Proof of this theorem begins in Section 27.31.

Remark A seventh characterization of barrelled spaces will be given in 28.30.

27.28.

Corollaries: classical versions Let X be an F-space. By 27.25:

a.

Any lower semicontinuous F-seminorm on X is continuous.

b.

Uniform Boundedness Theorem. If Y is a topological vector space and Φ is a collection of continuous linear maps from X into Y such that Φ(x) = {f(x) : f ∈ Φ} is a bounded subset of Y for each xX, then Φ is equicontinuous.

c.

Closed Graph Theorem. If Y is an F-space and f : XY is a linear map whose graph is a closed subset of XY, then f is continuous.

d.

If Ω is an open convex subset of X and f : Ω → ℝ is a convex function whose graph is a closed subset of Ω − ℝ, then f is continuous.

27.29.

Example application Let (Ω, S, μ) be a measure space, let X be a Banach space, and let α, β ∈ (0, +∞) be exponents such that L α(μ, X) ⊆ L β(μ, X). Then the inclusion L α ( μ , X ) L β ( μ , X ) is continuous.

Proof It suffices to show that the inclusion map has closed graph. Suppose f n f in L α(μ, X) and f n g in L β(μ, X); we are to show that f = g. By 22.31(ii) we may pass to subsequences such that f n f and f n g pointwise μ-almost everywhere.

Remarks This example is taken from Villani [1985]. That paper also shows the following interesting result: Let X be a Banach space, let (Ω, S, μ) be a measure space, and let α, β ∈ (0, +∞) with α > β. Then

L α ( μ , X ) L β ( μ , X ) if and only if inf { μ ( S ) : S S , μ ( S ) > 0 } > 0 ; L α ( μ , X ) L β ( μ , X ) if and only if sup { μ ( S ) : S S , μ ( S ) < 0 } < ;

Special cases of this were given in 22.34. (Villani's paper only shows this for X = ℝ, but that case easily yields the general case since all the functions in L α(μ, X) or L β(μ, X) are measurable, and we can separate the "regular" condition from the "not too big" condition — see the remarks in 22.28.)

27.30.

Change of scalar field Let X be a complex topological vector space (respectively, a complex locally convex space). Then X, with the same topology, may also be viewed as a real topological vector space (respectively, a real locally convex space) if we "forget" how to multiply members of X by nonreal scalars. Let us denote these two TVS's by X and X . Note that the choice of scalars affects the definitions of "balanced" and "absorbing;" hence it affects the definitions of "barrel" and "ultrabarrel." Show that

a.

If B is an (ultra)barrel in X , then B is also an (ultra)barrel in X . Likewise, any (F−)seminorm on X C is also an (F−)seminorm on X .

Hence, if X is (ultra)barrelled, then X is (ultra)barrelled, too.

b.

It is possible for X to have more (ultra)barrels than X .

For instance, show that the set B = {z ∈ ℂ : |Re(z)| ≤ 1 and |Im(z)| ≤ 2} is both a barrel and an ultrabarrel in X , but is neither in X since B is not balanced in X .

c.

Nevertheless, X is (ultra)barrelled if and only if X is (ultra)barrelled.

For the moment, we shall prove this equivalence using only definitions (U2) and (B2); proofs with the other definitions in 27.26 and 27.27 will follow from the arguments given in the next subchapter.

We have already established half of this "if and only if" result. Now assume X is (ultra)barrelled — i.e., it satisfies condition (U2) or (B2). To show the same for X , let ρ be any lower semicontinuous (F-)seminorm on X ; we wish to show that ρ is continuous on X . Note that X and X differ only in their algebraic operations — they are the same set, and they have the same topology; so a function is continuous on X if and only if it is continuous on X . Define γ : X → [0, +∞) as in 26.5.b. As we noted in 26.5.b, this function γ is also lower semicontinuous on X, and γ is an (F-)seminorm on X . Hence, by our assumption, γ is continuous. From the inequality ρ ≤ γ, we see that ρ is continuous at 0. Since ρ is a G-seminorm, we have |ρ(u) - ρ(v)| ≤ ρ(u − v), and therefore ρ is continuous.

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/B9780126227604500273

Probabilities and Potential C

In North-Holland Mathematics Studies, 1988

Convex Cones with a Compact Base

62

It will be convenient here to suppose that K is contained in a closed affine hyperplane H ∈ E, which doesn't pass through the origin. We then denote by C the cone with base K, in other words C = U{t≥0} tK.

If μ is a positive bounded measure on K we define as in no. 29 the resultant r(μ) of μ.

In many applications in analysis it is the cone C which arises naturally, the choice of K - the intersection of C with a hyperplane H = {h = 1}-corresponding to a certain way of normalizing the elements of C, which is somewhat arbitrary. See the examples at the end of the chapter.

Since C is a convex cone, there corresponds to it an order relation compatible with addition, which we will denote < in this section. We will consider it only on C itself,

(62.1) ( x y ) ( y x C ) ( x , y C )

Remark that we have tx ≤ × for 0 ≤ t ≤ 1, and × ≤ tx for t ≥ 1. On the other hand if we represent the hyperplane in the form {h = 1} where h is continuous linear on E, h is increasing on C, and takes the value 0 on C only at the point 0. There is a bisection between the decompositions of × ∈ K in C,

x = y 1 + + y n ; y 1 , …. , y n C , y i 0 f o r a l l i

and the barycentric representations in K

x = t 1 z 1 + + t n z n ; z 1 , , z n k , t i > 0 f o r a l l i , i t i = 1

the correspondence being given by ti = h(yi), zi = yi/ti. For example to say that × is extremal amounts to saying that in every decomposition × = Σiyi of × into elements of C, all the yi are proportional to x.

The main tool will be the following lemma, which permits calculation of f ˆ by means of decompositions in the cone C:

63

THEOREM. Let f be a continuous function on K, which we extend (without changing notation) as a positively homogeneous function on C. Then we have for all × ∈ K

(63.1) f ˆ ( x ) = sup i f ( y i )

the sup being taken over all the finite decompositions × = Σiyi, yi ∈ C.

Proof. Thanks to the relation between decompositions and barycentric representations, this formula can be written f ˆ ( x ) = s u p μ μ ( f ) , μ running over the set of probability laws on K with barycentre x, and with finite support. We know moreover (40) that f ˆ ( x ) = s u p μ ( f ) , μ running over set of all the laws with barycentre x. It thus suffices to be able to approximate each law μ, in the sense of vague convergence, by discrete measures with the same barycentre . To establish this consider a finite covering of K by convex open sets ω i; it is easy to construct a representation of the law μ as a convex combination μ = σitiμi, where for each i the law μi is carried by ωi. The measure with finite support Σitiεb(μi) = λ thus admits the same barycentre as μ. On the other hand, as E is locally convex and K compact, there exist coverings of the preceding type finer than any given covering, and the corresponding measures λ are arbitrarily close to μ for narrow convergence.

Here is Choquet's uniqueness theorem (the proof says a little more than the statement). Recall that each point of K is the barycentre of at least one maximal measure for the order ⊣λ (41), and that the maximal measures are exactly those carried by the boundary when K is metrizable (43).

64

THEOREM. The two following properties are equivalent

1)

The cone C is a lattice for its own order ≤.

2)

Every point of K is the barycente of a unique maximal law.

Proof. The implication 2) ⇒ 1) is the easy part: if there is uniqueness, the application μ ↦ r(μ) is an isomorphism of the convex cone of bounded maximal measures to the cone C, and the cone of maximal measures is a lattice (in the metrizable case, it is the cone of measures on the boundary, in the non-metrizable case, it is characterized by the property μ ( f ˆ f ) = 0 for f ∈ C (K)). Conversely suppose that C is a lattice, we shall establish not only property 2) of the statement but many other interesting properties.. Here is the first:

(64.1) For all continuous convex ¯ f on ¯ k , f ˆ is affine on ¯ K .

As f ˆ is concave, it suffices to show that f ˆ is also convex. Extending f and f ˆ from K to C as positively homogeneous functions (without changing notationcf. 63), this comes down to showing that f ˆ ( x + y ) f ˆ ( x ) + f ˆ ( y ) for x, y ∈ C. But by 63

f ˆ ( x + y ) = sup i f ( z i ) , where x + y = i z i .

As C is a lattice, the classical decomposition lemma asserts the existence of decompositions zi = xi + yi with Σixi = x, Σiyi = y. As f is convex, we have f(zi) ≤ f(xi) + f(yi), and hence

Σ i f ( z i ) Σ i f ( x i ) + Σ i f ( y i ) f ˆ ( x ) + f ˆ ( y ) ,

the required inequality.

To establish the decomposition lemma, we reduce the problem to the case of a decomposition z ▭ z1 + z2 with two elements. A proof appears in Bourbaki, Intégr. chap II, §1 no. 1 (reference not guaranteed, the "definitive" edition not yet being ready). But the reader could also extract it from IX.88.

Now, since f ˆ is u.s.c. affine, it is strongly affine (remark in no. 53), and we have μ ( f ˆ ) = f ˆ ( x ) for every law μ with barycentre x. If further μ is is maximal, we have μ ( f ˆ ) = μ ( f ) , so

(64.2) μ ( f ) = f ˆ ( x ) for f continuous convex .

Thus the linear functional μ is determined on the cone V of continuous convex functions. As V-V is dense in C (K) (Stone-Weierstrass theorem) this uniquely determines μ, and the theorem is proven.

We note a further property in the same vein. It is a consequence of(64.2) that

(64.3) T h e m a p f f i s a d d i c t i v e o n V .

Conversely, if this property holds, the map f f ˆ ( x ) is a positive linear functional on V-V, which extends to a positive linear functional μ on C (K), and it is easy to see that every law λ with barycentre × satisfies λ⊣λ μ: so μ is the unique maximal sweeping of εx.

REMARKS.

a)

We say that K is a simplex if every point of E is the barycentre of a unique maximal law, by analogy with the case of R n where the word has its usual meaning. We note that the properties (64.1), (64.3) are internal characterizations of simplexes: no recourse is needed to the cone C and its order.

b)

Property (64.1) generalizes to infinite dimensions the remark well known to café waiters: the cups are upset less often when the tray is placed on a three legged pedestal than when it is placed on a table with four unequal legs.

Read full chapter

URL:

https://www.sciencedirect.com/science/article/pii/S0304020808719169