
This post is centered around a recent nice observation by Boáz Klartag on the Poincaré constant of uniformly log-concave measures, at the heart of his recent investigation of the Kannan-Lovász-Simonovits (KLS) conjecture, that we do not discuss here.
A famous observation due to André Lichnerowicz in differential geometry states that if a compact connected Riemannian manifold of dimension n≥2 has Ricci curvature uniformly bounded below by some real number ρ>0, then the spectral gap λ, or first non-zero eigenvalue of minus the Laplace operator on the manifold, satisfies
λ≥nn−1ρ.
This simply follows by testing the Bochner commutation-curvature formula on an eigenfunction associated to the spectral gap, a method still at the heart of the Klartag observation. Equality is achieved for spheres, and for the unit sphere we have λ=n while we can take ρ=n−1. The Lichnerowicz inequality is essentially a comparison to spheres.
This statement has an analogue for uniformly log-concave probability measures on Rn : if dμ(x)=e−V(x)dx on Rn with, for some ρ>0, ∇2V(x)≥ρId as quadratic forms for all x∈Rn, then the spectral gap λ of the Markov diffusion operator Δ−⟨∇V,∇⟩ satisfies
λ≥ρ.
This inequality is obtained below by following the Lichnerowicz argument, via the Bochner commutation-curvature formula on an eigenfunction. Equality is achieved for the Gaussian case V(x)=ρ2‖x‖2, and the inequality is this time essentially a comparison to Gaussians.
This inequality can also be deduced from the Bakry-Émery curvature-dimension criterion, which is an abstraction of the Bochner commutation-curvature formula approach. For the pleasure, we also present, at the end of the post, a derivation of the Brascamp-Lieb inequality from the Bochner commutation-curvature formula, via a Helffer-Sjöstrand representation of the covariance, which is of independent interest.
This post is mostly devoted to the following improvement by Boáz Klartag :
λ≥√ρ‖Σ‖op≥ρ,
where ‖Σ‖op is the operator norm of the covariance matrix of μ. On the opposite side, note that we always have λ≤1/‖Σ‖op regardless of the log-concavity of μ, see below. Equality is achieved in the above inequalities in the Gaussian case for which Σ=1ρId and λ=ρ.
The first Klartag inequality reads
λ2‖Σ‖op≥ρ,
and reminds the uncertainty principle in harmonic analysis : for a given lower bound on the curvature, we cannot have in the same time a small spectral gap and a small operator norm for the covariance. The second Klartag inequality reads
ρ‖Σ‖op≤1,
and can be interpreted as follows : we cannot have in the same time a high value for the lower bound on the curvature and for the operator norm of the covariance.
Poincaré inequality and Poincaré constant. The Poincaré constant cP(μ) of a probability measure μ on Rn is the smallest constant c such that for all f∈C∞c,
Varμ(f):=∫(f−∫fdμ)2dμ≤c∫‖∇f‖2dμ.
An approximation argument based on cutoff and smoothing allows typically to extend the inequality to f∈H1(μ) where H1(μ)=W1,2(μ) is the Sobolev space of square integrable functions with square integrable weak derivative. Both the left and right hand sides of the inequality vanish if and only if f is constant μ almost surely, and
1cP(μ)=inf{∫|∇f|2dμVarμ(f):f not constant}.
Let us introduce the mean vector or barycenter m=(mi)1≤i≤n of μ defined by
m:=∫xdμ(x)
and the covariance matrix (Σi,j)1≤i,j≤n of μ defined by
Σij=∫xixjdμ(x)−(∫xidμ(x))(∫xjdμ(x))=∫(xi−mi)(xj−mj)dμ(x).
For all u∈Rn with ‖u‖=1, the linear function gu(x):=⟨x,u⟩=∑ni=1uixi satisfies
∫gudμ=⟨m,u⟩and∫g2udμ=⟨Σu,u⟩+⟨b,u⟩2and‖∇gu(x)‖2=‖u‖2=1,
hence by Poincaré inequality,
⟨Σu,u⟩=Varμ(gu)≤cP,
Introducing the operator norm ‖Σ‖op:=sup‖u‖=1‖Σu‖=sup‖u‖=1⟨Σu,u⟩, this gives
‖Σ‖op≤cP.
Equality is achieved when μ is Gaussian, as we can check by scaling with √Σ, tensorization, and expansion of L2(N(0,1)) in terms of Hermite orthogonal polynomials.
The Kannan-Lovász-Simonovits (KLS) conjecture states that the opposite bound
cP≤c‖Σ‖op
holds up to a universal constant c provided that μ is log-concave (meaning that V is convex), in other words the Poincaré constant of log-concave measures can be checked on linear test functions. The best bound at the time of writing, due to Klartag, and based on his discovery of an improved Lichnerowicz bound, is
cP≤c√log(n)‖Σ‖op.
Spectral gap of Markov diffusion operator. Suppose from now on that μ writes
dμ(x)=e−V(x)dx
with V∈C2(Rn→R). The associated Markov diffusion operator is
Lf:=Δf−⟨∇V,∇f⟩.
We have the integration by parts formula, for all f,g∈C∞c,
∫f(−L)gdμ=∫g(−L)fdμ=∫⟨∇f,∇g⟩dμ,
in particular, f=g and g→1 give
∫f(−L)fdμ=∫‖∇f‖2dμand∫Lfdμ=0.
In L2(μ) the unbounded operator −L is non-negative since L is Markov, and its kernel contains only the constant functions. We have
1cP=inf{∫‖∇f‖2dμVarμ(f):f not constant}=inf∫fdμ=0∫f2dμ=1∫f(−L)fdμ,
Moreover −L has discrete spectrum formed by eigenvalues 0=λ0<λ1<⋯ and λ:=λ1−λ0=λ1 is the spectral gap of −L. If f is an eigenfunction of −L with eigenvalue λ1, we can assume by scaling that ∫f2dμ=1, while −Lf=λf gives ∫fdμ=0, thus
1cP=λ.
Bochner formula. It is the commutator-curvature formula
L∇−∇L=∇2V∇.
By taking the inner product with ∇ and using the integration by parts, we get
∫⟨∇2V∇f,∇f⟩dμ=∫⟨L∇f,∇f⟩dμ−∫⟨∇f,∇Lf⟩dμ=−∫‖∇2f‖2HSdμ+∫(Lf)2dμ,
where ‖∇2f‖2HS=Tr((∇2f)2)=∑nij=1(∂2ijf)2 is the squared Hilbert-Schmidt norm of the Hessian matrix of f. In other words, we have obtained the mean Bochner formula
∫(Lf)2dμ=∫‖∇2f‖2HSdμ+∫⟨∇2V∇f,∇f⟩dμ.
In Bakry-Émery theory, this is also the integrated Γ2 formula ∫(Lf)2dμ=∫Γ2fdμ.
If μ is uniformly log-concave : for a constant ρ>0 and a convex C:Rn→R,
V(x)=12ρ‖x‖2+C(x),
then ∇2V≥ρId as quadratic forms, and if f is an eigenfunction of −L associated to the eigenvalue λ carrying the spectral gap of −L then we can assume by scaling that ∫f2dμ=1, and we get then from the mean Bochner formula and integration by parts
λ2=∫(Lf)2dμ≥∫⟨∇2V∇f,∇f⟩dμ≥ρ∫‖∇f‖2dμ=ρ∫f(−L)fdμ=λρ,
hence the inequality
1cP=λ≥ρ,
which is a log-concave analogue of the Lichnerowicz inequality. This method of proof is essentially the one of Lichnerowicz. Equality is achieved in the Gaussian case C≡0.
Klartag theorem. It reinforces the previous result by incorporating an information on the covariance matrix. If μ is uniformly log-concave of constant ρ>0 then
cP≤√‖Σ‖opρ≤1ρ.
Proof of Klartag theorem. Let f be an eigenfunction of −L associated to the eigenvalue λ=1/cP, namely −Lf=λf=1cPf. We can assume by scaling that ∫f2dμ=1. Moreover −Lf=λf gives ∫fdμ=−λ−1∫Lfdμ=0. The Klartag lemma below gives
∫⟨∇V2∇f,∇f⟩dμ≤λ3‖Σ‖op.
But since ∇2V≥ρId in the sense of quadratic forms, we get
∫⟨∇V2∇f,∇f⟩dμ≥ρ∫‖∇f‖2dμ=ρ∫f(−L)fdμ=ρλ,
hence the first inequality of the theorem
λ2‖Σ‖op≥ρ.
For the second inequality of the theorem, we use the fact that ‖Σ‖op≤cP=1λ≤1ρ.
Klartag lemma. If f is an eigenfunction of −L associated to the eigenvalue λ=1/cP in other words −Lf=λf=(1/cP)f and such that ∫f2dμ=1, then
1λ∫⟨∇2V∇f,∇f⟩dμ≤‖∫∇fdμ‖2=λ2‖∫xf(x)dμ(x)‖2≤λ2‖Σ‖op.
Proof of Klartag lemma. Let f be such that −Lf=λf with λ=1/cP, and ∫f2dμ=1. By using the integration by parts, the mean Bochner formula, and the Poincaré inequality for each ∂if, 1≤i≤n, we get
λ2=∫(Lf)2dμ=∫⟨∇2V∇f,∇f⟩dμ+∫‖∇2f‖2HSdμ≥∫⟨∇2V∇f,∇f⟩dμ+λ(∫‖∇f‖2dμ−‖∫∇fdμ‖2)≥∫⟨∇2V∇f,∇f⟩dμ+λ2−λ‖∫∇fdμ‖2.
On the other hand, for any u∈Rn such that ‖u‖=1, with gu(x):=⟨x,u⟩,
∫⟨∇f,u⟩dμ=∫⟨∇f,∇gu⟩dμ=−∫(Lf)gudμ=λ∫f(x)⟨x,u⟩dμ(x).
But from this identity, using ∫f2dμ=1, ∫fdμ=0, and the Cauchy-Schwarz inequality,
|∫⟨∇f,u⟩dμ|2=λ2|∫f(x)(⟨x,u⟩−⟨m,u⟩)dμ(x)|2≤λ2∫(⟨x,u⟩−⟨m,u⟩)2dμ(x)=λ2⟨Σu,u⟩.
Helffer-Sjöstrand, Brascamp-Lieb, Poincaré. The covariance of f and g is
Covμ(f,g):=∫(f−∫fdμ)(g−∫gdμ)dμ.
For a fixed f, let us seek for h depending on f such that for all g,
Covμ(f,g)=∫⟨∇h,∇g⟩dμ=−∫(Lh)gdμ.
Since ∫Lhdμ=0, we get
∫(f−∫fdμ+Lh)(g−∫gdμ)dμ=0,
in other words f−∫fdμ+Lh is orthogonal to centered functions, and is thus constant, also the following Poisson equation holds true:
f−∫fdμ=−Lh.
Let us try to express ∇h in terms of ∇f. By the Bochner formula
∇f=−∇Lh=−L∇h
where L∇h is the action of L on ∇h, coordinate by coordinate, in other words the operator L acting on differential forms just like the Laplacian in de Rham cohomology. Also h must be such that ∇h=(−L)−1∇f, which gives the Helffer-Sjöstrand formula
Covμ(f,g)=∫⟨(−L)−1(∇f),∇g⟩dμ.
Suppose from now on that μ is strictly log-concave in the sense that ∇2V>0 everywhere in the sense of quadratic forms. By the Bochner formula, as functional quadratic forms on differential forms we have
(−L)≥∇2V,hence(−L)−1≤(∇2V)−1.
Combined with the Helffer-Sjöstrand representation of the variance, we get
Varμ(f)≤∫⟨(∇2V)−1∇f,∇f⟩dμ.
This is the Brascamp-Lieb inequality, which can be proved by many ways. If μ is uniformly log-concave : ∇2V≥ρId as quadratic forms for some constant ρ>0, then we obtain a Poincaré inequality of constant 1/ρ :
Varμ(f)≤1ρ∫‖∇f‖2dμ=1ρ∫f(−L)fdμ.
In other words cP≤1ρ. Equality is achieved for instance in the gaussian case C≡0. For Bakry-Émery connoisseurs, the inequality (−L)≥ρId on differential forms (gradients) means ∫(Lf)2dμ≥ρ∫‖∇f‖2dμ, which is, thanks to the mean Bochner formula, nothing else but the inequality ∫Γ2(f)dμ≥ρ∫Γ(f)dμ known as the critère Γ2 intégré''.
Apparently, the Brascamp-Lieb inequality was already known by Lars Hörmander.
Bakry-Émery Γ2. The Bakry-Émery Γ2 is an abstraction of the Bochner commutation-curvature formula. Indeed, having in mind that L=Δ−⟨∇V,∇⟩, we find, for all f,φ,
L(φ(f))=φ′(f)Lf+φ″(f)‖∇f‖2,
in particular L(f2)=2fLf+2‖∇f‖2, which leads to the functional quadratic form
Γ(f,f):=12L(f2)−fLf=‖∇f‖2,
and equivalently or more generally,
Γ(f,g)=12L(fg)−fLg−gLf=⟨∇f,∇g⟩.
Similarly, we find, using also this time the Bochner commutation-curvature formula,
LΓ(f,f)=2Γ(f,Lf)+2(‖∇2f‖2HS+⟨∇2V∇f,∇f⟩),
which leads to define
Γ2(f,f):=12LΓ(f,f)−Γ(f,Lf)=‖∇2f‖2HS+⟨∇2V∇f,∇f⟩.
At this step, we already know that by averaging over dμ(x)=e−V(x)dx and using integration by parts, we recover two variants of the mean Bochner formula :
∫Γ2(f)dμ=∫(Lf)2dμ=∫fL2fdμ.
The Γ and Γ2 objects can be defined on manifolds, in which case a Ricci curvature term appears in Γ2, that suggests naturally to also interpret the Hessian ∇2V as a curvature. This makes perfectly sense having in mind the Lichnerowicz type comparisons.
Integrated Bakry-Émery Γ2 criterion. It is a characterization of the Poincaré inequality that reads as follows, for all ρ>0 :
∀f,Varμ(f)≤1ρ∫‖∇f‖2dμ⇕∀f,∫‖∇f‖2dμ≤1ρ∫(‖∇f2‖2HS+⟨∇2V∇f,∇f⟩)dμ,
in other words, by using the integration by parts and the mean Bochner formula,
∀f centered,∫f2dμ≤1ρ∫f(−L)fdμ⇕∀f,∫f(−L)fdμ≤1ρ∫(Lf)2dμ.
This equivalence is immediate when using an eigenfunction carrying the spectral gap λ and the equivalence of λ≥ρ with cP≤1/ρ. It appears also immediately if we reformulate in terms of functional quadratic forms on centered functions :
Id≤1ρ(−L)⇕(−L)≤1ρ(−L)2.
This reminds what we did above for the Helffer-Sjöstrand formula, indeed
∫(Lf)2dμ=∫⟨∇f,∇(−L)f⟩dμ.
Note that on centered functions (−L)−1=∫∞0etLdt, a link to semigroup interpolation.
Final words. The probabilistic and geometric functional analysis contains other comparisons to spheres, for instance the Myers diameter inequality and the Lévy-Gromov isoperimetric inequality. The analogue comparisons to Gaussians were extensively developed by Dominique Bakry and Michel Ledoux, around a curvature-dimension inequality
Γ2(f)≥ρΓ(f)+1n(Lf)2,
which abstracts the Bochner commutation-curvature formula while incorporating the dimension. For simplicity, this post is free of any semigroup or stochastic process.
By analogy, we could ask about a Klartag type improvement of the log-Sobolev inequality by incorporating the operator norm of the covariance matrix, something like
cP≤cLS2≤√‖Σ‖opρ≤ρ.
But it could be something more involved. Regarding this type of analogy, it is already known that there is no log-Sobolev analogue of the Brascamp-Lieb inequality.
We could ask about the relevance of the Poincaré inequality with respect to the spectral gap. Actually, in particular in statistical mechanics, the Poincaré inequality is a functional formulation that allows specific methods such as conditioning and the martingale method. It is also related to the quantification of the ergodic phenomenon, and to the geometric analysis related to isoperimetry and concentration of measure. It plays moreover an essential role in the family of Sobolev type inequalities. Depending on your culture or tastes, you may prefer this or that, but a truth is that many aspects are here, connected, waiting for enthusiasm, curiosity, and talent.
Further reading.
- Klartag, Boáz
Logarithmic bounds for isoperimetry and slices of convex sets
arXiv:2303.14938 (2023) - Gallot, Sylvestre
Minorations sur le λ1 des variétés riemanniennes
Séminaire N. Bourbaki (1981) - Ledoux, Michel
The geometry of Markov diffusion generators
Annales de la Faculté des sciences de Toulouse : Mathématiques (2000) - Chafaï, Djalil
Covariance de modèles d'interfaces et marches aléatoires en environnement aléatoire
Unpublished expository notes in French (2001)
