{"id":15112,"date":"2022-03-19T18:00:29","date_gmt":"2022-03-19T17:00:29","guid":{"rendered":"https:\/\/djalil.chafai.net\/blog\/?p=15112"},"modified":"2022-04-01T16:49:28","modified_gmt":"2022-04-01T14:49:28","slug":"few-bits-of-optimal-transportation","status":"publish","type":"post","link":"https:\/\/djalil.chafai.net\/blog\/2022\/03\/19\/few-bits-of-optimal-transportation\/","title":{"rendered":"Few bits of optimal transportation"},"content":{"rendered":"<figure id=\"attachment_15193\" aria-describedby=\"caption-attachment-15193\" style=\"width: 773px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/fr.wikipedia.org\/wiki\/Gaspard_Monge\"><img loading=\"lazy\" class=\"wp-image-15193 size-large\" src=\"http:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-773x1030.jpg\" alt=\"Statue de Gaspard Monge \u00e0 Beaune\" width=\"773\" height=\"1030\" srcset=\"https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-773x1030.jpg 773w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-225x300.jpg 225w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-768x1024.jpg 768w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-1152x1536.jpg 1152w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-1536x2048.jpg 1536w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2021\/08\/Beaune_-_Monument_de_Gaspard_Monge-scaled.jpg 1920w\" sizes=\"(max-width: 773px) 100vw, 773px\" \/><\/a><figcaption id=\"caption-attachment-15193\" class=\"wp-caption-text\">Statue of Gaspard Monge (1746 - 1818), place Monge, Beaune, C\u00f4te d'Or, France.<\/figcaption><\/figure>\n<p style=\"text-align: justify;\">This post is about some aspects of transportation of measure. It is mostly inspired from the lecture notes of an advanced master course prepared few years ago in collaboration with my colleague <a href= \"https:\/\/djalil.chafai.net\/scripts\/search.php?q=Joseph+Lehec\">Joseph Lehec<\/a> in Universit\u00e9 Paris-Dauphine - PSL. The objective is to reach the Caffarelli contraction theorem, one of my favourite theorems.<\/p>\n<p style=\"text-align: justify;\"><b>Pushforward or image measure.<\/b> Let \\( {T :\\mathbb{R}^n \\rightarrow \\mathbb{R}^n} \\) and \\( {\\mu} \\) be a probability measure on \\( {\\mathbb{R}^n} \\). The <em>pushforward<\/em> of \\( {\\mu} \\) by \\( {T} \\) is the measure \\( {\\nu} \\) given, for every Borel set \\( {A\\subset\\mathbb{R}^n} \\), by<\/p>\n<p style=\"text-align: center;\">\\[ \\nu ( A ) = \\mu ( T^{-1} ( A ) ). \\]<\/p>\n<p style=\"text-align: justify;\">In other words \\( {T(X)\\sim\\nu} \\) when \\( {X\\sim\\mu} \\), and thus for all test function \\( {h} \\),<\/p>\n<p style=\"text-align: center;\">\\[ \\int_{\\mathbb{R}^n} h \\mathrm{d}\\nu = \\int_{\\mathbb{R}^n} h \\circ T \\mathrm{d}\\mu. \\]<\/p>\n<p style=\"text-align: justify;\"><b>The Brenier theorem.<\/b> It states that if \\( {\\mu} \\) and \\( {\\nu} \\) are two probability measures on \\( {\\mathbb{R}^n} \\) with \\( {\\mu} \\) absolutely continuous with respect to the Lebesgue measure then there exists a unique map \\( {T:\\mathbb{R}^n\\rightarrow\\mathbb{R}^n} \\) pushing forward \\( {\\mu} \\) to \\( {\\nu} \\) and \\( {T=\\nabla\\phi} \\) with \\( {\\phi} \\) convex.<\/p>\n<p style=\"text-align: justify;\">The uniqueness of the map \\( {T} \\) must be understood almost everywhere.<\/p>\n<p style=\"text-align: justify;\">The convex function \\( {\\phi} \\) is obviously not unique but its gradient is unique.<\/p>\n<p style=\"text-align: justify;\">When \\( {n=1} \\) then \\( {T=F^{-1}\\circ G} \\) where \\( {F=\\mu((-\\infty,\\bullet])} \\) and \\( {G=\\nu((-\\infty,\\bullet])} \\) are the cumulative distribution functions of \\( {\\mu} \\) and \\( {\\nu} \\). The Brenier theorem states that in arbitrary dimension, it is still possible to pushforward using a multivariate analogue of the notion of non-decreasing function: the gradient of a convex function.<\/p>\n<p style=\"text-align: justify;\"><b>Relation to Wasserstein-Kantorovich coupling distance.<\/b> If \\( {\\mu} \\) and \\( {\\nu} \\) have finite second moment and if \\( {T=\\nabla\\phi} \\) is the Brenier map pushing forward \\( {\\mu} \\) to \\( {\\nu} \\) then<\/p>\n<p style=\"text-align: center;\">\\[ W_2(\\mu,\\nu)^2 =\\min_{\\pi\\in\\Pi(\\mu,\\nu)}\\int\\frac{|x-y|^2}{2}\\pi(\\mathrm{d}x,\\mathrm{d}y) =\\int\\frac{|x-T(x)|^2}{2}\\mathrm{d}\\mu(\\mathrm{d}x). \\]<\/p>\n<p style=\"text-align: justify;\">In other words the optimal coupling is deterministic: \\( {\\pi(\\mathrm{d}x,\\mathrm{d}y)=\\mu(\\mathrm{d}x)\\delta_{T(x)}(\\mathrm{d}y)} \\).<br \/> The transport map \\( {T=\\nabla\\phi} \\) realizes an <em>optimal transport<\/em> of \\( {\\mu} \\) to \\( {\\nu} \\).<br \/> A key here is the Kantorovich-Rubinstein dual formulation of \\( {W_2} \\):<\/p>\n<p style=\"text-align: center;\">\\[ W_2(\\mu,\\nu)^2 =\\sup_{f,g}\\int f\\mathrm{d}\\mu-\\int g\\mathrm{d}\\nu \\]<\/p>\n<p style=\"text-align: justify;\">where the infimum runs over the set of bounded and Lipschitz \\( {f,g:\\mathbb{R}^n\\rightarrow\\mathbb{R}} \\) such that \\( {f(x)\\leq g(y)+\\frac{|x-y|^2}{2}} \\). We can also take the inf-convolution \\( {f(x)=\\inf_{y\\in\\mathbb{R}^n}(g(x)+\\frac{|x-y|^2}{2})} \\).<\/p>\n<p style=\"text-align: justify;\"><b>Reverse Brenier map, Legendre transform, convex duality.<\/b> If \\( {\\nu} \\) is absolutely continuous with respect to the Lebesgue measure then \\( {\\nabla \\phi} \\) is invertible and \\( {(\\nabla \\phi)^{-1}=\\nabla \\phi^*} \\) is the Brenier map between \\( {\\nu} \\) and \\( {\\mu} \\), where<\/p>\n<p style=\"text-align: center;\">\\[ \\phi^* (y) = \\sup_x \\left\\{ \\langle x , y \\rangle - \\phi (x) \\right\\}. \\]<\/p>\n<p style=\"text-align: justify;\">is the Legendre transform of \\( {\\phi} \\) (it is the also the gradient of a convex function).<\/p>\n<p style=\"text-align: justify;\"><b>Regularity of Brenier map.<\/b> The Brenier map is not always continuous. For example if \\( {\\mu} \\) is uniform on \\( {[0,1]} \\) and \\( {\\nu} \\) is uniform on \\( {[0,1\/2] \\cup [3\/2 , 2]} \\) then the Brenier map must be the identity on \\( {[0,1\/2[} \\) and identity plus \\( {1} \\) on \\( {]1\/2,1]} \\).<\/p>\n<p style=\"text-align: justify;\">A correct hypothesis for the regularity of the Brenier map is convexity of the support of the target measure. Indeed, <a href= \"https:\/\/en.wikipedia.org\/wiki\/Luis_Caffarelli\">Luis Caffarelli<\/a> has proved that if \\( {\\mu} \\) and \\( {\\nu} \\) are absolutely continuous, and if their supports \\( {K} \\) and \\( {L} \\) are convex, and if their densities \\( {f,g} \\) are bounded away from \\( {0} \\) and \\( {+\\infty} \\) on \\( {K} \\) and \\( {L} \\) respectively, then the Brenier map \\( {\\nabla \\phi} \\) is an homeomorphism between the interior of \\( {K} \\) and that of \\( {L} \\). Moreover if \\( {f} \\) and \\( {g} \\) are continuous then \\( {\\nabla \\phi} \\) is a \\( {\\mathcal C^1} \\) diffeomorphism.<\/p>\n<p style=\"text-align: justify;\">The regularity theory of transportation of measure is a delicate subject that was explored in the recent years by a bunch of mathematicians including <a href= \"https:\/\/en.wikipedia.org\/wiki\/Alessio_Figalli\">Alessio Figalli<\/a>.<\/p>\n<p style=\"text-align: justify;\"><b>Monge-Amp\u00e8re equation.<\/b> When \\( {\\nabla \\phi} \\) is a \\( {\\mathcal C^1} \\) diffeomorphism, the change of variable formula \\( {y=\\phi(x)} \\) gives, for all test function \\( {h} \\), since \\( {\\mathrm{Jac}\\nabla\\phi=\\nabla^2\\phi} \\) (Hessian),<\/p>\n<p style=\"text-align: center;\">\\[ \\int_L h(y) g(y) \\mathrm{d} y = \\int_{K} h \\left( \\nabla \\phi (x) \\right) g \\left( \\nabla \\phi (x) \\right) \\mathrm{det}( \\nabla^2\\phi (x) ) \\mathrm{d} x . \\]<\/p>\n<p style=\"text-align: justify;\">On the other hand, by definition of the Brenier map<\/p>\n<p style=\"text-align: center;\">\\[ \\int_L h(y) g(y)\\mathrm{d} y = \\int_{\\mathbb{R}^n} h \\mathrm{d}\\nu = \\int_{\\mathbb{R}^n} h \\circ \\nabla \\phi \\mathrm{d}\\mu = \\int_K h \\left( \\nabla \\phi (x) \\right) f(x)\\mathrm{d} x . \\]<\/p>\n<p style=\"text-align: justify;\">Since this is valid for every test function \\( {h} \\) we obtain the following equality <a id=\"eqmongeampere\" id= \"eqmongeampere\"><\/a><\/p>\n<p style=\"text-align: center;\">\\[ g \\left( \\nabla \\phi (x) \\right) \\, \\mathrm{det}( \\nabla^2\\phi (x) ) = f(x) , \\ \\ \\ \\ \\ (1) \\]<\/p>\n<p style=\"text-align: justify;\">for every \\( {x} \\) in the interior of \\( {K} \\). This is called <b>Monge-Amp\u00e8re equation<\/b>. This is an important basic nonlinear equation in mathematics and physics.<\/p>\n<p style=\"text-align: justify;\"><b>From Monge-Amp\u00e8re to Poisson-Langevin.<\/b> When \\( {\\phi(x)=\\frac{1}{2}|x|^2} \\) the Monge-Amp\u00e8re simply reads \\( {g(x)=f(x)} \\). Let us consider a perturbation or linearization around this case by taking \\( {\\phi(x)=\\frac{1}{2}|x|^2+\\varepsilon\\psi(x)+O(\\varepsilon^2)} \\) and \\( {g(x)=(1+\\varepsilon h(x)+O(\\varepsilon^2))f(x)} \\), then, as \\( {\\varepsilon\\rightarrow0} \\), we find the Poisson equation for the Langevin operator:<\/p>\n<p style=\"text-align: center;\">\\[ \\left(-\\Delta-\\frac{\\nabla f}{f}\\cdot\\nabla\\right)\\psi=h. \\]<\/p>\n<p style=\"text-align: justify;\">In other words, this reads \\( {-(\\Delta-\\nabla V\\cdot\\nabla)\\psi=h} \\) if we write \\( {f=\\mathrm{e}^{-V}} \\). In the same spirit, the Wasserstein-Kantorovich distance can be interpreted as an inverse Sobolev norm.<\/p>\n<p style=\"text-align: justify;\"><b>The Caffarelli contraction theorem.<\/b> If \\( {\\mu=\\mathrm{e}^{-V}\\mathrm{d}x} \\) and \\( {\\nu=\\mathrm{e}^{-W}\\mathrm{d}x} \\) are two probability measures on \\( {\\mathbb{R}^n} \\) such that \\( {\\frac{\\alpha}{2}\\left|\\cdot\\right|^2-V} \\) and \\( {W-\\frac{\\beta}{2}\\left|\\cdot\\right|^2} \\) are convex for some constants \\( {\\alpha,\\beta&gt;0} \\), then the Brenier map \\( {T=\\nabla \\phi} \\) pushing forward \\( {\\mu} \\) to \\( {\\nu} \\) satisfies \\( {\\left\\Vert T\\right\\Vert_{\\mathrm{Lip}}\\leq\\sqrt{\\alpha \/ \\beta}} \\).<\/p>\n<p style=\"text-align: justify;\">By taking \\( {V=\\frac{\\alpha}{2}\\left|\\cdot\\right|^2} \\) we obtain that a probability measure which is log-concave with respect to a non trivial Gaussian is a Lipschitz deformation of this Gaussian!<\/p>\n<p style=\"text-align: justify;\"><b>Idea of proof.<\/b> We begin with \\( {n=1} \\). Taking the logarithm in the Monge-Amp\u00e8re equation gives \\( {\\frac{1}{2}\\log(\\varphi''^2)=\\log|\\varphi''|=-V+W(\\varphi')} \\), and taking the derivative twice gives<\/p>\n<p style=\"text-align: center;\">\\[ \\frac{\\varphi''''\\varphi''-\\varphi'''^2}{\\varphi''^2}=-V''+W''(\\varphi')\\varphi''^2+W'(\\varphi')\\varphi'''. \\]<\/p>\n<p style=\"text-align: justify;\">Now if \\( {\\varphi''} \\) has a maximum at \\( {x=x_*} \\) then \\( {\\varphi'''(x_*)=0} \\) and \\( {\\varphi''''(x_*)\\leq0} \\), and thus<\/p>\n<p style=\"text-align: center;\">\\[ 0\\geq-V''(x_*)+W''(\\varphi'(x_*))\\varphi''^2(x_*) \\quad\\text{hence}\\quad \\varphi''^2(x_*)\\leq\\alpha\/\\beta. \\]<\/p>\n<p style=\"text-align: justify;\">This maximum principle argument is attractive but a maximum at the boundary may produce difficulties. Let us follow now the same idea in the case \\( {n\\geq1} \\). Observe first that the Lipschitz constant of \\( {\\nabla \\phi} \\) is the supremum of the operator norm of \\( {\\nabla^2\\phi} \\). So it is enough to prove \\( {\\Vert \\nabla^2\\phi (x) \\Vert_{\\mathrm{op}} \\leq \\sqrt{ \\alpha \/ \\beta }} \\) for every \\( {x} \\). Besides since \\( {\\phi} \\) is convex \\( {\\nabla^2\\phi} \\) is a positive matrix so this amounts to proving that \\( {\\langle \\nabla^2\\phi (x) u, u \\rangle \\leq\\sqrt{\\alpha\/\\beta}} \\) for every unit vector \\( {u} \\) and every \\( {x\\in \\mathbb{R}^n} \\). Now we fix a direction \\( {u} \\) and we assume that the map<\/p>\n<p style=\"text-align: center;\">\\[ \\ell \\colon x\\mapsto \\langle \\nabla^2\\phi (x) u , u \\rangle \\]<\/p>\n<p style=\"text-align: justify;\">attains its maximum for \\( {x=x_*} \\). The logarithm of the Monge-Amp\u00e8re equation gives<\/p>\n<p style=\"text-align: center;\">\\[ \\log \\mathrm{det} \\left( \\nabla^2\\phi (x) \\right) = - V (x) + W \\left( \\nabla \\phi (x) \\right). \\]<\/p>\n<p style=\"text-align: justify;\">Now we differentiate this equation twice in the direction \\( {u} \\). To differentiate the left hand side, observe that if \\( {A} \\) is an invertible matrix<\/p>\n<p style=\"text-align: center;\">\\[ \\begin{array}{rcl} \\log \\mathrm{det} ( A + H ) & =& \\log \\mathrm{det} ( A ) + \\mathrm{tr} ( A^{-1} H ) + o (H)\\\\ (A+H)^{-1} & =& A^{-1} - A^{-1} H A^{-1} + o ( H ). \\end{array} \\]<\/p>\n<p style=\"text-align: justify;\">We obtain (omitting variables) \\begin{multline*} -\\mathrm{tr} \\left( (\\nabla^2\\phi)^{-1} (\\partial_u \\nabla^2\\phi) (\\nabla^2\\phi)^{-1} (\\partial_u \\nabla^2\\phi) \\right) + \\mathrm{tr} \\left( (\\nabla^2\\phi)^{-1} \\partial_{uu} \\nabla^2\\phi \\right)<br \/> = - \\partial_{uu} V + \\sum_i \\partial_i W \\partial_{iuu} \\phi + \\sum_{ij} \\partial_{ij} W (\\partial_{iu} \\phi ) ( \\partial_{ju} \\phi ) . \\end{multline*} We shall use this equation at \\( {x_*} \\). We claim that<\/p>\n<p style=\"text-align: center;\">\\[ \\mathrm{tr} \\left( (\\nabla^2\\phi)^{-1} (\\partial_u \\nabla^2\\phi) (\\nabla^2\\phi)^{-1} (\\partial_u \\nabla^2\\phi) \\right) \\geq 0 . \\]<\/p>\n<p style=\"text-align: justify;\">Indeed, \\( {\\nabla^2\\phi\\geq0} \\) so \\( {(\\nabla^2\\phi)^{-1}\\geq0} \\) and since \\( {\\partial_u \\nabla^2\\phi} \\) is symmetric, we get<\/p>\n<p style=\"text-align: center;\">\\[ (\\partial_u \\nabla^2\\phi) (\\nabla^2\\phi)^{-1} (\\partial_u \\nabla^2\\phi) \\geq 0 . \\]<\/p>\n<p style=\"text-align: justify;\">Now it remains to recall that the product of two positive matrices has positive trace, namely if \\( {A} \\) and \\( {B} \\) are \\( {n\\times n} \\) real symmetric positive semidefinite then<\/p>\n<p style=\"text-align: center;\">\\[ \\mathrm{Tr}(AB)=\\mathrm{Tr}(\\sqrt{A}\\sqrt{B}(\\sqrt{A}\\sqrt{B})^\\top)\\geq0. \\]<\/p>\n<p style=\"text-align: justify;\">Since function \\( {\\ell} \\) attains its maximum at \\( {x_*} \\) we have \\( {\\nabla^2\\ell (x_*)\\leq 0} \\). Therefore<\/p>\n<p style=\"text-align: center;\">\\[ \\mathrm{tr} \\left( (\\nabla^2\\phi)^{-1} \\partial_{uu} \\nabla^2\\phi \\right) = \\mathrm{tr} \\left( (\\nabla^2\\phi)^{-1} \\nabla^2\\ell \\right) \\leq 0 . \\]<\/p>\n<p style=\"text-align: justify;\">In the same way<\/p>\n<p style=\"text-align: center;\">\\[ \\sum_i \\partial_i W \\partial_{iuu} \\phi = \\langle \\nabla W , \\nabla \\ell \\rangle = 0 . \\]<\/p>\n<p style=\"text-align: justify;\">So at point \\( {x_*} \\) the main identity above gives<\/p>\n<p style=\"text-align: center;\">\\[ \\sum_{ij} \\partial_{ij} W (\\partial_{iu} \\phi ) ( \\partial_{ju} \\phi ) \\leq \\partial_{uu} V . \\]<\/p>\n<p style=\"text-align: justify;\">Now the hypothesis made on \\( {V} \\) and \\( {W} \\) give \\( {\\partial_{uu} V \\leq \\alpha} \\) and<\/p>\n<p style=\"text-align: center;\">\\[ \\sum_{ij} \\partial_{ij} W (\\partial_{iu} \\phi ) ( \\partial_{ju} \\phi ) \\geq \\beta \\sum_{i} (\\partial_{iu} \\phi )^2 = \\beta \\vert \\nabla^2\\phi (u) \\vert^2 . \\]<\/p>\n<p style=\"text-align: justify;\">Since \\( {u} \\) has norm \\( {1} \\), we get<\/p>\n<p style=\"text-align: center;\">\\[ \\ell ( x_* ) = \\langle \\nabla^2\\phi (x_*) u , u \\rangle \\leq \\vert \\nabla^2\\phi (x_*) (u) \\vert \\leq \\sqrt{ \\frac \\alpha \\beta } . \\]<\/p>\n<p style=\"text-align: justify;\">Therefore \\( {\\ell ( x ) \\leq \\sqrt{ \\alpha \/ \\beta }} \\) for every \\( {x} \\) which is the desired inequality.<\/p>\n<p style=\"text-align: justify;\"><b>Application to functional inequalities.<\/b> The Poincar\u00e9 inequality for the standard Gaussian measure \\( {\\gamma_n=\\mathcal{N}(0,I_n)=(2\\pi)^{-\\frac{n}{2}}\\mathrm{e}^{-\\frac{|x|^2}{2}}\\mathrm{d}x} \\) on \\( {\\mathbb{R}^n} \\) states that for an arbitrary say \\( {\\mathcal{C}^1} \\) and compactly supported test function \\( {f:\\mathbb{R}^n\\rightarrow\\mathbb{R}} \\),<\/p>\n<p style=\"text-align: center;\">\\[ \\int f^2\\mathrm{d}\\gamma_n-\\left(\\int f\\mathrm{d}\\gamma_n\\right)^2 \\leq\\int|\\nabla f|^2\\mathrm{d}\\gamma_n. \\]<\/p>\n<p style=\"text-align: justify;\">Let \\( {\\mu} \\) be a probability measure on \\( {\\mathbb{R}^n} \\), image of \\( {\\gamma_n} \\) by a \\( {\\mathcal{C}^1} \\) map \\( {T:\\mathbb{R}^n\\rightarrow\\mathbb{R}^n} \\). The Poincar\u00e9 inequality above with \\( {f=g\\circ T} \\) for an arbitrary \\( {g:\\mathbb{R}^n\\rightarrow\\mathbb{R}} \\) gives<\/p>\n<p style=\"text-align: center;\">\\[ \\int g^2\\mathrm{d}\\mu-\\left(\\int g\\mathrm{d}\\mu\\right)^2 \\leq\\left\\Vert T\\right\\Vert_{\\mathrm{Lip}}^2\\int|\\nabla g|^2\\mathrm{d}\\mu. \\]<\/p>\n<p style=\"text-align: justify;\">This is a Poincar\u00e9 inequality for \\( {\\mu} \\), provided that \\( {T} \\) is Lipschitz.<\/p>\n<p style=\"text-align: justify;\">The Caffarelli contraction theorem states that if \\( {\\mu=\\mathrm{e}^{-V}\\mathrm{d}x} \\) with \\( {V-\\frac{\\rho}{2}\\left|\\cdot\\right|^2} \\) convex for some constant \\( {\\rho&gt;0} \\) then the map \\( {T} \\) pushing forward \\( {\\gamma_n} \\) to \\( {\\mu} \\) satisfies \\( {\\left\\Vert T\\right\\Vert_{\\mathrm{Lip}}^2\\leq1\/\\rho} \\), which implies by the argument above that \\( {\\mu} \\) satisfies a Poincar\u00e9 inequality of constant \\( {1\/\\rho} \\). The same argument works for other Sobolev type functional inequalities satisfied by the Gaussian measure, such as the logarithmic Sobolev inequality and the Bobkov isoperimetric functional inequalities. This transportation argument is a striking alternative to the Bakry-\u00c9mery curvature criterion in order to establish functional inequalities, but it does not prove the Gaussian case and does not have the extensibility of the latter to manifolds and abstract Markovian settings.<\/p>\n<p style=\"text-align: justify;\"><b>From Monge-Amp\u00e8re to Gaussian log-Sobolev.<\/b> Let us give a proof of the optimal logarithmic Sobolev inequality for the standard Gaussian measure \\( {\\gamma_n} \\) by using directly the Monge-Amp\u00e8re equation. Let \\( {f:\\mathbb{R}^n\\rightarrow\\mathbb{R}_+} \\) be such that \\( {\\int f\\mathrm{d}\\gamma_n=1} \\). Let \\( {T=\\nabla\\phi} \\) be the Brenier map pushing forward \\( {f\\mathrm{d}\\gamma_n} \\) to \\( {\\gamma_n} \\). We set \\( {\\theta(x):=\\phi(x)-\\frac{1}{2}|x|^2} \\) so that \\( {\\nabla\\phi(x)=x+\\nabla\\theta(x)} \\). We have \\( {\\mathrm{Hess}(\\theta)(x)+I_n\\geq0} \\), and Monge-Amp\u00e8re gives<\/p>\n<p style=\"text-align: center;\">\\[ f(x)\\mathrm{e}^{-\\frac{|x|^2}{2}} =\\det(I_n+\\mathrm{Hess}(\\theta)(x))\\mathrm{e}^{-\\frac{|x+\\nabla\\theta(x)|^2}{2}}. \\]<\/p>\n<p style=\"text-align: justify;\">Taking the logarithm gives<\/p>\n<p style=\"text-align: center;\">\\[ \\begin{array}{rcl} \\log f(x) &=&-\\frac{|x+\\nabla\\theta(x)|^2}{2}+\\frac{|x|^2}{2}+\\log\\det(I_n+\\mathrm{Hess}(\\theta)(x))\\\\ &=&-x\\cdot\\nabla\\theta(x)-\\frac{|\\nabla\\theta(x)|^2}{2}+\\log\\det(I_n+\\mathrm{Hess}(\\theta)(x))\\\\ &\\leq&-x\\cdot\\nabla\\theta(x)-\\frac{|\\nabla\\theta(x)|^2}{2}+\\Delta\\theta(x), \\end{array} \\]<\/p>\n<p style=\"text-align: justify;\">where we have used \\( {\\log(1+t)\\leq t} \\) for \\( {1+t&gt;0} \\) and the eigenvalues of the positive symmetric matrix \\( {I_n+\\mathrm{Hess}(\\theta)(x)} \\). Now integration with respect to \\( {f\\mathrm{d}\\gamma_n} \\) gives<\/p>\n<p style=\"text-align: center;\">\\[ \\int f\\log f\\mathrm{d}\\gamma_n \\leq \\int f(\\Delta\\theta-x\\cdot\\nabla\\theta)\\mathrm{d}\\gamma_n -\\int\\frac{|\\nabla\\theta|^2}{2}f\\mathrm{d}\\gamma_n. \\]<\/p>\n<p style=\"text-align: justify;\">Finally, using integration by parts (\\( {\\Delta\\theta-x\\cdot\\nabla\\theta} \\) is O.-U.!), we get<\/p>\n<p style=\"text-align: center;\">\\[ \\begin{array}{rcl} \\int f\\log f\\mathrm{d}\\gamma_n &\\leq&-\\int\\frac{1}{2}\\Bigr|\\sqrt{f}\\nabla\\theta+\\frac{\\nabla f}{\\sqrt{f}}\\Bigr|^2\\mathrm{d}\\gamma_n +\\frac{1}{2}\\int\\frac{|\\nabla f|^2}{f}\\mathrm{d}\\gamma_n\\\\ &\\leq&\\frac{1}{2}\\int\\frac{|\\nabla f|^2}{f}\\mathrm{d}\\gamma_n. \\end{array} \\]<\/p>\n<p style=\"text-align: justify;\">Recall that \\( {T=\\nabla\\phi=x+\\nabla\\theta} \\) pushes forward \\( {\\nu} \\) to \\( {\\gamma_n} \\), where \\( {\\mathrm{d}\\nu=f\\mathrm{d}\\gamma_n} \\). Therefore<\/p>\n<p style=\"text-align: center;\">\\[ \\int\\frac{|\\nabla\\theta|^2}{2}f\\mathrm{d}\\gamma_n =\\int\\frac{|x-T(x)|^2}{2}\\mathrm{d}\\nu =W_2^2(\\nu,\\gamma_n). \\]<\/p>\n<p style=\"text-align: justify;\">Beyond the log-Sobolev inequality for the Gaussian measure, it is possible to obtain by this way, from the Monge-Amp\u00e8re equation, HWI (H,W,I for entropy, Wasserstein, and Fisher information) functional inequalities for strongly log-concave measures. From this point of view, optimal transportation provides a partial alternative to the Bakry-\u00c9mery criterion on \\( {\\mathbb{R}^n} \\).<\/p>\n<p style=\"text-align: justify;\"><b>Further reading<\/b><\/p>\n<ul>\n<li><a href=\"https:\/\/fr.wikipedia.org\/wiki\/Yann_Brenier\">Yann Brenier<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A0738.46011\">Polar factorization and monotone rearrangement of vector-valued functions<\/a><br \/> Communications on Pure and Applied Mathematics 44, No. 4, 375-417 (1991)<\/li>\n<li><a href= \"https:\/\/en.wikipedia.org\/wiki\/Robert_McCann_(mathematician)\">Robert J. McCann<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A0873.28009\">Existence and uniqueness of monotone measure-preserving maps<\/a><br \/> Duke Mathematical Journal 80, No. 2, 309-323 (1995)<\/li>\n<li><a href=\"https:\/\/en.wikipedia.org\/wiki\/Luis_Caffarelli\">Luis Caffarelli<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A0978.60107\">Monotonicity properties of optimal transportation and the FKG and related inequalities<\/a><br \/> Communications in Mathematics Physics 214, No. 3, 547-563 (2000)<\/li>\n<li><a href= \"https:\/\/en.wikipedia.org\/wiki\/C%C3%A9dric_Villani\">C\u00e9dric Villani<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A1106.90001\">Topics in optimal transportation<\/a><br \/> Graduate Studies in Mathematics 58. American Mathematical Society, xvi, 370 p. (2003)<\/li>\n<li><a href= \"https:\/\/en.wikipedia.org\/wiki\/Dominique_Bakry\">Dominique Bakry<\/a>, <a href= \"https:\/\/genealogy.math.ndsu.nodak.edu\/id.php?id=55943\">Ivan Gentil<\/a>, and <a href= \"https:\/\/en.wikipedia.org\/wiki\/Michel_Ledoux\">Michel Ledoux<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A1376.60002\">Analysis and geometry of Markov diffusion operators<\/a><br \/> Grundlehren der Mathematischen Wissenschaften 348 Springer. xx, 552 p. (2014)<\/li>\n<li><a href= \"https:\/\/www.genealogy.math.ndsu.nodak.edu\/id.php?id=126939\">Dario Cordero-Erausquin<\/a><br \/> <a href=\"https:\/\/zbmath.org\/?q=an%3A01743158\">Some applications of mass transport to Gaussian-type inequalities<\/a><br \/> Arch. Ration. Mech. Anal. 161, No. 3, 257-269 (2002)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>This post is about some aspects of transportation of measure. It is mostly inspired from the lecture notes of an advanced master course prepared few&#8230;<\/p>\n<div class=\"more-link-wrapper\"><a class=\"more-link\" href=\"https:\/\/djalil.chafai.net\/blog\/2022\/03\/19\/few-bits-of-optimal-transportation\/\">Continue reading<span class=\"screen-reader-text\">Few bits of optimal transportation<\/span><\/a><\/div>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"iawp_total_views":1028},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/15112"}],"collection":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/comments?post=15112"}],"version-history":[{"count":87,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/15112\/revisions"}],"predecessor-version":[{"id":15956,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/15112\/revisions\/15956"}],"wp:attachment":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/media?parent=15112"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/categories?post=15112"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/tags?post=15112"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}