{"id":5245,"date":"2012-11-21T19:42:19","date_gmt":"2012-11-21T18:42:19","guid":{"rendered":"http:\/\/djalil.chafai.net\/blog\/?p=5245"},"modified":"2023-02-12T16:03:16","modified_gmt":"2023-02-12T15:03:16","slug":"about-wigner-theorem","status":"publish","type":"post","link":"https:\/\/djalil.chafai.net\/blog\/2012\/11\/21\/about-wigner-theorem\/","title":{"rendered":"About the Wigner theorem"},"content":{"rendered":"<p style=\"text-align: center;\"><a href=\"http:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2011\/05\/matrix.jpg\" rel=\"attachment wp-att-1822\"><img loading=\"lazy\" class=\"aligncenter wp-image-1822\" title=\"Matrix\" src=\"\/blog\/wp-content\/uploads\/2011\/05\/matrix.jpg\" alt=\"Matrix\" width=\"370\" height=\"285\" srcset=\"https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2011\/05\/matrix.jpg 463w, https:\/\/djalil.chafai.net\/blog\/wp-content\/uploads\/2011\/05\/matrix-300x230.jpg 300w\" sizes=\"(max-width: 370px) 100vw, 370px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\">Recall that a random \\( {n\\times n} \\) Hermitian matrix \\( {M} \\) belongs to the Gaussian Unitary Ensemble (GUE) when it admits a density proportional to<\/p>\n<p style=\"text-align: center;\">\\[ M\\mapsto \\exp\\left(-\\frac{n}{2}\\mathrm{Tr}(M^2)\\right). \\]<\/p>\n<p style=\"text-align: justify;\">If \\( {\\lambda_1,\\ldots,\\lambda_n} \\) are the eigenvalues of \\( {M} \\) then the Wigner theorem gives<\/p>\n<p style=\"text-align: center;\">\\[ \\mathbb{E}\\left(\\frac{\\mathrm{Card}\\{1\\leq i\\leq n:\\lambda_i\\in I\\}}{n}\\right) \\underset{n\\rightarrow\\infty}{\\longrightarrow} \\mu_{\\mathrm{SC}}(I) \\]<\/p>\n<p style=\"text-align: justify;\">for every interval \\( {I\\subset\\mathbb{R}} \\), where \\( {\\mu_{\\mathrm{SC}}} \\) is the semicircle distribution with density \\( {\\frac{1}{2\\pi}\\sqrt{4-x^2}\\mathbf{1}_{[-2,2]}(x)} \\). In other words, the averaged empirical spectral distribution<\/p>\n<p style=\"text-align: center;\">\\[ \\mu_n :=\\mathbb{E}\\left(\\frac{1}{n}\\sum_{i=1}^n\\delta_{\\lambda_i}\\right) \\]<\/p>\n<p style=\"text-align: justify;\">tends weakly to \\( {\\mu_{\\mathrm{SC}}} \\) as \\( {n\\rightarrow\\infty} \\). There are many proofs of this result. We present in this post a simple analytic proof due to Pastur.<\/p>\n<p style=\"text-align: justify;\"><b>Cauchy-Stieltjes transform.<\/b> Set \\( {\\mathbb{C}_+=\\{z\\in\\mathbb{C}:\\mathfrak{I}z&gt;0\\}} \\). The Cauchy-Stieltjes transform \\( {s_\\mu:\\mathbb{C}_+\\rightarrow\\mathbb{C}} \\) of a probability measure \\( {\\mu} \\) on \\( {\\mathbb{R}} \\) is given by<\/p>\n<p style=\"text-align: center;\">\\[ s_\\mu(z):=\\int\\!\\frac{1}{\\lambda-z}\\,d\\mu(\\lambda). \\]<\/p>\n<p style=\"text-align: justify;\">The knowledge of \\( {s_\\mu} \\) on \\( {\\mathbb{C}_+} \\) (and even on an small ball of \\( {\\mathbb{C}_+} \\) by analyticity) is enough to characterize \\( {\\mu} \\). We always have the universal bound for any \\( {z\\in\\mathbb{C}_+} \\):<\/p>\n<p style=\"text-align: center;\">\\[ |s_\\mu(z)|\\leq\\frac{1}{\\mathfrak{I}z}. \\]<\/p>\n<p style=\"text-align: justify;\">Furthermore, if \\( {{(\\mu_n)}} \\) is a sequence of probability measures on \\( {\\mathbb{R}} \\) such that \\( {s_{\\mu_n}(z)\\rightarrow s(z)} \\) as \\( {n\\rightarrow\\infty} \\) for every \\( {z\\in\\mathbb{C}_+} \\) then there exists a unique probability measure \\( {\\mu} \\) on \\( {\\mathbb{R}} \\) such that \\( {s_\\mu=s} \\) and such that \\( {\\mu_n\\rightarrow\\mu} \\) weakly as \\( {n\\rightarrow\\infty} \\).<\/p>\n<p style=\"text-align: justify;\">Our problem reduces to show that \\( {s_{\\mu_n}(z)\\rightarrow s_{\\mu_{\\mathrm{SC}}}(z)} \\) for every \\( {z\\in\\mathbb{C}_+} \\).<\/p>\n<p style=\"text-align: justify;\"><b>Resolvent.<\/b> The resolvent \\( {G} \\) of a Hermitian matrix \\( {H} \\) at point \\( {z\\in\\mathbb{C}_+} \\) is<\/p>\n<p style=\"text-align: center;\">\\[ G(z)=(H-zI)^{-1}. \\]<\/p>\n<p style=\"text-align: justify;\">The map \\( {z\\mapsto G(z)} \\) is holomorphic on \\( {\\mathbb{C}_+} \\) and<\/p>\n<p style=\"text-align: center;\">\\[ \\left\\Vert G(z)\\right\\Vert_{2\\rightarrow2}\\leq \\frac{1}{\\mathfrak{I}z}. \\]<\/p>\n<p style=\"text-align: justify;\">Denoting \\( {\\tau:=\\frac{1}{n}\\mathrm{Tr}} \\) and \\( {\\mu:=\\frac{1}{n}\\sum_{k=1}^n\\delta_{\\lambda_k(H)}} \\), we have<\/p>\n<p style=\"text-align: center;\">\\[ s_\\mu(z)=\\tau(G(z)). \\]<\/p>\n<p style=\"text-align: justify;\">The <em>resolvent identity<\/em> states that if \\( {G_1} \\) and \\( {G_2} \\) are the resolvent at point \\( {z} \\) of two Hermitian matrices \\( {H_1} \\) and \\( {H_2} \\) then<\/p>\n<p style=\"text-align: center;\">\\[ G_2-G_1=-G_1(H_2-H_1)G_2. \\]<\/p>\n<p style=\"text-align: justify;\">Used for \\( {H_1=M} \\) and \\( {H_2=0} \\), this gives, taking expectations and multiplying both sides by \\( {z} \\), and denoting \\( {G} \\) the resolvent at point \\( {z} \\) of \\( {M} \\),<\/p>\n<p style=\"text-align: center;\">\\[ -I-z\\mathbb{E}(G)=-\\mathbb{E}(GM). \\]<\/p>\n<p style=\"text-align: justify;\">From this identity, we will obtain an equation satisfied by \\( {s_{\\mu_n}=\\mathbb{E}(\\tau(M))} \\).<\/p>\n<p style=\"text-align: justify;\"><b>Integration by parts.<\/b> For a random vector \\( {X} \\) with density \\( {e^{-V}} \\) on \\( {\\mathbb{R}^d} \\) and a smooth \\( {F:\\mathbb{R}^d\\rightarrow\\mathbb{C}} \\), an integration by parts gives<\/p>\n<p style=\"text-align: center;\">\\[ \\mathbb{E}(F(X)\\nabla V(X))=\\mathbb{E}(\\nabla F(X)) \\]<\/p>\n<p style=\"text-align: justify;\">as soon as \\( {e^{-V(x)}F(x)\\rightarrow0} \\) as \\( {\\left\\Vert x\\right\\Vert\\rightarrow\\infty} \\). In particular, for every \\( {1\\leq p,q\\leq n} \\), by denoting \\( {\\partial_{M_{pq}}:=\\frac{1}{2}(\\partial_{\\Re M_{pq}}-i\\partial_{\\mathfrak{I} M_{pq}})} \\) if \\( {p\\neq q} \\),<\/p>\n<p style=\"text-align: center;\">\\[ \\mathbb{E}(F(M)M_{pq}) =\\frac{1}{n}\\mathbb{E}(\\partial_{M_{qp}}F(M)). \\]<\/p>\n<p style=\"text-align: justify;\">Let us examine the case \\( {F(M)=G_{ab}} \\) with \\( {1\\leq a,b\\leq n} \\). The resolvent identity used twice gives<\/p>\n<p style=\"text-align: center;\">\\[ \\begin{array}{rcl} G_2-G_1 &amp;=&amp;-G_1(M_2-M_1)G_2 \\\\ &amp;=&amp;-G_1(M_2-M_1)(G_1-G_1(M_2-M_1)G_2)\\\\ &amp;=&amp;-G_1(M_2-M_1)G_1+o(\\left\\Vert M_2-M_1\\right\\Vert) \\end{array} \\]<\/p>\n<p style=\"text-align: justify;\">which gives in particular that for every \\( {1\\leq a,b,p,q\\leq n} \\),<\/p>\n<p style=\"text-align: center;\">\\[ \\partial_{M_{pq}}G_{ab}=-G_{ap}G_{qb} \\]<\/p>\n<p style=\"text-align: justify;\">As a consequence, we get, using the integration by parts, for every \\( {1\\leq p\\leq n} \\),<\/p>\n<p style=\"text-align: center;\">\\[ \\mathbb{E}(GM)_{pp} =\\sum_{q=1}^n\\mathbb{E}(G_{pq}M_{qp}) =\\frac{1}{n}\\sum_{q=1}^n\\mathbb{E}(\\partial_{pq}G_{pq}) =-\\frac{1}{n}\\sum_{q=1}^n\\mathbb{E}(G_{pp}G_{qq}) =-\\mathbb{E}(G_{pp}\\tau(G)). \\]<\/p>\n<p style=\"text-align: justify;\">Summing up, we obtain, after taking normalized traces,<\/p>\n<p style=\"text-align: center;\">\\[ -1-zs_{\\mu_n}(z)=\\mathbb{E}(\\tau(G)^2). \\]<\/p>\n<p style=\"text-align: justify;\"><b>Poincar\u00e9 inequality.<\/b> To compare \\( {\\mathbb{E}(\\tau(G)^2)} \\) in the right hand side with \\( {\\mathbb{E}(\\tau(G))^2=s_{\\mu_n}(z)^2} \\), we may use <a href=\"\/blog\/2010\/06\/10\/concentration-for-empirical-spectral-distributions\/\"> concentration<\/a>, or alternatively and following Pastur, a Poincar\u00e9 inequality. Namely, we set \\( {f(M):=\\tau(G)} \\) and we write<\/p>\n<p style=\"text-align: center;\">\\[ \\left|\\mathbb{E}(\\tau(G)^2)-(\\mathbb{E}\\tau(G))^2\\right| = \\left|\\mathbb{E}((\\tau(G)-\\mathbb{E}\\tau(G))^2)\\right| \\leq \\mathbb{E}(\\left|\\tau(G)-\\mathbb{E}\\tau(G)\\right|^2) =\\mathrm{Var}(f(M)). \\]<\/p>\n<p style=\"text-align: justify;\">Now \\( {M} \\) is Gaussian with covariance matrix \\( {C} \\) such that \\( {\\left\\Vert C\\right\\Vert_{2\\rightarrow2}\\leq \\mathcal{O}(n^{-1})} \\). Thus, the Poincar\u00e9 inequality for the law of \\( {M} \\) and for function \\( {f} \\) gives<\/p>\n<p style=\"text-align: center;\">\\[ \\mathrm{Var}(f(M)) \\leq \\mathcal{O}(n^{-1})\\mathbb{E}(\\left\\Vert \\nabla f(M)\\right\\Vert_2^2). \\]<\/p>\n<p style=\"text-align: justify;\">But<\/p>\n<p style=\"text-align: center;\">\\[ \\partial_{M_{pq}}\\tau(G) =\\frac{1}{n}\\sum_a\\partial_{M_{pq}}G_{aa} =-\\frac{1}{n}\\sum_aG_{ap}G_{qa} =-\\frac{1}{n}G^2_{qp} \\]<\/p>\n<p style=\"text-align: justify;\">and therefore<\/p>\n<p style=\"text-align: center;\">\\[ \\left\\Vert \\nabla f\\right\\Vert_2^2 =\\frac{1}{n^2}\\sum_{p,q}\\left|\\partial_{M_{pq}}\\tau(G)\\right|^2 =\\frac{1}{n^2}\\left\\Vert G^2\\right\\Vert_{\\mathrm{HS}}^2 \\leq \\frac{1}{n}\\left\\Vert G^2\\right\\Vert_{2\\rightarrow2}^2 \\leq \\frac{1}{n}\\left\\Vert G\\right\\Vert_{2\\rightarrow2}^4 \\leq \\frac{1}{n(\\mathfrak{I} z)^4}. \\]<\/p>\n<p style=\"text-align: justify;\">Putting all together, this gives finally<\/p>\n<p style=\"text-align: center;\">\\[ \\left|s_{\\mu_n}(z)^2+zs_{\\mu_n}(z)+1\\right| = \\mathcal{O}\\left(\\frac{1}{n^2(\\mathfrak{I} z)^4}\\right) \\underset{n\\rightarrow\\infty}{\\longrightarrow}0. \\]<\/p>\n<p style=\"text-align: justify;\"><b>Conclusion.<\/b> The equation \\( {s^2+zs+1=0} \\) in \\( {s} \\) has two roots \\( {s_\\pm(z):=\\frac{1}{2}(-z\\pm\\sqrt{z^2-4})} \\). We observe that \\( {\\mathfrak{I}s_{\\mu}(z)\\mathfrak{I}z&gt;0} \\). This gives that \\( {\\lim_{n\\rightarrow\\infty}s_n(z)=\\frac{1}{2}(-z+\\sqrt{z^2-4})} \\) for every \\( {z\\in\\mathbb{C}_+} \\), which is the Cauchy-Stieltjes transform of \\( {\\mu_{\\mathrm{SC}}} \\). We conclude that \\( {\\mu_n} \\) converges weakly as \\( {n\\rightarrow\\infty} \\) to \\( {\\mu_{\\mathrm{SC}}} \\). One may drop the expectation in \\( {\\mu_n} \\) and obtain an almost sure weak convergence by using <a href=\"\/blog\/2010\/06\/10\/concentration-for-empirical-spectral-distributions\/\"> concentration<\/a> and the first Borel-Cantelli lemma.<\/p>\n<p style=\"text-align: justify;\"><b>Further reading.<\/b> This method is quite powerful and can be extended to many models of random matrices, including Wigner matrices (not necessarily Gaussian), as explained in the <a href=\"http:\/\/www.ams.org\/mathscinet-getitem?mr=2808038\">book by Pastur and Shcherbina<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Recall that a random \\( {n\\times n} \\) Hermitian matrix \\( {M} \\) belongs to the Gaussian Unitary Ensemble (GUE) when it admits a density&#8230;<\/p>\n<div class=\"more-link-wrapper\"><a class=\"more-link\" href=\"https:\/\/djalil.chafai.net\/blog\/2012\/11\/21\/about-wigner-theorem\/\">Continue reading<span class=\"screen-reader-text\">About the Wigner theorem<\/span><\/a><\/div>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"iawp_total_views":159},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/5245"}],"collection":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/comments?post=5245"}],"version-history":[{"count":19,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/5245\/revisions"}],"predecessor-version":[{"id":16934,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/posts\/5245\/revisions\/16934"}],"wp:attachment":[{"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/media?parent=5245"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/categories?post=5245"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/djalil.chafai.net\/blog\/wp-json\/wp\/v2\/tags?post=5245"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}