<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Libres pensées d&#039;un mathématicien ordinaire</title>
	<atom:link href="http://djalil.chafai.net/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://djalil.chafai.net/blog</link>
	<description>Blog-notes de Djalil Chafaï</description>
	<lastBuildDate>Wed, 22 Feb 2012 07:22:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
		<item>
		<title>From seductive theory to concrete applications</title>
		<link>http://djalil.chafai.net/blog/2012/02/21/from-seductive-theory-to-concrete-applications/</link>
		<comments>http://djalil.chafai.net/blog/2012/02/21/from-seductive-theory-to-concrete-applications/#comments</comments>
		<pubDate>Tue, 21 Feb 2012 18:23:22 +0000</pubDate>
		<dc:creator>Djalil Chafaï</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Computers]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[Probability]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://djalil.chafai.net/blog/?p=4217</guid>
		<description><![CDATA[Many mathematicians believe that in concrete numerical applications such as image, sound, and video compression, wavelets have superseded the more classical Fourier transform. This is not really the case, unfortunately, for various reasons, ranging from non competitive implementations to poor standardizations. A symptomatic example is given by the JPEG image format for lossy compression. Our [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://en.wikipedia.org/wiki/Lenna"><img class="aligncenter size-full wp-image-4218" title="Lossy compressed Lenna - The image file is in JPG!" src="http://djalil.chafai.net/blog/wp-uploads/2012/02/Lena.jpg" alt="Lossy compressed Lenna - The image file is in JPG!" width="538" height="203" /></a></p>
<p style="text-align: justify;">Many mathematicians believe that in concrete numerical applications such as image, sound, and video compression, <a href="http://en.wikipedia.org/wiki/Wavelets">wavelets</a> have superseded the more classical <a href="http://en.wikipedia.org/wiki/Fourier_transform">Fourier transform</a>. This is not really the case, unfortunately, for various reasons, ranging from non competitive implementations to poor standardizations. A symptomatic example is given by the <a href="http://en.wikipedia.org/wiki/JPEG" title="For Joint Photographic Experts Group, who created a commonly used method of lossy compression for digital photography (image).">JPEG</a> image format for <a href="http://en.wikipedia.org/wiki/Lossy_compression">lossy compression</a>. Our .jpg or .jpeg files are based on the following algorithm: the image is partitioned into blocs, and on each bloc, a spatial <a href="http://en.wikipedia.org/wiki/Discrete_cosine_transform">discrete Fourier transform</a> is computed (actually a Discrete Cosine Transform, DCT), a frequency filter is then applied to kill hardly visible frequencies (this is the lossy part), and finally a <a href="http://en.wikipedia.org/wiki/Lossless">lossless</a> algorithm (<a href="http://en.wikipedia.org/wiki/Huffman_encoding" title="Variable length encoding algorithm used for lossless data compression, reaching asymptotically the Shannon entropy bound.">Huffman</a>) is used to compress the result. Regarding sound compression, our .mp3 music files (actually <a href="http://en.wikipedia.org/wiki/Mp3">MPEG-1 MPEG-2 audio layer 3)</a> are based on similar ideas. At the end of the past century, a new JPEG format called <a href="http://en.wikipedia.org/wiki/JPEG2000">JPEG2000</a> was  designed, involving an algorithm in which the discrete Fourier transform is replaced by a <a href="http://en.wikipedia.org/wiki/Discrete_wavelet_transform">discrete wavelet transform</a> and in which the lossless final step is performed with more efficient entropic coding. JPEG2000 is not the sole image format based on wavelets, and one can mention the more specialized <a href="http://en.wikipedia.org/wiki/ECW_%28file_format%29" title="Enhanced Compression Wavelet is a proprietary wavelet compression image format optimized for aerial and satellite imagery.">ECW</a> format for instance. Sadly, JPEG2000 has not  reached the expected success, and very few people are using .jp2|.jpx files.</p>
<p style="text-align: justify;">Regarding wavelets for video compression, the situation is not really better. One may read the <a href="http://x264dev.multimedia.cx/archives/317"><strong>blog post</strong></a> dated 2010-02-26 by Jason Garrett-Glaser, an <a href="http://en.wikipedia.org/wiki/X264">x264</a> developer. Recall that x264 is a free software library used in the <a href="http://en.wikipedia.org/wiki/VideoLAN">VideoLAN</a> projet (<a href="http://en.wikipedia.org/wiki/VLC_media_player">VLC media player</a>!) to encode <a href="http://en.wikipedia.org/wiki/H.264">H.264</a> video streams (MPEG-4 AVC). In particular, he ends his post by saying:</p>
<p style="text-align: justify;">« <em>JPEG2000 is a classic example of wavelet failure: despite having more advanced <a href="http://en.wikipedia.org/wiki/Entropy_coding">entropy coding</a>, being designed much later than JPEG, being much more computationally intensive, and having much better <a href="http://en.wikipedia.org/wiki/PSNR" title="Peak signal-to-noise ratio">PSNR</a>, comparisons have consistently shown it to be visually worse than JPEG at sane filesizes.  By comparison, H.264′s intra coding, when used for still image compression, can beat JPEG by a factor of 2 or more (I’ll make a post on this later).  With the various advancements in DCT intra coding since H.264, I suspect that a state-of-the-art DCT compressor could win by an even larger factor. <strong>Despite the promised benefits of wavelets, a wavelet encoder even close to competitive with x264 has yet to be created.  With some tests even showing <a href="http://en.wikipedia.org/wiki/Dirac_%28video_compression_format%29" title="Lossy video compression format based on wavelets, developed by the BBC.">Dirac</a> losing to <a href="http://en.wikipedia.org/wiki/Theora" title="Lossy video compression format based on Fourier transform, used for Ogg Vorbis.">Theora</a> in visual comparisons, it’s clear that many problems remain to be solved before wavelets can eliminate the ugliness of block-based transforms once and for all.</strong></em> »</p>
<p style="text-align: justify;">This shows the long path from seductive theory to concrete applications. We will see in the forthcoming decades if the wavelets paradigm will beat optimized Fourier transforms. Data compression is a rapidly evolving and quite active domain lying between applied mathematics, computer science, and information technology. Finally, the described phenomenon is quite common in applied mathematics, and is not specific to wavelets. Other instances are for example Model Selection in Statistics and Compressive Sensing in Information Technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://djalil.chafai.net/blog/2012/02/21/from-seductive-theory-to-concrete-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Spectrum of Markov generators of random graphs</title>
		<link>http://djalil.chafai.net/blog/2012/02/13/spectrum-of-markov-generators-of-random-graphs/</link>
		<comments>http://djalil.chafai.net/blog/2012/02/13/spectrum-of-markov-generators-of-random-graphs/#comments</comments>
		<pubDate>Sun, 12 Feb 2012 23:01:07 +0000</pubDate>
		<dc:creator>Djalil Chafaï</dc:creator>
				<category><![CDATA[Algebra]]></category>
		<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[Graphs]]></category>
		<category><![CDATA[LinearAlgebra]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Probability]]></category>

		<guid isPermaLink="false">http://djalil.chafai.net/blog/?p=4152</guid>
		<description><![CDATA[I had the pleasure to upload recently on arXiv and on HAL a collaborative work with Charles Bordenave and Pietro Caputo, entitled Spectrum of Markov generators on sparse random graphs. Let \( {X=(X_{ij})_{1\leq i,j\leq n}} \) be a random matrix in \( {\mathbb{C}} \) whose entries are i.i.d. with mean \( {m} \), covariance matrix [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://djalil.chafai.net/blog/2012/02/13/spectrum-of-markov-generators-of-random-graphs/expo/" rel="attachment wp-att-4163"><img src="http://djalil.chafai.net/blog/wp-uploads/2012/02/expo.jpg" alt="Spectrum of 50 iid copies of random generators in dimension 500 with exponential off diagonal entries" title="Spectrum of 50 iid copies of random generators in dimension 500 with exponential off diagonal entries" width="600" height="163" class="aligncenter size-full wp-image-4163" /></a></p>
<p style="text-align: justify;">I had the pleasure to upload recently on <a href= "http://fr.arxiv.org/abs/1202.0644">arXiv</a> and on <a href= "http://hal.archives-ouvertes.fr/hal-00665907/en">HAL</a> a collaborative work with <a href= "/Scientists/bordenave.charles.html">Charles Bordenave</a> and <a href="/Scientists/caputo.pietro.html">Pietro Caputo</a>, entitled <em>Spectrum of Markov generators on sparse random graphs</em>.</p>
<p style="text-align: justify;">Let \( {X=(X_{ij})_{1\leq i,j\leq n}} \) be a random matrix in \( {\mathbb{C}} \) whose entries are i.i.d. with mean \( {m} \), covariance matrix \( {K=\mathrm{Cov}(\Re X_{11},\Im X_{11})} \), and variance \( {\mathrm{Tr}(K)=1} \). The sparse regime is obtained by allowing the law of \( {X_{11}} \) to depend on \( {n} \) with \( {\mathrm{Tr}(K)\rightarrow0} \) as \( {n\rightarrow\infty} \), the basic example being the adjacency matrix of Erd&ouml;s-R&eacute;nyi random graphs. The analysis of the sparse regime was promoted by Charles Bordenave, who has a pretty Hungarian mathematical soul. However, to simplify the exposition, this blog post restricts to the non sparse regime, which captures most of the rigid algebraic-geometric structure: we thus assume that the law of \( {X_{11}} \) does not depend on \( {n} \). We consider the random matrix defined by</p>
<p style="text-align: center;">\[ L=X-D \]</p>
<p style="text-align: justify;">where \( {D} \) is the diagonal matrix obtained from the row sums of \( {X} \), namely \( {D_{ii}=\sum_{k=1}^nX_{ik}} \). If \( {X} \) is interpreted as the adjacency matrix of a weighted oriented graph, then \( {L} \) is the associated Laplacian matrix, with zero row sums. In particular, if the weights \( {X_{ij}} \) take values in \( {[0,\infty)} \), then \( {L} \) is the infinitesimal generator of the continuous time random walk on that graph, and properties of the spectrum of \( {L} \) can be used to study its long-time behavior. Clearly, \( {L} \) has non independent entries but independent rows. A related model is obtained by considering the stochastic matrix \( {P=D^{-1}X} \), which corresponds to discrete time random walk (considered in <a href= "2010/05/12/circular-law-theorem-for-random-markov-matrices/">arXiv:0808.1502</a>). In order to analyze the spectrum of \( {L} \), it is more convenient to introduce the following affine transformation of \( {L} \):</p>
<p style="text-align: center;">\[ M =\frac{L+nmI}{\sqrt{n}} =\frac{X}{\sqrt{n}}-\frac{D-nmI}{\sqrt{n}}. \]</p>
<p style="text-align: justify;">By the central limit theorem, the distribution of \( {n^{-1/2}(D_{ii}-nm)} \) converges to the Gaussian law \( {\mathcal{N}(0,K)} \). Combined with the circular law for \( {n^{-1/2}X} \), this suggests the interpretation of the spectral distribution of \( {M} \), in the limit \( {n\rightarrow\infty} \), as an additive Gaussian deformation of the circular law. In a sense, our model is a non-Hermitian analogue of a model already studied by <a href="http://arxiv.org/abs/math/0307330">Bryc, Dembo, and Jiang</a> years ago with the method of moments.</p>
<p style="text-align: justify;"><b>Basic notations and concepts.</b> Recall that if \( {A} \) is an \( {n\times n} \) matrix, we denote by \( {\lambda_1(A),\ldots,\lambda_n(A)} \) its eigenvalues, i.e. the roots in \( {\mathbb{C}} \) of its characteristic polynomial. We label them in such a way that \( {|\lambda_1(A)|\geq\cdots\geq|\lambda_n(A)|} \). We denote by \( {s_1(A),\ldots,s_n(A)} \) the singular values of \( {A} \), i.e. the eigenvalues of the Hermitian positive semidefinite matrix \( {| A |= \sqrt{A^*A}} \), labeled so that \( {s_1(A)\geq\cdots \geq s_n(A) \geq 0} \). The <em>operator norm</em> of \( {A} \) is \( {\| A \|= s_1(A)} \) while the <em>spectral radius</em> is \( {|\lambda_1(A)|} \). We define the discrete probability measures</p>
<p style="text-align: center;">\[ \mu_A=\frac{1}{n}\sum_{k=1}^n\delta_{\lambda_k(A)} \quad\text{and}\quad \nu_A=\mu_{|A|} =\frac{1}{n}\sum_{k=1}^n\delta_{s_k(A)}. \]</p>
<p style="text-align: justify;">In the sequel, \( {G} \) is a Gaussian random variable on \( {\mathbb{R}^2\cong\mathbb{C}} \) with law \( {\mathcal{N}(0,K)} \) i.e. mean \( {0} \) and covariance matrix \( {K} \). This law has a Lebesgue density on \( {\mathbb{R}^2} \) if and only if \( {K} \) is invertible, given by \( {z=(x,y)\mapsto(2\pi\sqrt{\det(K)})^{-1}\exp(-\frac{1}{2}&lt;(x,y)^{\top}K^{-1}(x,y)&gt;)} \). Note that \( {K} \) is not invertible when \( {X_{11}} \) is supported in \( {\mathbb{R}} \).</p>
<p style="text-align: justify;"><b>Singular values of shifts.</b> Our first result concerns the singular values of shifts of the matrix \( {M} \), a useful proxy to the eigenvalues. It states that for every \( {z\in\mathbb{C}} \), there exists a probability measure \( {\nu_z} \) on \( {\mathbb{R}_+} \) which depends only on \( {z} \) and \( {K} \) such that with probability one,</p>
<p style="text-align: center;">\[ \nu_{M-zI} \underset{n\rightarrow\infty}{\longrightarrow} \nu_z. \]</p>
<p style="text-align: justify;">Moreover, the limiting law \( {\nu_z} \) is characterized as follows: its symmetrization \( {\check \nu_z} \) is the unique symmetric probability measure on \( {\mathbb{R}} \) with Cauchy-Stieltjes transform satisfying, for every \( {\eta\in\mathbb{C}_+=\{z\in\mathbb{C}:\Im(z)&gt;0\}} \),</p>
<p style="text-align: center;">\[ S_{\check \nu_z}(\eta) =\int_{\mathbb{C}}\!\frac{1}{z-\eta}\,d\check\nu(z) =\mathbb{E}\left(\frac{S_{\check \nu_z}(\eta)+\eta } {|G-z|^2-(\eta+ S_{\check \nu_z}(\eta))^2}\right). \]</p>
<p style="text-align: justify;">It is in fact classical to express the Cauchy-Stieltjes transform of the limiting singular values distribution as a fixed point of a non linear equation, which comes from a recursion on the trace of the resolvent exploiting the recursive structure of the model. The real difficulty in the proof of the result above lies in the fact the entries of \( {L} \) are dependent (but asymptotically independent).</p>
<p style="text-align: justify;"><b>Eigenvalues convergence.</b> The next result concerns the eigenvalues of \( {M} \):</p>
<p style="text-align: center;">\[ \mu_{M} \underset{n\rightarrow\infty}{\longrightarrow} \mu \]</p>
<p style="text-align: justify;">where \( {\mu} \) is the probability measure on \( {\mathbb{C}} \) defined by</p>
<p style="text-align: center;">\[ \mu=\frac{1}{2\pi}\Delta\int_0^\infty\!\log(t)\,d\nu_z(t), \]</p>
<p style="text-align: justify;">where the Laplacian \( {\Delta=\partial_z\partial_{\overline z}=\partial_x^2+\partial_y^2} \) is taken in the sense of Schwartz-Sobolev distributions in the space \( {\mathcal{D}&#8217;(\mathbb{R}^2)} \). The limiting distribution \( {\mu} \) is independent of the mean \( {m} \) of \( {X_{11}} \), and this is rather natural since shifting the entries produces a deterministic rank one perturbation. As in other known circumstances, a rank one additive perturbation produces essentially a single outlier, and therefore does not affect the limiting spectral distribution. Our proof of the convergence to \( {\mu} \) is inspired from the logarithmic potential approach developed by <a href="http://arxiv.org/abs/0807.4898">Tao and Vu</a> for the standard circular law (see also <a href= "http://arxiv.org/abs/1109.3343">arXiv:1109.3343</a>). As usual, the main difficulty lies in the control of the small singular values of shifts \( {M-zI} \), in particular the norm of the resolvent. We solve this difficulty by using essentially the techniques developed by <a href= "http://arxiv.org/abs/math/0703503">Rudelson and Vershynin</a> and others.</p>
<p style="text-align: justify;"><b>Rigid analysis of the limit.</b> To obtain further properties of \( {\mu} \), we turn to a flavor of free probability extended to possibly unbounded operators to interpret \( {n^{-1/2}(D-nmI)} \) as \( {n\rightarrow\infty} \). Following Brown and Haagerup and Schultz, one can define a large operator \( {\star} \)-algebra in which each element \( {a} \) has a Brown spectral measure denoted \( {\mu_a} \), that is the probability measure on \( {\mathbb{C}} \) given by</p>
<p style="text-align: center;">\[ \mu_a=\Delta\int_0^\infty\!\log(s)\,d\nu_{|a-z|}(s) \]</p>
<p style="text-align: justify;">where \( {|b|=\sqrt{bb^*}} \) (i.e. the square root of the self-adjoint operator \( {bb^*} \)). If \( {a} \) is normal (i.e. \( {aa^*=a^*a} \)) then its Brown measure \( {\mu_a} \) coincides with its usual spectral measure. Now let \( {c} \) and \( {g} \) be \( {\star} \)-free operators with \( {c} \) circular, and \( {g} \) normal (i.e. \( {gg^*=g^*g} \)) with spectral measure equal to the Gaussian law \( {\mathcal{N}(0,K)} \). Then we are able to show that</p>
<p style="text-align: center;">\[ \nu_z = \mu_{|c + g - z|} \quad \text{and} \quad \mu = \mu_{c+g}. \]</p>
<p style="text-align: justify;">Having identified the limit law \( {\mu} \), we obtain some additional information on it. Namely, we use the concept of subordination developed by <a href= "http://www.ams.org/mathscinet-getitem?mr=1488333">Biane</a> and <a href= "http://www.ams.org/mathscinet-getitem?mr=1744647">Voiculescu</a>, which allows to show that the support of \( {\mu} \) is given by</p>
<p style="text-align: center;">\[ \mathrm{Supp}(\mu) = \left\{z \in \mathbb{C} : \mathbb{E}\left(\frac{1}{|G-z|^2}\right)\geq 1\right\}. \]</p>
<p style="text-align: justify;">Moreover, there exists a unique function \( {f : \mathrm{Supp} (\mu) \rightarrow [0,1]} \), which is \( {C^\infty} \) in the interior of \( {\mathrm{Supp} (\mu)} \), such that for all \( {z\in \mathrm{Supp} (\mu)} \),</p>
<p style="text-align: center;">\[ \mathbb{E}\left[\frac{1}{|G-z|^2 + f(z)^2}\right]=1. \]</p>
<p style="text-align: justify;">Moreover \( {\mu} \) is absolutely continuous in \( {\mathbb{C}} \) with density given by</p>
<p style="text-align: center;">\[ z\mapsto \frac{1}{\pi} f(z)^2 \mathbb{E}\left[\Phi(G,z)\right] + \frac{1}{\pi} \frac{\left[\mathbb{E}\left[(G-z)\Phi(G,z)\right]\right|^2}{\mathbb{E}\left[\Phi(G,z)\right]} \]</p>
<p style="text-align: justify;">where</p>
<p style="text-align: center;">\[ \Phi(w,z):=\frac{1}{(|w-z|^2 + f(z)^2)^2}. \]</p>
<p style="text-align: justify;">It can be seen that \( {\mu} \) is rotationally invariant when \( {K} \) is a multiple of the identity, while this is not the case if \( {X_{11}} \) is supported in \( {\mathbb{R}} \), in which case \( {K_{22}=K_{12}=K_{21}=0} \) (in this case \( {G} \) does not have a density on \( {\mathbb{C}} \) since \( {K} \) is not invertible). Note also that the support of \( {\mu} \) is unbounded since it contains the support of \( {\mathcal{N}(0,K)} \), and thus \( {\mathrm{Supp}(\mu)=\mathbb{C}} \) if \( {K} \) is invertible. If \( {K} \) is not invertible, it can be checked that the boundary of \( {\mathrm{Supp}(\mu)} \) is</p>
<p style="text-align: center;">\[ \left\{z \in \mathbb{C} : \mathbb{E}\left(\frac{1}{|G-z|^2}\right) = 1\right\} \]</p>
<p style="text-align: justify;">On this set, \( {f(z) = 0} \), but the formula for the density above shows that the density does not vanish there. This phenomenon, not unusual for Brown measures, occurs for the circular law and more generally for \( {R} \)-diagonal operators, see Haagerup and Larsen. Our formula above for the density is slightly more explicit than the formulas given in <a href= "http://arxiv.org/abs/math/9912242">Biane and Lehner</a>. The subordination formula that we use can also be used to compute more general Brown measures of the form \( {\mu_{a + c}} \) with \( {a, c} \) \( {\star} \)-free and \( {c} \) circular.</p>
<p style="text-align: justify;"><b>Spectrum localization.</b> The convergence of \( {\mu_M} \) suggests that the bulk of the spectrum of \( {L} \) is concentrated around the value \( {-mn} \) in a two dimensional window of width \( {\sqrt{n}} \). Actually, it is possible to localize more precisely the support of the spectrum, by controlling the extremal eigenvalues of \( {L} \). Recall that \( {L} \) has always the trivial eigenvalue \( {0} \). We define for convenience the centered matrices \( {\underline X = X-\mathbb{E}X} \) and \( {\underline D=D-\mathbb{E}D} \) and \( {\underline L=L-\mathbb{E}L=\underline X-\underline D} \). If \( {J} \) stands for the \( {n\times n} \) matrix with all entries equal to \( {1} \), then</p>
<p style="text-align: center;">\[ \mathbb{E}L = L-\underline L= mJ - mnI. \]</p>
<p style="text-align: justify;">Now the idea is that if \( {\mathbb{E}(|X_{11}^4|)&lt;\infty} \) then by <a href= "http://www.ams.org/mathscinet-getitem?mr=950344">Bai and Yin theorem</a>, the operator norm of \( {\underline X} \) is \( {\sqrt{n}\,(2+o(1))} \). On the other hand, from the central limit theorem one expects that the operator norm and the spectral radius of the diagonal matrix \( {\underline D} \) are of order \( {\sqrt{2n\log(n)}\,(1+o(1))} \) as for maximum of i.i.d. Gaussian random variables. We show indeed that if \( {X_{11}} \) is supported in \( {\mathbb{R}_+} \) and if \( {\mathbb{E}(|X_{11}|^4)&lt;\infty} \) then with probability one, for \( {n \gg1} \), every eigenvalue \( { \underline \lambda} \) of \( {\underline L} \) satisfies</p>
<p style="text-align: center;">\[ |\Re \underline \lambda| \leq \sqrt{2 n\log(n)}\,(1+o(1)) \quad\text{and}\quad |\Im \underline \lambda| \leq \sqrt{n}(2+o(1)) . \]</p>
<p style="text-align: justify;">Moreover, with probability one, for \( {n \gg 1} \), every eigenvalue \( {\lambda\neq 0} \) of \( {L} \) satisfies</p>
<p style="text-align: center;">\[ |\Re \lambda + m n| \leq \sqrt{2 n \log(n)}\,(1+o(1)) \quad\text{and}\quad |\Im \lambda| \leq \sqrt{n}(2+o(1)). \]</p>
<p style="text-align: justify;">Our proof is simple and relies on classical perturbative methods in matrix analysis: refined <a href= "http://en.wikipedia.org/wiki/Gershgorin_circle_theorem">Gershgorin</a> and the <a href= "http://en.wikipedia.org/wiki/Bauer-Fike_theorem">Bauer-Fike</a> theorems. If one defines a <em>spectral gap</em> \( {\kappa} \) of the Markov generator \( {L} \) as the minimum of \( {|\Re \lambda|} \) for \( {\lambda\neq 0} \) in the spectrum of \( {L} \), then it follows that a.s.</p>
<p style="text-align: center;">\[ \kappa\geq mn - \sqrt{2n\log(n)}\,(1+o(1)). \]</p>
<p style="text-align: justify;"><b>Invariant measure.</b> We turn to the properties of the invariant measure of \( {L} \). If \( {X_{11}} \) is supported in \( {\mathbb{R}_+} \) and if \( {L} \) is irreducible, then from the Perron-Frobenius theorem, the kernel of \( {L} \) has dimension \( {1} \) and there is a unique vector \( {\Pi\in(0,1)^n} \) such that \( {L^\top\Pi = 0} \) and \( {\sum_{i=1}^n\Pi_i =1} \). The vector \( {\Pi} \) is the invariant measure of the Markov process with infinitesimal generator \( {L} \). Actually, we show that a.s. for \( {n \gg 1} \), the Markov generator \( {L} \) is irreducible and</p>
<p style="text-align: center;">\[ \left\Vert\Pi - U_n\right\Vert_1 = \mathcal{O} \left(\sqrt{\frac{\log(n)}{n}}\right)=o(1). \]</p>
<p style="text-align: justify;">where \( {U_n = \frac1n(1 ,\ldots,1)^\top} \) is the uniform probability distribution on the finite set \( {\{1, \ldots , n\}} \) and where \( {\left\Vert\cdot\right\Vert_1} \) is the total variation norm. Our proof relies on the remarkable <a href= "http://en.wikipedia.org/wiki/Sylvester_determinant_theorem">Sylvester determinant theorem</a> which states that if \( {A\in\mathcal{M}_{p,q}} \) and \( {B\in\mathcal{M}_{q,p}} \) are two rectangular matrices with swapped dimensions then</p>
<p style="text-align: center;">\[ \det(I_p+AB)=\det(I_q+BA). \]</p>
<p style="text-align: justify;">To understand it, recall that \( {AB} \) and \( {BA} \) have the same spectrum up to the multiplicity of the eigenvalue zero, and thus, their characteristic polynomials in \( {z} \) are identical up to a multiplication by a power of \( {z} \), which gives the result taking \( {z=1} \). In particular, this formula allows to pass from a high dimensional problem to a one dimensional problem.</p>
<p style="text-align: justify;"><b>Interpolation.</b> Our results for the matrix \( {L=X-D} \) can be extended with minor modifications to the case of the matrix \( {L_{(t)}=X-tD} \), where \( {t\in\mathbb{R}} \) provided the law \( {\mathcal{N}(0,K)} \) characterizing our limiting spectral distributions is replaced by \( {\mathcal{N}(0,t^2K)} \). This gives back the circular law for \( {t=0} \). One may also interpolate between the Gaussian and the circular laws by considering \( {(1-t)X-tD} \) with \( {t\in[0,1]} \) or other parametrizations.</p>
<p style="text-align: justify;"><b>Open problems.</b></p>
<ul>
<li><b>Almost sure convergence.</b> The mode of convergence of spectral distributions is the weak convergence in probability. We believe that this can be upgraded to almost sure weak convergence, but this requires stronger bounds on the smallest singular values of \( {M-zI} \) (i.e. norm of the resolvent). This is not a problem if \( {X_{11}} \) has a bounded density.</li>
<li><b>Sparsity.</b> We are able to show that the results remain essentially available in the sparse case in which the law of \( {X_{11}} \) depends on \( {n} \). However, our treatment is not optimal. Note that an optimal answer in the sparse case is still pending even for the circular law model.</li>
<li><b>Heavy tails.</b> A different model for random Markov generators is obtained when the law of \( {X_{11}} \) has heavy tails, with e.g. infinite first moment. In this context, we refer to <a href= "2010/06/12/spectrum-of-non-hermitian-heavy-tailed-random-matrices/"> arXiv:1006.1713</a> and to <a href= "2011/09/16/around-the-circular-law/">arXiv:1109.3343</a> for the spectral analysis of non-Hermitian matrices with i.i.d. entries, and to <a href="http://arxiv.org/abs/0903.3528">arXiv:0903.3528</a> for the case of reversible Markov transition matrices. It is natural to expect that, in contrast with the cases considered here, there is no asymptotic independence of the matrices \( {X} \) and \( {D} \) in the heavy tailed case.</li>
<li><b>Spectral edge and spectral gap.</b> Concerning the localization of the spectrum, it seems natural to conjecture the asymptotic behavior \( {\kappa= mn &#8211; \sqrt{2n\log(n)}\,(1+o(1))} \) for the spectral gap, but we do not have a proof of the corresponding upper bound. In the same spirit, we believe that with probability one, with \( {\underline L=L-\mathbb{E}L} \),
<p style="text-align: center;">\[ \lim_{n\rightarrow\infty} \frac{s_1(\underline L)}{\sqrt{2 n\log(n)}} =\lim_{n\rightarrow\infty}\frac{|\lambda_1(\underline L)|}{\sqrt{2 n\log(n)}}=1, \]</p>
<p> which contrasts with the behavior of \( {\underline X} \) for which \( {s_1/|\lambda_1|\rightarrow2} \) as \( {n\rightarrow\infty} \).</li>
</ul>
<p><a href="http://djalil.chafai.net/blog/2012/02/13/spectrum-of-markov-generators-of-random-graphs/expo3d/" rel="attachment wp-att-4166"><img src="http://djalil.chafai.net/blog/wp-uploads/2012/02/expo3d.jpg" alt="Kernel estimator for the data used in the upper graphic" title="Kernel estimator for the data used in the upper graphic" width="600" height="451" class="aligncenter size-full wp-image-4166" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://djalil.chafai.net/blog/2012/02/13/spectrum-of-markov-generators-of-random-graphs/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Publications: science, money, and human comedy</title>
		<link>http://djalil.chafai.net/blog/2012/02/01/publications-science-money-and-human-comedy/</link>
		<comments>http://djalil.chafai.net/blog/2012/02/01/publications-science-money-and-human-comedy/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 23:00:59 +0000</pubDate>
		<dc:creator>Djalil Chafaï</dc:creator>
				<category><![CDATA[Mathematicians]]></category>

		<guid isPermaLink="false">http://djalil.chafai.net/blog/?p=3965</guid>
		<description><![CDATA[In principle, a mathematician can communicate to the others his own work using solely public free access Internet academic sites (either personal or global). A famous example is the work of the Fields Medalist Grisha Perelman on Thurston&#8217;s geometrization conjecture published on arXiv.org. Even if arXiv.org is moderated, it does not provide a peer reviewing [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_3966" class="wp-caption aligncenter" style="width: 260px"><a rel="attachment wp-att-3966" href="http://djalil.chafai.net/blog/2012/02/01/publications-science-money-and-human-comedy/euleragnesi/"><img class="size-full wp-image-3966" title="Euler, on a Swiss bank note" src="http://djalil.chafai.net/blog/wp-uploads/2012/01/euleragnesi.jpg" alt="" width="250" height="166" /></a><p class="wp-caption-text">Euler, on a Swiss bank note</p></div>
<p style="text-align: justify;">In principle, a mathematician can communicate to the others his own work using solely public free access Internet academic sites (either personal or global). A famous example is the work of the Fields Medalist <a href="http://en.wikipedia.org/wiki/Grigori_Perelman">Grisha Perelman</a> on <a href="http://en.wikipedia.org/wiki/Geometrization_conjecture">Thurston&#8217;s geometrization conjecture</a> published on <a href="http://en.wikipedia.org/wiki/Arxiv.org">arXiv.org</a>. Even if arXiv.org is moderated, it does not provide a peer reviewing process. For historical and sociological reasons, most professional mathematicians publish their results in peer reviewed mathematical journals. Many of these journals are handled by capitalistic companies such as Elsevier, Springer, Taylor and Francis, etc. The aim of these companies is to make profit, and some of them are not willing to make knowledge freely accessible. Basically, the mathematical publications rely on three pillars: science, money, and human comedy:</p>
<ul>
<li style="text-align: justify;"><strong>Science.</strong> Mathematicians use peer reviewing in order to improve science quality and to reduce pollution. Peer reviewing is the first step in the long transformation of live mathematics into &#8220;dead&#8221; mathematics which can be considered as true and certified. This does not mean that peer reviewing is good for everything. Mathematicians need also media and space to discuss ideas. Internet has good tools for such live interactions, such as <a href="http://en.wikipedia.org/wiki/Mathoverflow">MathOverflow</a>, a sort of modern version of the venerable sci.math <a href="http://en.wikipedia.org/wiki/Usenet">usenet</a> newsgroup. Actually, MathOverflow implements some kind of peer rating, while keeping freedom.</li>
<li style="text-align: justify;"><strong>Money.</strong> Everything has a cost. Non free journals are actually paied by readers via their institution. Freely accessible journals or sites have also a cost, but the circuit of money is different. For instance, the cost of arXiv.org is supported mostly by North American and European Universities and Research Centers. Other circuits of money are possible. For instance, some journals are freely accessible but the authors have to pay at publication time (some private companies such as Springer propose this scheme to the authors of non free journals in order to make their papers freely accessible).  Many prestigious journals are handled by capitalistic companies, asking for  high prices to make profit. On the other hand, being handled by an academic institution does not guarantee a low price.</li>
<li style="text-align: justify;"><strong>Human comedy.</strong> Mathematicians are first of all humans, members of an academic community. The academic community needs a way to measure the merits of each individual in order to decide recognition (and thus promotion and funding). In general,  recognition is gained by obtaining good or influential results, published in respected journals. Publishing in respected journals can be seen as a way to obtain a label of quality, a sort of delegation of evaluation. The respect or prestige of a journal comes from its history, its editorial board, and its content. The editorial and peer reviewing job of most peer reviewed mathematical  journals (free or  not) is done for free by mathematicians (they gain social recognition).</li>
</ul>
<p style="text-align: justify;">Many aspects are not specific to mathematics. Ideally, one can imagine the <b>introduction of editorial labels in arXiv.org</b>, attributed by editorial boards to papers at author request and after peer reviewing (these boards may leave completely capitalistic publishers). One can also imagine to <b>make arXiv.org less static</b> by adding on top of it a dynamic collaborative <a href="http://en.wikipedia.org/wiki/Stack_Exchange_Network">stack-exchange</a> structure such as MathOverflow, allowing the discussion of each paper. These ideas are floating around for a while. Unfortunately, discussing these matters with many colleagues and with the Editors of certain leading journals reveals that the mathematical community is a bit conservative. However, it is remarkable that some few scientific leaders such as the Fields Medalist <a href="http://en.wikipedia.org/wiki/Timothy_Gowers">Tim Gowers</a> are promoting such a revolution.</p>
<p><strong>Some related reading:</strong></p>
<ul>
<li><a href="http://gowers.wordpress.com/2012/01/21/elsevier-my-part-in-its-downfall/">Elsevier — my part in its downfall</a> by Tim Gowers</li>
<li><a href="http://terrytao.wordpress.com/2012/01/26/the-cost-of-knowledge/">The Cost of Knowledge</a> by Terry Tao</li>
<li><a href="http://gowers.wordpress.com/2012/01/29/whats-wrong-with-electronic-journals/">What&#8217;s wrong with electronic journals?</a> by Tim Gowers</li>
<li><a href="http://nuit-blanche.blogspot.com/2011/11/tim-gowers-model-of-mathematical.html">Tim Gowers&#8217; Model of Mathematical Publishing</a> by NuitBlanche</li>
<li><a href="/blog/2010/04/28/science-fiction">Science fiction ?</a> (in French) by the author of this blog</li>
<li><a href="http://people.ccmr.cornell.edu/~ginsparg/blurb/pg02pr.html">Can Peer Review be better Focused?</a> by Paul Ginsparg</li>
</ul>
<p>&nbsp;</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=XP9cfQx2OZY">http://www.youtube.com/watch?v=XP9cfQx2OZY</a></p></p>
]]></content:encoded>
			<wfw:commentRss>http://djalil.chafai.net/blog/2012/02/01/publications-science-money-and-human-comedy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Diff-icile de faire plus simple</title>
		<link>http://djalil.chafai.net/blog/2012/01/27/3929/</link>
		<comments>http://djalil.chafai.net/blog/2012/01/27/3929/#comments</comments>
		<pubDate>Fri, 27 Jan 2012 19:57:06 +0000</pubDate>
		<dc:creator>Djalil Chafaï</dc:creator>
				<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[Computers]]></category>
		<category><![CDATA[IT]]></category>

		<guid isPermaLink="false">http://djalil.chafai.net/blog/?p=3929</guid>
		<description><![CDATA[Diff is a standard utility from the Unix world outputting the differences between two files. Even if diff can handle binary files, it is used mostly to compare two versions of a text file. For mathematicians, diff is helpful for the comparison of $\LaTeX$ files written with co-authors, in particular via its graphical user interface [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;"><a href="http://sourceforge.net/projects/tkdiff/"><img class="aligncenter size-medium wp-image-3930" title="tkdiff" src="http://djalil.chafai.net/blog/wp-uploads/2012/01/tkdiff-300x125.jpg" alt="tkdiff" width="300" height="125" /></a></p>
<p style="text-align: justify;"><strong><a href="http://en.wikipedia.org/wiki/Diff">Diff</a></strong> is a standard utility from the Unix world outputting the differences between two files. Even if diff can handle binary files, it is used mostly to compare two versions of a text file. For mathematicians, diff is helpful for the comparison of <a href="http://en.wikipedia.org/wiki/LaTeX">$\LaTeX$</a> files written with co-authors, in particular via its graphical user interface <strong><a href="http://sourceforge.net/projects/tkdiff/">tkdiff</a></strong> written in <a href="http://en.wikipedia.org/wiki/Tcl">Tcl</a>/<a href="http://en.wikipedia.org/wiki/Tk_%28framework%29">Tk</a>. The is also <strong><a href="http://www.qtrac.eu/diffpdf.html">diffpdf</a></strong>, which is very convenient to compare two versions of a PDF file. There is another well known utility called <strong><a href="http://en.wikipedia.org/wiki/Patch_%28Unix%29">patch</a></strong>, a sort of reversed diff, used millions of times a day for software development, worldwide. On the technical side, diff relies on an <a href="http://en.wikipedia.org/wiki/Diff#Algorithm">algorithm</a> for solving the <strong><a href="http://en.wikipedia.org/wiki/Longest_common_subsequence_problem">longest common subsequence problem</a></strong>. By the way, if you are curious about text algorithms, you may take a look at the (free!) books written by <a href="http://igm.univ-mlv.fr/~mac/">Maxime Crochemore</a>,  from <a href="http://en.wikipedia.org/wiki/Marne-la-Vall%C3%A9e">Marne-la-Vallée</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://djalil.chafai.net/blog/2012/01/27/3929/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

