
This post is about the work arXiv:2012.05602 in collaboration with Charles Bordenave and David García-Zelada on the spectral radius. Clearly this is my favorite work so far!
Consider a square random matrix with independent and identically distributed entries of mean zero and unit variance. We show that as the dimension tends to infinity, the spectral radius is equivalent to the square root of the dimension in probability. This result can also be seen as the convergence of the support in the circular law theorem under optimal moment conditions. In the proof we establish the convergence in law of the reciprocal characteristic polynomial to a random analytic function outside the unit disc, related to a hyperbolic Gaussian analytic function. The proof is short and differs from the usual approaches for the spectral radius. It relies on a tightness argument and a joint central limit phenomenon for traces of fixed powers.
Model. Let {ajk}j,k≥1 be independent and identically distributed complex random variables with mean zero and unit variance, namely E[a11]=0andE[|a11|2]=1. For all n≥1, let us consider the Girko random matrix An=(ajk)1≤j,k≤n. When a11 is Gaussian with independent and identically distributed real and imaginary parts then An belongs to the complex Ginibre ensemble. We are interested in the matrix 1√nAn.By the law of large numbers, almost surely, as n→∞, the rows (and the columns) have unit Euclidean norm and are orthogonal. Its characteristic polynomial at point z∈C is pn(z)=det(z−An√n) where z stands for z times identity matrix. The n roots of pn in C are the eigenvalues of 1√nAn. They form a multiset Λn which is the spectrum of 1√nAn. The spectral radius is ρn=maxλ∈Λn|λ|. Following Ginibre, Girko, Bai, ..., and finally Tao and Vu, the circular law (of nature) states that the empirical measure of the elements of Λn tends weakly as n→∞ to the uniform distribution on the closed unit disc: almost surely, for every nice Borel set B⊂C, limn→∞card(B∩Λn)n=area(B∩¯D)π, where area'' stands for the Lebesgue measure on C, and where ¯D={z∈C:|z|≤1} is the closed unit disc. The circular law, which involves weak convergence, does not provide the convergence of the spectral radius, it gives only lim_n→∞ρn≥1.
Our result on the spectral radius. We have ρnP⟶n→∞1, in the sense that for all ε>0, limn→∞P(|ρn−1|≥ε)=0. The moments assumptions of zero mean and unit variance are optimal. Moreover the 1√n scaling is no longer adequate for entries of infinite variance.
We have ρn≤σn where σn is the operator norm of 1√nAn (largest singular value). Bai and al have proved that the condition E[|a11|4]<∞ is necessary and sufficient for the convergence of σn as n→∞ and that the limit is then 2. A striking aspect of the spectral radius, in comparison with the operator norm, is that it converges without such an extra condition.
Proof. It does not involve any Hermitization or norms of powers in the spirit of Gelfand's spectral radius formula. The idea is to show that on C∪{∞}∖¯D, the function z↦z−npn(z) tends as n→∞ to a random analytic function which does not vanish. The first step for mathematical convenience is to convert C∪{∞}∖¯D into D={z∈C:|z|<1} by noting that pn(z)=znqn(1/z), z∉¯D, where for all z∈D,
qn(z)=det(1−zAn√n)is the reciprocal polynomial of the characteristic polynomial pn. Let H(D) be the set of holomorphic or complex analytic functions on D, equipped with the topology of uniform convergence on compact subsets, the compact-open topology, studied by Shirai. This allows to see qn as a random variable on H(D) and gives a meaning to convergence in law of qn as n→∞, namely, qn converges in law to some random element q of H(D) if for every bounded real continuous function f on H(D), E[f(qn)]→E[f(q)]. Namely, we prove that qnlaw→n→∞κe−F, where F is the random holomorphic function on D defined by F(z)=∞∑k=1Xkzk√k where {Xk}k≥1 are independent complex Gaussian random variables such that
E[Xk]=0,E[|Xk|2]=1andE[X2k]=E[a211]k, and where κ:D→C is the holomorphic function defined for all z∈D by κ(z)=√1−z2E[a211]. The square root defining κ is the one such that κ(0)=1. Notice that it is a well-defined holomorphic function on the simply connected domain D since the function z↦1−z2E[a211] does not vanish on D, indeed |E[a211]|≤E[|a11|2]=1.
The proof of the convergence of qn is partially inspired by a work due to Basak and Zeitouni and relies crucially on a joint combinatorial central limit theorem for traces of fixed powers inspired from a work by Janson and Nowicki. Unlike previous arguments used in the literature for the analysis of Girko matrices, the approach does not rely on Girko Hermitization, Gelfand spectral radius formula, high order traces, resolvent method or Cauchy-Stieltjes transform. The first step consists in showing the tightness of (qn)n≥1, by using a decomposition of the determinant into orthogonal elements related to determinants of submatrices, as in the work of Basak and Zeitouni:det(1−zAn√n)=1+n∑k=1(−z)n∑I⊂{1,…,n}|I|=kdet((An)I,I)√nk.Knowing this tightness, the problem is reduced to show the convergence in law of these elements. A reduction step, inspired by the work of Janson and Nowicki, consists in truncating the entries, reducing the analysis to the case of bounded entries. The next step consists in a central limit theorem for product of traces of powers of fixed orderdet(1−zAn√n)=exp(−∞∑k=1Trace(Akn)√nkzkk).It is important to note that we truncate with a fixed threshold with respect to n, and the order of the powers in the traces are fixed with respect to n. This is in sharp contrast with the usual Füredi-Komlós truncation-trace approach related to the Gelfand spectral radius formula used in all the previous approaches for the spectral radius.
Moment assumptions. The universality for the first order global asymptotics stated by the circular law depends only on the trace E[|a11|2] of the covariance matrix of ℜa11 and ℑa11. The universality stated by the convergence of qn, just like for the central limit theorem, depends on the whole covariance matrix. Since
E[a211]=E[(ℜa11)2]−E[(ℑa11)2]+2iE[ℜa11ℑa11], we can see that E[a211]=0 if and only if E[(ℜa11)2]=E[(ℑa11)2]andE[ℜa11ℑa11]=0. Moreover, we cannot in general get rid of E[a211] by simply multiplying An by a phase.
Hyperbolic Gaussian analytic function. When E[a211]=0 then κ=1 while the random analytic function F which appears in the limit is a degenerate case of the well-known hyperbolic Gaussian Analytic Functions (GAFs). It can also be obtained as the antiderivative of the L=2 hyperbolic GAF which is 0 at z=0. This L=2 hyperbolic GAF is related to the Bergman kernel and could be called the Bergman GAF. These GAFs appear also at various places in mathematics and physics and, in particular, in the asymptotic analysis of Haar unitary matrices.
Cauchy-Stieltjes transform. If E[a211]=0 then by returning to pn, taking the logarithm and the derivative with respect to z in the convergence of qn, we obtain the convergence in law of the Cauchy-Stieltjes transform (complex conjugate of the electric field) minus n/z towards z↦F′(1/z)/z2 which is a Gaussian analytic function on C∖¯D with covariance given by a Bergman kernel.
Central Limit Theorem. The convergence of qn can be seen as a central limit theorem for the log-determinant (global second order analysis). Namely for all z∈D, we have |qn(1/z)|=exp[−n(Un(z)−U(z))] where Un(z)=−1nlog|pn(z)|andU(z)=−log|z| are the logarithmic potential at the point z of the empirical spectral distribution of 1√nAn and of the uniform distribution on the unit disc D.
Moreover, it is possible to extract from the convergence of qn a CLT for linear spectral statistics with respect to analytic functions in a neighborhood of ¯D. This can be done by using the Cauchy formula for an analytic function f,
∫f(λ)μ(dλ)=12πi∫(∮f(z)z−λdz)μ(dλ)=12πi∮f(z)(∫μ(dλ)z−λ)dz=12πi∮f(z)(logdet(z−A))′dzwhere μ is the counting measure of the eigenvalues of A, where the contour integral is around a centered circle of radius strictly larger than 1, and where we have taken any branch of the logarithm. The approach is purely complex analytic. In particular, it is different from the usual approach with the logarithmic potential of μ based on the real function given by z↦∫log|z−λ|μ(dλ)=log|det(z−A)|.
Wigner case and elliptic interpolation. The finite second moment assumption is optimal. We could explore its relation with the finite fourth moment assumption for the convergence of the spectral edge of Wigner random matrices, which is also optimal. Heuristic arguments tell us that the interpolating condition on the matrix entries is E[|ajk|2|akj|2]<∞,j≠k, which is a finite second moment condition for Girko matrices and a finite fourth moment condition for Wigner matrices. This is work in progress.
Coupling and almost sure convergence. For simplicity, we define our random matrix An for all n≥1 by truncating from the upper left corner the infinite random matrix {ajk}j,k≥1. This imposes a coupling for the matrices {An}n≥1. However, since our result of the spectral radius involves a convergence in probability, it remains valid for an arbitrary coupling, in the spirit of the triangular arrays assumptions used for classical CLTs. In another direction, one could ask about the upgrade to almost sure convergence. This is an open problem.
Heavy tails. An analogue of the circular law in the heavy-tailed case E[|a11|2]=∞ is already available but requires another scaling than 1√n. The spectral radius of this model tends to infinity as n→∞ but it could be possible to analyze the limiting point process at the edge as n→∞ and its universality. This is an open problem.
Julia code. Here is a Julia code illustrating the high dimensional phenomenon.
using LinearAlgebra, Plots # for eigvals(), scatter(), heatmap() function phaseportrait(n) M = (randn(n,n)+im*randn(n,n))/sqrt(2*n) Spec = eigvals(M) scatter(real(Spec),imag(Spec),aspect_ratio=:equal,legend=false) r = 1000 c = max.(abs.(Spec)) c = c + c/2 M = zeros(r,r) for i in 1:r for j in 1:r z = (-c + 2*c*i/r) + im * (-c + 2*c*j/r) M[i,j] = angle(prod(z.*Spec.-1)) # reciprocal charpoly phase end end heatmap(M,aspect_ratio=:equal,legend=false,background_color=:transparent,c=:hsv) end # function phaseportrait(250)
Discrete analogue. The method can be adapted to sparse Boolean matrices, replacing the Gaussian limiting regime by a Poisson limiting regime, in relation to important aspects of the high dimensional analysis of random graphs. This was done by Simon Coste.
Further reading.
- Charles Bordenave, Pietro Caputo, Djalil Chafai, Konstantin Tikhomirov
On the spectral radius of a random matrix: an upper bound without fourth moment
The Annals of Probability 46(4) 2268-2286 (2018)
arXiv:1607.05484 - Charles Bordenave, Djalil Chafaï, and David García-Zelada
Convergence of spectral radius of random matrix through characteristic polynomial
Probability Theory and Related Fields (to appear 2021)
arXiv:2012.05602 - Svante Janson and Krzysztof Nowicki
Asymptotic distributions of generalized U-statistics - Applications to random graphs
Probability Theory Related Fields 90(3) 341-375 (1991) - Anirban Basak and Ofer Zeitouni
Outliers of random perturbations of Toeplitz matrices with finite symbols
Probability Theory Related Fields 178(3-4) 771-826 (2020)
arXiv:1905.10244 - Zhi-Dong Bai and Yong-Quan Yin
Limit of norm of products of random matrices and two problems of Geman-Hwang
Probability Theory and Related Fields 73(4) 555-569 (1986) - Simon Coste
Sparse matrices: convergence of the characteristic polynomial seen from infinity
arXiv:2106.00593
