Processing math: 100%
Press "Enter" to skip to content

Spectrum of non-Hermitian heavy tailed random matrices

Spectrum and its module for a single realization of A:=X/n^(1/a) when the law F of Xij is the law of (Uniform^(-1/a)-1)*Rademacher, with n=5000 and a=1. Click to get the GNU Octave code.

I have recently uploaded the paper arXiv:1006.1713 [math.PR] entitled Spectrum of non-Hermitian heavy tailed random matrices, written in collaboration with Charles Bordenave and Pietro Caputo.

We provide a rigorous analysis of the phenomenon behind the pictures above. Let (Xjk)j,k1 be i.i.d. complex random variables with cumulative distribution function F. Our main result is a heavy tailed counterpart of Girko's circular law when F has an infinite second moment. Roughly, if F is in the attraction domain of an α-stable law, with 0<α<2, then we prove that there exist a deterministic sequence ann1/α and a probability measure μα on C depending only on α such that with probability one, the empirical distribution of the eigenvalues of the rescaled matrix (a1nXjk)1j,kn converges weakly to μα as n. Our approach combines Aldous & Steele's objective method with Girko's Hermitization using logarithmic potentials. The underlying limiting object is defined on a bipartized version of Aldous' Poisson Weighted Infinite Tree. Recursive relations on the tree provide some properties of μα. In contrast with the Hermitian and Hermitized cases, we find that μα is not heavy tailed.

Let us first recall that the eigenvalues of an n×n complex matrix M are the roots in C of its characteristic polynomial. We label them λ1(M),,λn(M) with non growing modules and growing phases. We also denote by s1(M)sn(M) the singular values of M, defined for every 1kn by sk(M):=λk(MM). We define the empirical spectral measure and the empirical singular values measure as

μM=1nnk=1δλk(M)and νM=1nnk=1δsk(M).

Let us define the n×n random matrix X=(Xij)1i,jn. Following Dozier and Silverstein, if F has finite positive variance σ2, then for every zC, there exists a probability measure Qσ,z on [0,) depending only on σ and z, with explicit Cauchy-Stieltjes transform, such that a.s. (almost surely)

ν1nXzInQσ,z     (1)

where denotes the weak convergence of probability measures. The proof of (1) is based on a classical approach for Hermitian random matrices with bounded second moment: truncation, centralization, recursion on the resolvent, and cubic equation for the limiting Cauchy-Stieltjes transform (fixed point characterization). In the special case z=0, the statement (1) reduces to the quartercircular law theorem (square version of the Marchenko-Pastur theorem (see e.g. Marchenko and Pastur, Wachter, Yin) and the probability measure Qσ,0 is the quartercircular law with Lebesgue density

x1πσ24σ2x21[0,2σ](x).     (2)

Girko's famous circular law theorem states under the same assumptions that a.s.

μ1nXnUσ     (3)

where Uσ is the uniform law on the disc {zC;|z|σ}. This statement was established through a long sequence of partial results Mehta, Girko, Silverstein, Edelman, Girko, Bai, Girko, Bai and Silverstein, Pan and Zhou, Götze and Tikhomirov, Tao and Vu, the general case (3) being finally obtained by Tao and Vu, by using Girko's Hermitization with logarithmic potentials and uniform integrability, the convergence (1), and polynomial bounds on the extremal singular values. The idea of using directly logarithmic potentials was already present in the work of Goldsheid and Khoruzhenko for non-Hermitian random tridiagonal matrices.

The aim of our work is to investigate what happens when F does not have a finite second moment. We shall consider the following hypothesis:

  • (H1) there exists a slowly varying function L (i.e. limtL(xt)/L(t)=1 for any x>0) and a real number α(0,2) such that for every t1

    P(|X11|t)={zC;|z|t}dF(z)=L(t)tα,

    and there exists a probability measure θ on the unit circle S1:={zC;|z|=1} of the complex plane such that for every Borel set DS1,

    limtP(X11|X11|D||X11|t)=θ(D).

Assumption (H1) states a complex version of the classical criterion for the domain of attraction of a real α-stable law, see e.g. theorem IX.8.1a in Feller's book. For instance, if X11=V1+iV2 with i=1 and where V1 and V2 are independent real random variables both belonging to the domain of attraction of an α-stable law then (H1) holds. When (H1) holds, we define the sequence

an:=inf{a>0 s.t. nP(|X11|a)1}

and (H1) implies that

limnnP(|X11|an)=limnnaαnL(an)=1.

It follows then classically that for every n1

an=n1/α(n)

for some slowly varying function . The additional possible assumptions on F to be considered in the sequel are the following:

  • (H2) P(|X11|t)tctα for some c>0 (this implies annc1/αn1/α)
  • (H3) X11 has a bounded probability Lebesgue density on R or on C.

One can check that (H1-H2-H3) hold e.g. when the module |X11| and the phase X11/|X11| are independent with |X11|=|S| where S is real symmetric α-stable and the phase follows a Dirac mass or an absolutely continuous law. Another basic example is given by X11=εW1/α with ε and W independent such that ε takes values in {1,1} and W is uniform on [0,1].

For every n1, let us define the i.i.d. n×n complex matrix A=An by

Aij:=a1nXij     (4)

for every 1i,jn. Our first result concerns the singular values of AzI, zC.

Theorem 1 (Singular values) If (H1) holds then for all zC, there exists a probability measure να,z on [0,) depending only on α and z such that a.s.

νAzInνα,z.

The case z=0 was already obtained by Belinschi, Dembo and Guionnet. Theorem 1 is a heavy tailed version of the Dozier and Silverstein theorem (1). Our main results below give a heavy tailed version of Girko's circular law theorem (3), as well as a non-Hermitian version of Wigner's theorem for Lévy matrices considered by Bouchaud and Cizeau, Ben Arous and Guionnet, Belinschi, Dembo and Guionnet, and arXiv:0903.3528 [math.PR].

Theorem 2 (Eigenvalues) If (H1-H2-H3) hold then there exists a probability measure μα on C depending only on α such that a.s.

μAnμα.

Theorem 3 (Limiting law) The probability distribution μα from theorem 2 is isotropic and has a continuous density. Its density at z=0 equals

Γ(1+2/α)2Γ(1+α/2)2/α2πΓ(1α/2)2/α.

Furthermore, up to a multiplicative constant, the density of μα is equivalent, as |z|, to

|z|2(α1)eα2|z|α.

Recall that for a normal matrix (i.e. which commutes with its adjoint), the module of the eigenvalues are equal to the singular values. Theorem 3 reveals a striking contrast between μα and να,0. The limiting law of the eigenvalues μα has a stretched exponential tail while the limiting law να,0 of the singular values is heavy tailed with power exponent α, see e.g. Belinschi, Dembo and Guionnet. This does not contradict the identity

nk=1|λk(A)|=nk=1sk(A)

and the Weyl inequalities but it does indicate that A is typically far from being a normal matrix. A similar shrinking phenomenon appears already in the finite second moment case (1-3): the law of the module under the circular law Uσ has density

r2σ2r1[0,σ](r)

in contrast with the density (2) of the quartercircular law Qσ,0 (even the supports differ by a factor 2).

The proof of theorem 1 relies on an extension to non-Hermitian matrices of the objective method'' approach developed in arXiv:0903.3528 [math.PR]. More precisely, we build an explicit operator on Aldous' Poisson Weighted Infinite Tree (PWIT) and prove that it is the local limit of the matrices An in an appropriate sense. While Poisson statistics arises naturally as in all heavy tailed phenomena, the fact that a tree structure appears in the limit is roughly explained by the observation that non vanishing entries of the rescaled matrix An=a1nX can be viewed as the adjacency matrix of a sparse random graph which locally looks like a tree. In particular, the convergence to PWIT is a weighted-graph version of familiar results on the local structure of Erdös-Rényi random graphs. The method relies on the local weak convergence, a notion introduced by Benjamini and Schramm, Aldous and Steele, see also Aldous and Lyons.

The proof of theorem 2 relies on Girko's Hermitization method with logarithmic potentials, on theorem 1, and on polynomial bounds on the extremal singular values needed to establish a uniform integrability property. This extends the Hermitization method to more general settings, by successfully mixing various arguments already developed in arXiv:0903.3528 [math.PR], arXiv:0808.1502 [math.PR], and by Tao and Vu. Following them, one of the key step will be a lower bound on the distance of a row of the matrix A to a subspace of dimension at most nn1γ, for some small γ>0. We also use a concentration for empirical spectral distributions in order to obtain the almost sure convergence in the Hermitization.

Girko's Hermitization method gives a characterization of μα in terms of its logarithmic potential. In our settings, however, this is not convenient to derive properties of the measure μα, and our proof of theorem 3 is based on an analysis of a self-adjoint operator on the PWIT and a recursive characterization of the spectral measure from the resolvent of this operator. We develop an efficient machinery to analyze the complex spectral measures which avoids a direct use of the logarithmic potential and the singular values. Our approach builds upon similar methods in the physics literature, e.g. Feinberg and Zee, Gudowska-Nowak, Nowak, and Pappe, and Rogers and Castillo. The Cauchy-Stieltjes transform is based on complex numbers and constitutes an efficient tool for the spectral analysis of Hermitian random matrices. Beyond Hermitian random matrices, a quaternionic version of it may be used. However, for non normal random matrices, this does not work. We propose a new approach based on bipartization, in which the quaternions are replaced by suitable 2×2 matrices.

The derivation of a Markovian version of theorems 1 and 2 is an interesting open problem that will be analyzed elsewhere, see arXiv:0903.3528 [math.PR] for the symmetric case and arXiv:0808.1502 [math.PR] for the light tailed non-symmetric case. It is also tempting to seek for an interpretation of να,z and μα in terms of a sort of graphical free probability theory. With a proper notion of trace, it is possible to define the spectral measure of an operator, see e.g. Brown, Haagerup and Schultz, Lyons, but we do not pursue this goal here. The study of localization of eigenvectors of heavy tailed random matrices is also an interesting problem.

    Leave a Reply

    Your email address will not be published.

    This site uses Akismet to reduce spam. Learn how your comment data is processed.

    Syntax · Style · .