Libres pensées d'un mathématicien ordinaire

A few words about entropy

Djalil Chafaï — Sat, 06 Apr 2024 13:12:35 +0000

Nicolas Léonard Sadi Carnot (1796 – 1932) A romantic figure behind entropy

Why and how entropy emerges in basic mathematics? This tiny post aims to provide some answers. We have already tried something in this spirit in a previous post almost ten years ago.

Combinatorics. Asymptotic analysis of the multinomial coefficient $\binom{n}{n_1,\ldots,n_r}:=\frac{n!}{n_1!\cdots n_r!}$ :
\[
\frac{1}{n}\log\binom{n}{n_1,\ldots,n_r}
\xrightarrow[n=n_1+\cdots+n_r\to\infty]{\nu_i=\frac{n_i}{n}\to p_i}
\mathrm{S}(p):=-\sum_{i=1}^rp_i\log(p_i).
\] Recall that if $A=\{a_1,\ldots,a_r\}$ is a finite set of cardinal $r$ and $n=n_1+\cdots+n_r$ then
\[
\mathrm{Card}\Bigr\{(x_1,\ldots,x_n)\in A^n:\forall 1\leq i\leq r,\sum_{k=1}^n\mathbf{1}_{x_k=a_i}=n_i\Bigr\}=\binom{n}{n_1,\ldots,n_r}.
\] The multinomial coefficient can be interpreted as the number of microstates $(x_1,\ldots,x_n)$ compatible with the macrostate $(n_1,\ldots,n_r)$, while the quantity $\mathrm{S}(p)$ appears as a normalized asymptotic measure of additive degrees of freedom or disorder. This is already in the work of Ludwig Eduard Boltzmann (1844 — 1906) in kinetic gas theory at the origins of statistical physics. The quantity $\mathrm{S}(p)$ is also the one used by Claude Elwood Shannon (1916 — 2001) in information and communication theory as the average length of optimal lossless coding. It is characterized by the following three natural axioms or properties, denoting $\mathrm{S}^{(n)}$ to remember $n$ :

for all $n\geq1$, $p\mapsto\mathrm{S}^{(n)}(p)$ is continuous
for all $n\geq1$, $\mathrm{S}^{(n)}(\frac{1}{n},\ldots,\frac{1}{n})<\mathrm{S}^{(n+1)}(\frac{1}{n+1},\ldots,\frac{1}{n+1})$
for all $n=n_1+\cdots+n_r\geq1$, $\mathrm{S}^{(n)}(\frac{1}{n},\ldots,\frac{1}{n})=\mathrm{S}^{(r)}(\frac{n_1}{n},\ldots,\frac{n_r}{n})+\sum_{i=1}^r\frac{n_i}{n}\mathrm{S}^{(n_i)}(\frac{1}{n_i},\ldots,\frac{1}{n_i}).$

Probability. If $X_1,\ldots,X_n$ are independent and identically distributed random variables of law $\mu$ on a finite set or alphabet $A=\{a_1,\ldots,a_r\}$, then for all $x_1,\ldots,x_n\in A$,
\begin{align*}
\mathbb{P}((X_1,\ldots,X_n)=(x_1,\ldots,x_n))
&=\prod_{i=1}^r\mu_i^{\sum_{k=1}^n\mathbb{1}_{x_k=a_i}}
=\prod_{i=1}^r\mu_i^{n\nu_i}
=\mathrm{e}^{n\sum_{i=1}^n\nu_i\log\mu_i}\\
&=\mathrm{e}^{-n(\mathrm{S}(\nu)+\mathrm{H}(\nu\mid\mu))},
\end{align*} a remarkable identity where $\mathrm{S}(\nu)$ is the Boltzmann-Shannon entropy considered before, and where $\mathrm{H}(\nu\mid\mu)$ is a new quantity known as the Kullback-Leibler divergence or relative entropy :
\[
\mathrm{S}(\nu):=-\sum_{i=1}^r\nu_i\log\nu_i=-\int f(x)\log f(x)\mathrm{d}x
\] where $f$ is the density of $\nu$ with respect to the counting measure $\mathrm{d}x$, and
\[
\mathrm{H}(\nu\mid\mu):=\sum_{i=1}^r\nu_i\log\frac{\nu_i}{\mu_i}
=\sum_{i=1}^r\frac{\nu_i}{\mu_i}\log\frac{\nu_i}{\mu_i}\mu_i
=\int\frac{\mathrm{d}\nu}{\mathrm{d}\mu}\log\frac{\mathrm{d}\nu}{\mathrm{d}\mu}\mathrm{d}\mu.
\] This comes from information theory after Solomon Kullback (1907 – 1994) and Richard Leibler (1914 — 2003). Here $\mathrm{S}(\nu)$ measures the combinatorics on $x_1,\ldots,x_n$ at prescribed frequencies $\nu$, while $\mathrm{H}(\nu\mid\mu)$ measures the cost or energy of deviation from the actual distribution $\mu$. This is a Boltzmann–Gibbsfication of the probability $\mathbb{P}((X_1,\ldots,X_n)=(x_1,\ldots,x_n))$, see below, leading via the Laplace method to the large deviations principle of Ivan Nikolaevich Sanov (1919 — 1968). The Jensen inequality for the strictly convex function $u\mapsto u\log(u)$ gives
\[
\mathrm{H}(\nu\mid\mu)\geq0\quad\text{with equality iff}\quad\nu=\mu.
\]

Statistics. If $Y_1,\ldots,Y_n$ are independent and identically distributed random variables of law $\mu^{(\theta)}$ in parametric family parametrized by $\theta$, on a finite set $A$, then, following Ronald Aylmer Fisher (1890 – 1962), the likelihood of data $(x_1,\ldots,x_n)\in A^n$ is
\[
\ell_{x_1,\ldots,x_n}(\theta):=\mathbb{P}(Y_1=x_1,\ldots,Y_n=x_n)
=\prod_{i=1}^n\mu^{(\theta)}_{x_i}.
\] It can also be seen as the likelihood of $\theta$ with respect to $x_1,\ldots,x_n$. This dual point of view leads to the following : if $X_1,\ldots,X_n$ is an observed sample of $\mu^{(\theta_*)}$ with $\theta_*$ unknown then the maximum likelihood estimator of $\theta_*$ is
\[
\widehat{\theta}_n:=\arg\max_{\theta\in\Theta}\ell_{X_1,\ldots,X_n}(\theta)
=\arg\max_{\theta\in\Theta}\Bigr(\frac{1}{n}\log\ell_{X_1,\ldots,X_n}(\theta)\Bigr).
\] The asymptotic analysis via the law of large numbers reveals entropy as asymptotic contrast
\begin{align*}
\frac{1}{n}\log\ell_{X_1,\ldots,X_n}(\theta)
&=\frac{1}{n}\sum_{i=1}^n\log\mu^{(\theta)}_{X_i}\\&\xrightarrow[n\to\infty]{\mathrm{a.s.}}
\sum_{k=1}^r\mu^{(\theta_*)}_k\log\mu^{(\theta)}_k
=\underbrace{-\mathrm{S}(\mu^{(\theta_*)})}_{\text{const}}-\mathrm{H}(\mu^{(\theta_*)}\mid\mu^{(\theta)}).
\end{align*}

Analysis. The entropy appears naturally as a derivative of the $L^p$ norm of $f\geq0$ as follows:
\[
\partial_p\|f\|_p^p
=\partial_p\int f^p\mathrm{d}\mu
=\partial_p\int \mathrm{e}^{-p\log(f)}\mathrm{d}\mu
=\int f^p\log(f)\mathrm{d}\mu
=\frac{1}{p}\int f^p\log(f^p)\mathrm{d}\mu.
\] This is at the heart of the Leonard Gross (1931 — ) theorem relating the hypercontractivity of Markov semigroups with the logarithmic Sobolev inequality for the invariant measure. This can also be used to extract from the William Henry Young (1843 – 1942) convolution inequalities certain entropic uncertainty principles.

Boltzmann-Gibbs measures, variational characterizations, and Helmholtz free energy. We take $V:A\to\mathbb{R}$, interpreted as an energy. Maximizing $\mu\mapsto\mathrm{S}(\mu)$ over the constraint of average energy $\int V\mathrm{d}\mu=v$ gives the maximizer \[
\mu_\beta
:=\frac{1}{Z_\beta}\mathrm{e}^{-\beta V}\mathrm{d}x
\quad\text{where}\quad
Z_\beta:=\int\mathrm{e}^{-\beta V}\mathrm{d}x.
\] We use integrals instead of sums to lightnen notation. The notation $\mathrm{d}x$ stands for the counting measure on $A$. The parameter $\beta>0$, interpreted as inverse temperature, is dictated by $v$. Such a probability distribution $\mu_\beta$ is known as a Boltzmann-Gibbs distribution, after Ludwig Eduard Boltzmann (1844 – 1906) and Josiah Willard Gibbs (1839 – 1903). We have a variational characterization as a maximum entropy at fixed average energy :
\[
\int V\mathrm{d}\mu=\int V\mathrm{d}\mu_\beta
\quad\Rightarrow\quad
\mathrm{S}(\mu_\beta)-\mathrm{S}(\mu)
=\mathrm{H}(\mu\mid\mu_\beta).
\] There is a dual point of view in which instead of fixing the average energy, we fix the inverse temperature $\beta$ and we introduce the Hermann von Helmholtz (1821 – 1894) free energy
\[
\mathrm{F}(\mu):=\int V\mathrm{d}\mu-\frac{1}{\beta}\mathrm{S}(\mu)
\] This can be seen as a Joseph-Louis Lagrange (1736 – 1813) point of view in which the constraint is added to the functional. We have
\[
\mathrm{F}(\mu_\beta)=-\frac{1}{\beta}\log(Z_\beta)
\quad\text{since}\quad
\mathrm{S}(\mu_\beta)=\beta\int V\mathrm{d}\mu_\beta+\log Z_\beta.
\] We have then a new variational characterization as a minimum free energy at fixed temperature :
\[
\mathrm{F}(\mu)-\mathrm{F}(\mu_\beta)=\frac{1}{\beta}\mathrm{H}(\mu\mid\mu_\beta).
\] This explains why $\mathrm{H}$ is often called free energy.

Legrendre transform. The relative entropy $\nu\mapsto\mathrm{H}(\nu\mid\mu)$ is the Legendre transform of the log-Laplace transform, in the sense that
\[
\sup_g\Bigr\{\int g\mathrm{d}\nu-\log\int\mathrm{e}^g\mathrm{d}\mu\Bigr\}=\mathrm{H}(\nu\mid\mu).
\] Indeed, for all $h$ such that $\int\mathrm{e}^h\mathrm{d}\mu=1$, by the Jensen inequality, with $f:=\frac{\mathrm{d}\nu}{\mathrm{d}\mu}$,
\begin{align*}
\int h\mathrm{d}\nu
&=\int f\log(f)\mathrm{d}\mu+\int\log\frac{\mathrm{e}^h}{f}f\mathrm{d}\mu\\
&\leq\int f\log(f)\mathrm{d}\mu+\log\int\mathrm{e}^h\mathrm{d}\mu
=\int f\log(f)\mathrm{d}\mu=\mathrm{H}(\nu\mid\mu),
\end{align*} and equality is achieved for $h=\log f$. It remains to reparametrize with $h=g-\log\int\mathrm{e}^g\mathrm{d}\mu$. Conversely, the Legendre transform of the relative entropy is the log-Laplace transform :
\[
\sup_{\nu}
\Bigr\{\int g\mathrm{d}\nu-\mathrm{H}(\nu\mid\mu)\Bigr\}=\log\int\mathrm{e}^g\mathrm{d}\mu.
\] This is an instance of the convex duality for the convex functional $\nu\mapsto\mathrm{H}(\nu\mid\mu)$.

Same story for $-\mathrm{S}$ which is convex as a function of the Lebesgue density of its argument.

Heat equation. The heat equation $\partial_tf_t=\Delta f_t$ is the gradient flow of entropy :$$\partial_t\int f_t\log(f_t)\mathrm{d}x=-\int\frac{\|\nabla f_t\|^2}{f_t}\mathrm{d}x$$where we used integration by parts, the right hand side is the Fisher information. In other words, the entropy is a Lyapunov function for the heat equation seen as an infinite dimensional ODE.

Further reading.

Boltzmann-Gibbs entropic variational principle
On this blog (2022)
Entropy ubiquity
On this blog (2015)
Bosons and fermions
On this blog (2012)

Libres pensées de l’équinoxe

Djalil Chafaï — Thu, 21 Mar 2024 20:21:05 +0000

Ludvik Glazer-Naudé – Der Rollenspieler – Peinture acrylique sur bois

En France comme dans bien d’autres pays développés, les temples de l’élitisme sont avant tout ceux de l’auto-reproduction de dominants socio-culturels. Ceci explique peut-être l’effacement relatif de la question sociale dans certains de ces établissements, au profit de questions sociétales qui préoccupent les dominants et leur progéniture. Il faut dire que ces bourgeois, petits ou grands, bohèmes ou pas, d’extrême gauche ou pas, font des enfants, mais pas des enfants d’ouvriers, et sont pratiquement les seuls à pouvoir optimiser le parcours scolaire.

L’establishment, travaillé par ses propres convictions et une certaine militance parmi ses gouvernés, peut même aller jusqu’à tenter d’imposer à tous un point de vue dogmatique bien-pensant sur des sujets de société ou d’actualité. C’est qu’il faut faire en sorte que tout le monde réfléchisse correctement, éclairer les déviants, intimider les dissidents. Totalitarisme d’opérette ? Maccarthysme, inquisition, chasse aux sorcières qui ne disent pas leur nom ? Ces termes sont excessifs ? Nombreux sont ceux qui haussent les épaules, courbent l’échine, préfèrent se taire et attendre des jours meilleurs. Après tout, staliniens et autres maoïstes n’ont fait que passer.

L’Histoire suggère que la vérité tient plus d’une quête permanente que d’un aboutissement définitif. Il va sans dire que toutes les certitudes sont revisitées ou revisitables, mais toutes ne sont pas à mettre sur le même plan, certaines sont plus étayées que d’autres. Et il ne suffit pas de se nourrir de déconstruction ou de confiance pour avoir raison. La subversion et la nouveauté, pas plus que le conformisme et la tradition, ne garantissent justesse et pertinence, même s’ils exercent un fort pouvoir de séduction sur les esprits. Curieusement, les dominants socio-culturels, héritiers et pratiquants de la liberté de pensée et d’expression, sont parfois les premiers à vouloir la contrôler, au nom d’une orthodoxie morale de nature religieuse, convaincus de détenir la vérité, et d’avoir le devoir de faire taire ceux qui pensent différemment. L’Histoire nous enseigne qu’un totalitarisme peut se bâtir sur une absence de doute et d’esprit critique parmi les puissants, une médiocrité intellectuelle vécue comme juste, visionnaire, et à l’avant-garde.

Archimedes theorem on sphere and cylinder

Djalil Chafaï — Mon, 18 Mar 2024 17:22:01 +0000

The Fields Medal and its portrait of Archimedes.

Archimedes theorem. Archimedes (Ἀρχιμήδης) of Syracuse (-287 – -212) is one of the greatest minds of all times. One of his discoveries is as follows : if we place a sphere in the tightest cylinder, then the surface of the sphere and of the cylinder are the same, and more generally, if we cut the whole in two pieces by any perpendicular horizontal plane, then this remains for each pieces : the surface of the spherical cap is equal to the surface of the corresponding slice of cylinder. Archimedes was so proud of it that he put the picture of it on his tombstone. This allowed his admirer Marcus Tullius Cicero (-106 – -43) to identify the tomb, in -75, almost 150 years after the murder of Archimedes by a Roman soldier during the siege of Syracuse.

The Archimedes theorem allows to recover the formula for the surface of the sphere : if the sphere has radius $r$, then the surface of the cylinder is $2\pi r\times 2r=4\pi r^2$. The cutting plane part of the Archimedes theorem says that the uniform distribution on the sphere, when projected on the vertical diameter, gives the uniform distribution on the diameter. Archimedes used antic geometrical methods. Nowadays, with the development of modern analytic and probabilistic tools, we can prove easily an extension of his theorem to arbitrary dimension. More precisely, let $(X_1,\ldots,X_n)$ be a random vector of $\mathbb{R}^n$, $n\geq3$, uniformly distributed on the sphere $\mathbb{S}^{n-1}$. Then its projection $(X_1,\ldots,X_{n-2})$ on $\mathbb{R}^{n-2}$ is uniform on the unit ball of $\mathbb{R}^{n-2}$. Indeed, since $$(X_1,\ldots,X_n)\overset{\mathrm{d}}{=}\frac{Z}{\|Z\|}=\frac{(Z_1,\ldots,Z_n)}{\sqrt{Z_1^2+\cdots+Z_n^2}}$$where $Z=(Z_1,\ldots,Z_n)\sim\mathcal{N}(0,I_n)$, we get, for all $1\leq k\leq n$,
$$\begin{align*}\|(X_1,\ldots,X_k)\|^2
&=X_1^2+\cdots+X_k^2\\
&\overset{\mathrm{d}}{=}
\frac{Z_1^2+\cdots+Z_k^2}{Z_1^2+\cdots+Z_k^2+Z_{k+1}^2+\cdots+Z_n^2}\\
&=\frac{A}{A+B}\\
&\sim\mathrm{Beta}\Bigr(\frac{k}{2},\frac{n-k}{2}\Bigr),\end{align*}$$ where the last step comes from the fact that $\frac{A}{A+B}\sim\mathrm{Beta}(\frac{a}{2},\frac{b}{2})$ when $$A\sim\chi^2(a)=\Gamma\Bigr(\frac{a}{2},\frac{1}{2}\Bigr)\quad\text{and}\quad B\sim\chi^2(b)=\Gamma\Bigr(\frac{b}{2},\frac{1}{2}\Bigr)\quad\text{are independent}.$$Recall that the law $\mathrm{Beta}(\alpha,\beta)$ has density proportional to $t\in[0,1]\mapsto t^{\alpha-1}(1-t)^{\beta-1}$, which is a power when $\beta=1$. In particular, in the special case where $k=n-2$, we find $$\mathrm{Beta}\Bigr(\frac{k}{2},\frac{n-k}{2}\Bigr)=\mathrm{Beta}\Bigr(\frac{n}{2}-1,1\Bigr),$$and this law has a power density, proportional to $t\in[0,1]\mapsto t^{n/2-2}$. Now, a rotationally invariant random vector $X’$ of $\mathbb{R}^{n-2}$ is uniformly distributed on the unit ball of $\mathbb{R}^{n-2}$ iff $\|X’\|$ has density proportional to $r\in[0,1]\mapsto r^{n-3}$, in other words $\|X’\|^2$ has density proportional to $t\in[0,1]\mapsto\sqrt{t}^{n-3}/\sqrt{t}=t^{n/2-2}$, which matches the Beta law above.

Reverse Archimedes theorem. It states that if $Z_1,\ldots,Z_n$ are i.i.d. $\mathcal{N}(0,1)$, $n\geq1$, and if $E$ is exponential of unit mean independent of $Z_1,\ldots,Z_n$, then the random vector $$
\frac{(Z_1,\ldots,Z_n)}{\sqrt{Z_1^2+\cdots+Z_n^2+2E}}
$$ is uniformly distributed on the unit ball of $\mathbb{R}^n$. To see it, it suffices to use an extended Gaussian sequence $(Z_1,\ldots,Z_{n+2})\sim\mathcal{N}(0,I_{n+2})$, the Archimedes principle, and the observation that $$Z_{n+1}^2+Z_{n+2}^2\sim\chi^2(2)=\Gamma\Bigr(\frac{2}{2},\frac{1}{2}\Bigr)=\mathrm{Exp}\Bigr(\frac{1}{2}\Bigr)\sim 2E.$$ The reverse Archimedes principle reveals that the uniform law on the ball concentrates, in high dimension, around the extremal sphere at its edge. Indeed, by the law of large numbers,$$\frac{\sqrt{Z_1^2+\cdots+Z_n^2}}{\sqrt{Z_1^2+\cdots+Z_n^2+2E}}=\frac{1}{\sqrt{1+O\bigr(\frac{1}{n}\bigr)}}\underset{n\to\infty}{\longrightarrow}1\quad\text{almost surely}.$$This is an instance of the thin-shell phenomenon for convex bodies. From this point of view, in high dimension $n$, the sphere of radius $\sqrt{n}$ behaves approximately as an isotropic convex body.

Note. The Archimedes theorem on the sphere and the cylinder is sometimes referred to as the Archimedes principle. But this last term is more classically used for the fact that a body at rest in a fluid is acted upon by a force pushing upward called the buoyant force, equal to the weight of the fluid that the body displaces, related to the famous Eurêka! These are distinct discoveries.

Opposite side of the Medal, with Archimedes’ tomb depicting his theorem on the sphere and the cylinder.

Further reading.

Archimedes of Syracuse
On the Sphere and Cylinder
Two volumes (-225)
Bernard Beauzamy
Archimedes’ Modern Works
Real Life Mathematics, Société de Calcul Mathématique (2012)
Gérard Letac
From Archimedes to statistics: the area of the sphere
Jim Pitman and Nathan Ross
Archimedes, Gauss, and Stein
Notices American Mathematical Society 59(10) 1416-1421 (2012)
Author of this blog
Phénomènes de grande dimension
Notes de cours – École normale supérieure (2024)
Author of this blog
The Funk-Hecke formula
Libres pensées d’un mathématicien ordinaire (2021)
Author of this blog
Central limit theorem for convex bodies
Libres pensées d’un mathématicien ordinaire (2011)

Archimedes death, by Édouard Vimont (1846 – 1930). A sphere in a cylinder is drawn on the wall behind him.

Modern times. The reasoning above involving a Beta law is actually a very special aspect of a more general and deeper probabilistic structure involving the Dirichlet law, a generalizatoin of the Euler Beta law, defined using the Euler Gamma law. More precisely, for real parameters $a_1>0,\ldots,a_n>0$, the law $\mathrm{Dirichlet}(a_1,\ldots,a_n)$ is the law of the random vector
$$(D_1,\ldots,D_n):=\frac{(G_1,\ldots,G_n)}{G_1+\cdots+G_n}$$of $\mathbb{R}^n$, where $G_1,\ldots,G_n$ are independent with $G_i\sim\mathrm{Gamma}(a_i,\lambda)$, for an arbitrary scaling parameter $\lambda>0$. We can safely take without loss of generality $\lambda=1$. The support of this law is the simplex of discrete probability distributions with $n$ atoms $$\Delta_n:=\{(p_1,\ldots,p_n):p_1\geq0,\ldots,p_n\geq0,p_1+\cdots+p_n=1\}\subset[0,1]^n.$$ Its density is
$$
(x_1,\ldots,x_{n-1})\mapsto
\frac{\Gamma(a_1+\cdots+a_n)}{\Gamma(a_1)\cdots\Gamma(a_n)}
\prod_{i=1}^{n-1}x_i^{a_i-1}\Bigr(1-\sum_{i=1}^{n-1}x_i\Bigr)^{a_n-1}
\mathbf{1}_{(x_1,\ldots,x_{n})\in\Delta_n}.
$$ Moreover, for all $1\leq i\leq n$, the $i^{\mathrm{th}}$ component follows a Beta law :
$$\frac{G_i}{G_1+\cdots+G_n}\sim\mathrm{Beta}(a_i,a_1+\cdots+a_n-a_i).$$ The Dirichlet law structure is stable by summation by blocks. More precisely, for any partitition $I_1,\ldots,I_k$ of the finite set $\{1,\ldots,n\}$ into non-empty subsets or blocks, we have
$$
\Bigr(\sum_{i\in I_1}D_i,\ldots,\sum_{i\in I_k}D_i\Bigr)
\sim\mathrm{Dirichlet}\Bigr(\sum_{i\in I_1}a_i,\ldots,\sum_{i\in I_k}a_i\Bigr),$$ and for any non-empty subset $I$ of $\{1,\ldots,n\}$, $$\sum_{i\in I}D_i\sim\mathrm{Beta}\Bigr(\sum_{i\in I}a_i,\sum_{i\not\in I}a_i\Bigr).$$The Dirichlet law plays an important role for spatial points processes, stochastic simulation, as well as in Statistics due to their Bayesian duality with multinomial laws. If $U_1,\ldots,U_n$ are independent uniform random variables on $[0,1]$, and if $U_{(0)}\leq\cdots\leq U_{(n)}$ is their non-decreasing reordering, known as the order statistics, then, denoting $U_{(0)}:=0$ and $U_{(n+1)}:=1$, $$(U_{(1)}-U_{(0)},\ldots,U_{(n+1)}-U_{(n)})\overset{\mathrm{d}}{=}\frac{(E_1,\ldots,E_{n+1})}{E_1+\cdots+E_{n+1}}\sim\mathrm{Dirichlet}(1,\ldots,1),$$ where $E_1,\ldots,E_{n+1}$ are independent and identically distributed exponential random variables (with arbitrary parameter). This fact is at the heart of Poisson point processes.

Another important side of what we used is the link between chi-square laws and Gamma laws. In terms of probabilistic culture and structure, there are two basic facts. The first one is $$\chi^2(1):=\Bigr(\mathcal{N}(0,1)\Bigr)^2=\mathrm{Gamma}\Bigr(\frac{1}{2},\frac{1}{2}\Bigr)$$ while the second one is the additivity related to the Gamma shape parameter $$\mathrm{Gamma}(a,\lambda)*\mathrm{Gamma}(b,\lambda)=\mathrm{Gamma}(a+b,\lambda).$$ This gives the important fact $$\chi^2(n)=(\chi^2(1))^{*n}=\Gamma\Bigr(\frac{n}{2},\frac{1}{2}\Bigr).$$ In particular the heart of the Box-Muller simulation algorithm involves the special case $$\chi^2(2)=\mathrm{Gamma}\Bigr(1,\frac{1}{2}\Bigr)=\mathrm{Expo}\Bigr(\frac{1}{2}\Bigr)=-2\log(\mathrm{Uniform}[0,1]).$$

Algériennes – جزائريات

Djalil Chafaï — Fri, 08 Mar 2024 07:08:11 +0000

Cheikha Remitti (1923 – 2006)

Cheb Khaled (1960 – ) الشاب خالد est peut-être le chanteur de raï le plus connu. Son album le plus réussi est sans doute Kutché (1988), en collaboration avec Safy Boutella (1950 – ) صافي بوتلة. La plupart des chanteurs de raï de cette génération, et en particulier Khaled lui-même, notamment dans Kutché, ont été influencés par Cheikha Remitti (1923 – 2006) شيخة ريميتي. Connaissez-vous cette grand-mère et reine du raï ? On trouve chez elle la même puissance existentielle que dans le Delta blues du Mississippi. En voici une version modernisée tardive :

La plus aristocratique Taos Amrouche (1913 – 1976) طاووس عمروش est pratiquement de la même génération. Connaissez-vous sa mère, Fadhma Aït Mansour Amrouche (1882 – 1967) فاطمة آيت منصور عمروش ? Son autobiographie intitulée Histoire de ma vie, parue en 1968 après sa mort, vaut vraiment la peine d’être lue.

Fadhma Aït Mansour Amrouche (1882 – 1967)

Voici enfin une algérienne contemporaine du hirak : Raja Meziane (1988 – ) رجاء مزيان