No Arabic abstract
The spectra of empirical correlation matrices, constructed from multivariate data, are widely used in many areas of sciences, engineering and social sciences as a tool to understand the information contained in typically large datasets. In the last two decades, random matrix theory-based tools such as the nearest neighbour eigenvalue spacing and eigenvector distributions have been employed to extract the significant modes of variability present in such empirical correlations. In this work, we present an alternative analysis in terms of the recently introduced spacing ratios, which does not require the cumbersome unfolding process. It is shown that the higher order spacing ratio distributions for the Wishart ensemble of random matrices, characterized by the Dyson index $beta$, is related to the first order spacing ratio distribution with a modified value of co-dimension $beta$. This scaling is demonstrated for Wishart ensemble and also for the spectra of empirical correlation matrices drawn from the observed stock market and atmospheric pressure data. Using a combination of analytical and numerics, such scalings in spacing distributions are also discussed.
When dealing with non-stationary systems, for which many time series are available, it is common to divide time in epochs, i.e. smaller time intervals and deal with short time series in the hope to have some form of approximate stationarity on that time scale. We can then study time evolution by looking at properties as a function of the epochs. This leads to singular correlation matrices and thus poor statistics. In the present paper, we propose an ensemble technique to deal with a large set of short time series without any consideration of non-stationarity. We randomly select subsets of time series and thus create an ensemble of non-singular correlation matrices. As the selection possibilities are binomially large, we will obtain good statistics for eigenvalues of correlation matrices, which are typically not independent. Once we defined the ensemble, we analyze its behavior for constant and block-diagonal correlations and compare numerics with analytic results for the corresponding correlated Wishart ensembles. We discuss differences resulting from spurious correlations due to repeatitive use of time-series. The usefulness of this technique should extend beyond the stationary case if, on the time scale of the epochs, we have quasi-stationarity at least for most epochs.
We consider the empirical eigenvalue distribution of an $mtimes m$ principal submatrix of an $ntimes n$ random unitary matrix distributed according to Haar measure. For $n$ and $m$ large with $frac{m}{n}=alpha$, the empirical spectral measure is well-approximated by a deterministic measure $mu_alpha$ supported on the unit disc. In earlier work, we showed that for fixed $n$ and $m$, the bounded-Lipschitz distance between the empirical spectral measure and the corresponding $mu_alpha$ is typically of order $sqrt{frac{log(m)}{m}}$ or smaller. In this paper, we consider eigenvalues on a microscopic scale, proving concentration inequalities for the eigenvalue counting function and for individual bulk eigenvalues.
Nowadays, strict finite size effects must be taken into account in condensed matter problems when treated through models based on lattices or graphs. On the other hand, the cases of directed bonds or links are known as highly relevant, in topics ranging from ferroelectrics to quotation networks. Combining these two points leads to examine finite size random matrices. To obtain basic materials properties, the Green function associated to the matrix has to be calculated. In order to obtain the first finite size correction a perturbative scheme is hereby developed within the framework of the replica method. The averaged eigenvalue spectrum and the corresponding Green function of Wigner random sign real symmetric N x N matrices to order 1/N are in fine obtained analytically. Related simulation results are also presented. The comparison between the analytical formulae and finite size matrices numerical diagonalization results exhibits an excellent agreement, confirming the correctness of the first order finite size expression.
The structure of the turbulence-driven power fluctuations in a wind farm is fundamentally described from basic concepts. A derived tuning-free model, supported with experiments, reveals the underlying spectral content of the power fluctuations of a wind farm. It contains two power-law trends and oscillations in the relatively low- and high-frequency ranges. The former is mostly due to the turbulent interaction between the flow and the turbine properties; whereas the latter is due to the advection between turbine pairs. The spectral wind-farm scale power fluctuations $Phi_P$ exhibits a power-law decay proportional to $f^{-5/3-2}$ in the region corresponding to the turbulence inertial subrange and at relatively large scales, $Phi_Psim f^{-2}$. Due to the advection and turbulent diffusion of large-scale structures, a spectral oscillation exists with the product of a sinusoidal behavior and an exponential decay in the frequency domain.
We study the dependence of the spectral density of the covariance matrix ensemble on the power spectrum of the underlying multivariate signal. The white noise signal leads to the celebrated Marchenko-Pastur formula. We demonstrate results for some colored noise signals.