No Arabic abstract
When dealing with non-stationary systems, for which many time series are available, it is common to divide time in epochs, i.e. smaller time intervals and deal with short time series in the hope to have some form of approximate stationarity on that time scale. We can then study time evolution by looking at properties as a function of the epochs. This leads to singular correlation matrices and thus poor statistics. In the present paper, we propose an ensemble technique to deal with a large set of short time series without any consideration of non-stationarity. We randomly select subsets of time series and thus create an ensemble of non-singular correlation matrices. As the selection possibilities are binomially large, we will obtain good statistics for eigenvalues of correlation matrices, which are typically not independent. Once we defined the ensemble, we analyze its behavior for constant and block-diagonal correlations and compare numerics with analytic results for the corresponding correlated Wishart ensembles. We discuss differences resulting from spurious correlations due to repeatitive use of time-series. The usefulness of this technique should extend beyond the stationary case if, on the time scale of the epochs, we have quasi-stationarity at least for most epochs.
The spectra of empirical correlation matrices, constructed from multivariate data, are widely used in many areas of sciences, engineering and social sciences as a tool to understand the information contained in typically large datasets. In the last two decades, random matrix theory-based tools such as the nearest neighbour eigenvalue spacing and eigenvector distributions have been employed to extract the significant modes of variability present in such empirical correlations. In this work, we present an alternative analysis in terms of the recently introduced spacing ratios, which does not require the cumbersome unfolding process. It is shown that the higher order spacing ratio distributions for the Wishart ensemble of random matrices, characterized by the Dyson index $beta$, is related to the first order spacing ratio distribution with a modified value of co-dimension $beta$. This scaling is demonstrated for Wishart ensemble and also for the spectra of empirical correlation matrices drawn from the observed stock market and atmospheric pressure data. Using a combination of analytical and numerics, such scalings in spacing distributions are also discussed.
The performance of the multifractal detrended analysis on short time series is evaluated for synthetic samples of several mono- and multifractal models. The reconstruction of the generalized Hurst exponents is used to determine the range of applicability of the method and the precision of its results as a function of the decreasing length of the series. As an application the series of the daily exchange rate between the U.S. dollar and the euro is studied.
The process of collecting and organizing sets of observations represents a common theme throughout the history of science. However, despite the ubiquity of scientists measuring, recording, and analyzing the dynamics of different processes, an extensive organization of scientific time-series data and analysis methods has never been performed. Addressing this, annotated collections of over 35 000 real-world and model-generated time series and over 9000 time-series analysis algorithms are analyzed in this work. We introduce reduced representations of both time series, in terms of their properties measured by diverse scientific methods, and of time-series analysis methods, in terms of their behaviour on empirical time series, and use them to organize these interdisciplinary resources. This new approach to comparing across diverse scientific data and methods allows us to organize time-series datasets automatically according to their properties, retrieve alternatives to particular analysis methods developed in other scientific disciplines, and automate the selection of useful methods for time-series classification and regression tasks. The broad scientific utility of these tools is demonstrated on datasets of electroencephalograms, self-affine time series, heart beat intervals, speech signals, and others, in each case contributing novel analysis techniques to the existing literature. Highly comparative techniques that compare across an interdisciplinary literature can thus be used to guide more focused research in time-series analysis for applications across the scientific disciplines.
We develop a method for the multifractal characterization of nonstationary time series, which is based on a generalization of the detrended fluctuation analysis (DFA). We relate our multifractal DFA method to the standard partition function-based multifractal formalism, and prove that both approaches are equivalent for stationary signals with compact support. By analyzing several examples we show that the new method can reliably determine the multifractal scaling behavior of time series. By comparing the multifractal DFA results for original series to those for shuffled series we can distinguish multifractality due to long-range correlations from multifractality due to a broad probability density function. We also compare our results with the wavelet transform modulus maxima (WTMM) method, and show that the results are equivalent.
We introduce a generalization of Higuchis estimator of the fractal dimension as a new way to characterize the multifractal spectrum of univariate time series. The resulting multifractal Higuchi dimension analysis (MF-HDA) method considers the order-$q$ moments of the partition function provided by the length of the time series graph at different levels of subsampling. The results obtained for different types of stochastic processes as well as real-world examples of word length series from fictional texts demonstrate that MF-HDA provides a reliable estimate of the multifractal spectrum already for moderate time series lengths. Practical advantages as well as disadvantages of the new approach as compared to other state-of-the-art methods of multifractal analysis are discussed, highlighting the particular potentials of MF-HDA to distinguish mono- from multi-fractal dynamics based on relatively short time series.