No Arabic abstract
Given nonstationary data, one generally wants to extract the trend from the noise by smoothing or filtering. However, it is often important to delineate a third intermediate category, that we call high frequency (HF) features: this is the case in our motivating example, which consists in experimental measurements of the time-dynamics of depolymerising protein fibrils average size. One may intuitively visualise HF features as the presence of fast, possibly nonstationary and transient oscillations, distinct from a slowly-varying trend envelope. The aim of this article is to propose an empirical definition of HF features and construct estimators and statistical tests for their presence accordingly, when the data consists of a noisy nonstationary 1-dimensional signal. We propose a parametric characterization in the Fourier domain of the HF features by defining a maximal amplitude and distance to low frequencies of significant energy. We introduce a data-driven procedure to estimate these parameters, and compute a p-value proxy based on a statistical test for the presence of HF features. The test is first conducted on simulated signals where the ratio amplitude of the HF features to the level of the noise is controlled. The test detects HF features even when the level of noise is five times larger than the amplitude of the oscillations. In a second part, the test is conducted on experimental data from Prion disease experiments and it confirms the presence of HF features in these signals with significant confidence.
The detection and characterization of partial discharge (PD) are crucial for the insulation diagnosis of overhead lines with covered conductors. With the release of a large dataset containing thousands of naturally obtained high-frequency voltage signals, data-driven analysis of fault-related PD patterns on an unprecedented scale becomes viable. The high diversity of PD patterns and background noise interferences motivates us to design an innovative pulse shape characterization method based on clustering techniques, which can dynamically identify a set of representative PD-related pulses. Capitalizing on those pulses as referential patterns, we construct insightful features and develop a novel machine learning model with a superior detection performance for early-stage covered conductor faults. The presented model outperforms the winning model in a Kaggle competition and provides the state-of-the-art solution to detect real-time disturbances in the field.
Orthogonal frequency-division multiplexing (OFDM) has been selected as the basis for the fifth-generation new radio (5G-NR) waveform developments. However, effective signal processing tools are needed for enhancing the OFDM spectrum in various advanced transmission scenarios. In earlier work, we have shown that fast-convolution (FC) processing is a very flexible and efficient tool for filtered-OFDM signal generation and receiver-side subband filtering, e.g., for the mixed-numerology scenarios of the 5G-NR. FC filtering approximates linear convolution through effective fast Fourier transform (FFT)-based circular convolutions using partly overlapping processing blocks. However, with the continuous overlap-and-save and overlap-and-add processing models with fixed block-size and fixed overlap, the FC-processing blocks cannot be aligned with all OFDM symbols of a transmission frame. Furthermore, 5G-NR numerology does not allow to use transform lengths shorter than 128 because this would lead to non-integer cyclic prefix (CP) lengths. In this article, we present new FC-processing schemes which solve the mentioned limitations. These schemes are based on dynamically adjusting the overlap periods and extrapolating the CP samples, which make it possible to align the FC blocks with each OFDM symbol, even in case of variable CP lengths. This reduces complexity and latency, e.g., in mini-slot transmissions and, as an example, allows to use 16-point transforms in case of a 12-subcarrier-wide subband allocation, greatly reducing the implementation complexity. On the receiver side, the proposed scheme makes it possible to effectively combine cascaded inverse and forward FFT units in FC-filtered OFDM processing. Transform decomposition is used to simplify these computations. Very extensive set of numerical results is also provided, in terms of radio-link performance and associated processing complexity.
A neural network is essentially a high-dimensional complex mapping model by adjusting network weights for feature fitting. However, the spectral bias in network training leads to unbearable training epochs for fitting the high-frequency components in broadband signals. To improve the fitting efficiency of high-frequency components, the PhaseDNN was proposed recently by combining complex frequency band extraction and frequency shift techniques [Cai et al. SIAM J. SCI. COMPUT. 42, A3285 (2020)]. Our paper is devoted to an alternative candidate for fitting complex signals with high-frequency components. Here, a parallel frequency function-deep neural network (PFF-DNN) is proposed to suppress computational overhead while ensuring fitting accuracy by utilizing fast Fourier analysis of broadband signals and the spectral bias nature of neural networks. The effectiveness and efficiency of the proposed PFF-DNN method are verified based on detailed numerical experiments for six typical broadband signals.
Training machine learning algorithms from a small and imbalanced dataset is often a daunting challenge in medical research. However, it has been shown that the synthetic data generated by data augmentation techniques can enlarge the dataset and contribute to alleviating the imbalance situation. In this study, we propose a novel generative adversarial network (GAN) architecture-Welch-GAN and focused on examining how its influence on classifier performance is related to signal quality and class imbalance within the context of photoplethysmography (PPG)-based atrial fibrillation (AF) detection. Pulse oximetry data were collected from 126 adult patients and augmented using the permutation technique to build a large training set for training an AF detection model based on a one-dimensional residual neural network. To test the model, PPG data were collected from 13 stroke patients and utilized. Four data augmentation methods, including both traditional and GANs, are leveraged as baseline in this study. Three different experiments are designed to investigate each data augmentation methods from the aspect of performance gain, robustness to motion artifact and training sample size, respectively. Compared to the un-augmented data, by training the same AF classification algorithm using augmented data, the AF detection accuracy was significantly improved from 80.36% to over 90% with no compromise on sensitivity nor on negative predicted value. Within each data augmentation techniques, Welch-GAN has shown around 3% superiority in terms of AF detection accuracy compared to the baseline methods, which suggests the state-of-the-art of our proposed Welch-GAN.
The design of high-resolution and cross-term (CT) free time-frequency distributions (TFDs) has been an open problem. Classical kernel based methods are limited by the trade-off between TFD resolution and CT suppression, even under optimally derived parameters. To break the current limitation, we propose a data-driven kernel learning model directly based on Wigner-Ville distribution (WVD). The proposed kernel learning based TFD (KL-TFD) model includes several stacked multi-channel learning convolutional kernels. Specifically, a skipping operator is utilized to maintain correct information transmission, and a weighted block is employed to exploit spatial and channel dependencies. These two designs simultaneously achieve high TFD resolution and CT elimination. Numerical experiments on both synthetic and real-world data confirm the superiority of the proposed KL-TFD over traditional kernel function methods.