No Arabic abstract
Calibrating a Reynolds-averaged Navier-Stokes (RANS) model against data leads to an improvement. Determining {it a priori} if such an improvement generalizes to flows outside the calibration data is an outstanding challenge. This work attempts to address this challenge via global epistemic Uncertainty Quantification (UQ). Unlike the available epistemic UQ methods that are local and tell us a models uncertainty at one specific flow condition, the global epistemic UQ method presented in this work tells us also whether a perturbation of the original model would generalize. Specifically, the global epistemic UQ method evaluates a potential improvement in terms of its effectiveness and inconsistency. Any improvement can be put in one of the following four quadrants: first, high effectiveness, low inconsistency; second, high effectiveness, high inconsistency; third, low effectiveness, low inconsistency; and fourth, low effectiveness, high inconsistency. An improvement would generalize if and only if it is in the high effectiveness and low inconsistency quadrant. To demonstrate the concept, we apply the global epistemic UQ to full Reynolds stress modeling of a stratified shear layer. The global epistemic UQ results point to a model coefficient in the pressure-strain correlation closure (among others) as effective and consistent for predicting the quantity of interest of shear layers growth. We calibrate the model coefficient such that our RANS matches direct numerical simulation data at one flow condition. We show that the calibrated model generalizes to several other test flow conditions. On the other hand, when calibrating a high inconsistency term, we get a model that works at only the calibrated condition.
We develop a fast multi-fidelity modeling method for very complex correlations between high- and low-fidelity data by working in modal space to extract the proper correlation function. We apply this method to infer the amplitude of motion of a flexible marine riser in cross-flow, subject to vortex-induced vibrations (VIV). VIV are driven by an absolute instability in the flow, which imposes a frequency (Strouhal) law that requires a matching with the impedance of the structure; this matching is easily achieved because of the rapid parametric variation of the added mass force. As a result, the wavenumber of the riser spatial response is within narrow bands of uncertainty. Hence, an error in wavenumber prediction can cause significant phase-related errors in the shape of the amplitude of response along the riser, rendering correlation between low- and high-fidelity data very complex. Working in modal space as outlined herein, dense data from low-fidelity data, provided by the semi-empirical computer code VIVA, can correlate in modal space with few high-fidelity data, obtained from experiments or fully-resolved CFD simulations, to correct both phase and amplitude and provide predictions that agree very well overall with the correct shape of the amplitude response. We also quantify the uncertainty in the prediction using Bayesian modeling and exploit this uncertainty to formulate an active learning strategy for the best possible location of the sensors providing the high fidelity measurements.
We quantify the strength of the waves and their impact on the energy cascade in rotating turbulence by studying the wave number and frequency energy spectrum, and the time correlation functions of individual Fourier modes in numerical simulations in three dimensions in periodic boxes. From the spectrum, we find that a significant fraction of the energy is concentrated in modes with wave frequency $omega approx 0$, even when the external forcing injects no energy directly into these modes. However, for modes for which the period of the inertial waves $tau_omega$ is faster than the turnover time $tau_textrm{NL}$, a significant fraction of the remaining energy is concentrated in the modes that satisfy the dispersion relation of the waves. No evidence of accumulation of energy in the modes with $tau_omega = tau_textrm{NL}$ is observed, unlike what critical balance arguments predict. From the time correlation functions, we find that for modes with $tau_omega < tau_textrm{sw}$ (with $tau_textrm{sw}$ the sweeping time) the dominant decorrelation time is the wave period, and that these modes also show a slower modulation on the timescale $tau_textrm{NL}$ as assumed in wave turbulence theories. The rest of the modes are decorrelated with the sweeping time, including the very energetic modes modes with $omega approx 0$.
Obtaining accurate estimates of machine learning model uncertainties on newly predicted data is essential for understanding the accuracy of the model and whether its predictions can be trusted. A common approach to such uncertainty quantification is to estimate the variance from an ensemble of models, which are often generated by the generally applicable bootstrap method. In this work, we demonstrate that the direct bootstrap ensemble standard deviation is not an accurate estimate of uncertainty and propose a calibration method to dramatically improve its accuracy. We demonstrate the effectiveness of this calibration method for both synthetic data and physical datasets from the field of Materials Science and Engineering. The approach is motivated by applications in physical and biological science but is quite general and should be applicable for uncertainty quantification in a wide range of machine learning regression models.
Modeling realistic fluid and plasma flows is computationally intensive, motivating the use of reduced-order models for a variety of scientific and engineering tasks. However, it is challenging to characterize, much less guarantee, the global stability (i.e., long-time boundedness) of these models. The seminal work of Schlegel and Noack (JFM, 2015) provided a theorem outlining necessary and sufficient conditions to ensure global stability in systems with energy-preserving, quadratic nonlinearities, with the goal of evaluating the stability of projection-based models. In this work, we incorporate this theorem into modern data-driven models obtained via machine learning. First, we propose that this theorem should be a standard diagnostic for the stability of projection-based and data-driven models, examining the conditions under which it holds. Second, we illustrate how to modify the objective function in machine learning algorithms to promote globally stable models, with implications for the modeling of fluid and plasma flows. Specifically, we introduce a modified trapping SINDy algorithm based on the sparse identification of nonlinear dynamics (SINDy) method. This method enables the identification of models that, by construction, only produce bounded trajectories. The effectiveness and accuracy of this approach are demonstrated on a broad set of examples of varying model complexity and physical origin, including the vortex shedding in the wake of a circular cylinder.
Machine learning systems and also, specifically, automatic speech recognition (ASR) systems are vulnerable against adversarial attacks, where an attacker maliciously changes the input. In the case of ASR systems, the most interesting cases are targeted attacks, in which an attacker aims to force the system into recognizing given target transcriptions in an arbitrary audio sample. The increasing number of sophisticated, quasi imperceptible attacks raises the question of countermeasures. In this paper, we focus on hybrid ASR systems and compare four acoustic models regarding their ability to indicate uncertainty under attack: a feed-forward neural network and three neural networks specifically designed for uncertainty quantification, namely a Bayesian neural network, Monte Carlo dropout, and a deep ensemble. We employ uncertainty measures of the acoustic model to construct a simple one-class classification model for assessing whether inputs are benign or adversarial. Based on this approach, we are able to detect adversarial examples with an area under the receiving operator curve score of more than 0.99. The neural networks for uncertainty quantification simultaneously diminish the vulnerability to the attack, which is reflected in a lower recognition accuracy of the malicious target text in comparison to a standard hybrid ASR system.