No Arabic abstract
This paper proposes a new method that combines check-pointing methods with error-controlled lossy compression for large-scale high-performance Full-Waveform Inversion (FWI), an inverse problem commonly used in geophysical exploration. This combination can significantly reduce data movement, allowing a reduction in run time as well as peak memory. In the Exascale computing era, frequent data transfer (e.g., memory bandwidth, PCIe bandwidth for GPUs, or network) is the performance bottleneck rather than the peak FLOPS of the processing unit. Like many other adjoint-based optimization problems, FWI is costly in terms of the number of floating-point operations, large memory footprint during backpropagation, and data transfer overheads. Past work for adjoint methods has developed checkpointing methods that reduce the peak memory requirements during backpropagation at the cost of additional floating-point computations. Combining this traditional checkpointing with error-controlled lossy compression, we explore the three-way tradeoff between memory, precision, and time to solution. We investigate how approximation errors introduced by lossy compression of the forward solution impact the objective function gradient and final inverted solution. Empirical results from these numerical experiments indicate that high lossy-compression rates (compression factors ranging up to 100) have a relatively minor impact on convergence rates and the quality of the final solution.
Recently, the Wasserstein loss function has been proven to be effective when applied to deterministic full-waveform inversion (FWI) problems. We consider the application of this loss function in Bayesian FWI so that the uncertainty can be captured in the solution. Other loss functions that are commonly used in practice are also considered for comparison. Existence and stability of the resulting Gibbs posteriors are shown on function space under weak assumptions on the prior and model. In particular, the distribution arising from the Wasserstein loss is shown to be quite stable with respect to high-frequency noise in the data. We then illustrate the difference between the resulting distributions numerically, using Laplace approximations to estimate the unknown velocity field and uncertainty associated with the estimates.
Seismic full-waveform inversion (FWI), which uses iterative methods to estimate high-resolution subsurface models from seismograms, is a powerful imaging technique in exploration geophysics. In recent years, the computational cost of FWI has grown exponentially due to the increasing size and resolution of seismic data. Moreover, it is a non-convex problem and can encounter local minima due to the limited accuracy of the initial velocity models or the absence of low frequencies in the measurements. To overcome these computational issues, we develop a multiscale data-driven FWI method based on fully convolutional networks (FCN). In preparing the training data, we first develop a real-time style transform method to create a large set of synthetic subsurface velocity models from natural images. We then develop two convolutional neural networks with encoder-decoder structure to reconstruct the low- and high-frequency components of the subsurface velocity models, separately. To validate the performance of our data-driven inversion method and the effectiveness of the synthesized training set, we compare it with conventional physics-based waveform inversion approaches using both synthetic and field data. These numerical results demonstrate that, once our model is fully trained, it can significantly reduce the computation time, and yield more accurate subsurface velocity models in comparison with conventional FWI.
In this article, continuous Galerkin finite elements are applied to perform full waveform inversion (FWI) for seismic velocity model building. A time-domain FWI approach is detailed that uses meshes composed of variably sized triangular elements to discretize the domain. To resolve both the forward and adjoint-state equations, and to calculate a mesh-independent gradient associated with the FWI process, a fully-explicit, variable higher-order (up to degree $k=5$ in $2$D and $k=3$ in 3D) mass lumping method is used. By adapting the triangular elements to the expected peak source frequency and properties of the wavefield (e.g., local P-wavespeed) and by leveraging higher-order basis functions, the number of degrees-of-freedom necessary to discretize the domain can be reduced. Results from wave simulations and FWIs in both $2$D and 3D highlight our developments and demonstrate the benefits and challenges with using triangular meshes adapted to the material proprieties. Software developments are implemented an open source code built on top of Firedrake, a high-level Python package for the automated solution of partial differential equations using the finite element method.
The Hessian matrix plays an important role in correct interpretation of the multiple scattered wave fields inside the FWI frame work. Due to the high computational costs, the computation of the Hessian matrix is not feasible. Consequently, FWI produces overburden related artifacts inside the target zone model, due to the lack of the exact Hessian matrix. We have shown here that Marchenko-based target-oriented Full Waveform Inversion can compensate the need of Hessian matrix inversion by reducing the non-linearity due to overburden effects. This is achieved by exploiting Marchenko-based target replacement to remove the overburden response and its interactions with the target zone from residuals and inserting the response of the updated target zone into the response of the entire medium. We have also shown that this method is more robust with respect to prior information than the standard gradient FWI. Similarly to standard Marchenko imaging, the proposed method only requires knowledge of the direct arrival time from a focusing point to the surface and the reflection response of the medium.
In this paper, authors focus effort on improving the conventional discrete velocity method (DVM) into a multiscale scheme in finite volume framework for gas flow in all flow regimes. Unlike the typical multiscale kinetic methods unified gas-kinetic scheme (UGKS) and discrete unified gas-kinetic scheme (DUGKS), which concentrate on the evolution of the distribution function at the cell interface, in the present scheme the flux for macroscopic variables is split into the equilibrium part and the nonequilibrium part, and the nonequilibrium flux is calculated by integrating the discrete distribution function at the cell center, which overcomes the excess numerical dissipation of the conventional DVM in the continuum flow regime. Afterwards, the macroscopic variables are finally updated by simply integrating the discrete distribution function at the cell center, or by a blend of the increments based on the macroscopic and the microscopic systems, and the multiscale property is achieved. Several test cases, involving unsteady, steady, high speed, low speed gas flows in all flow regimes, have been performed, demonstrating the good performance of the multiscale DVM from free molecule to continuum Navier-Stokes solutions and the multiscale property of the scheme is proved.