No Arabic abstract
We refine the OrbNet model to accurately predict energy, forces, and other response properties for molecules using a graph neural-network architecture based on features from low-cost approximated quantum operators in the symmetry-adapted atomic orbital basis. The model is end-to-end differentiable due to the derivation of analytic gradients for all electronic structure terms, and is shown to be transferable across chemical space due to the use of domain-specific features. The learning efficiency is improved by incorporating physically motivated constraints on the electronic structure through multi-task learning. The model outperforms existing methods on energy prediction tasks for the QM9 dataset and for molecular geometry optimizations on conformer datasets, at a computational cost that is thousand-fold or more reduced compared to conventional quantum-chemistry calculations (such as density functional theory) that offer similar accuracy.
An overview of computational methods to describe high-dimensional potential energy surfaces suitable for atomistic simulations is given. Particular emphasis is put on accuracy, computability, transferability and extensibility of the methods discussed. They include empirical force fields, representations based on reproducing kernels, using permutationally invariant polynomials, and neural network-learned representations and combinations thereof. Future directions and potential improvements are discussed primarily from a practical, application-oriented perspective.
Machine Learning techniques can be used to represent high-dimensional potential energy surfaces for reactive chemical systems. Two such methods are based on a reproducing kernel Hilbert space representation or on deep neural networks. They can achieve a sub-1 kcal/mol accuracy with respect to reference data and can be used in studies of chemical dynamics. Their construction and a few typical examples are briefly summarized in the present contribution.
The calculation of potential energy surfaces for quantum dynamics can be a time consuming task -- especially when a high level of theory for the electronic structure calculation is required. We propose an adaptive interpolation algorithm based on polyharmonic splines combined with a partition of unity approach. The adaptive node refinement allows to greatly reduce the number of sample points by employing a local error estimate. The algorithm and its scaling behavior is evaluated for a model function in 2, 3 and 4 dimensions. The developed algorithm allows for a more rapid and reliable interpolation of a potential energy surface within a given accuracy compared to the non-adaptive version.
Dynamics of flexible molecules are often determined by an interplay between local chemical bond fluctuations and conformational changes driven by long-range electrostatics and van der Waals interactions. This interplay between interactions yields complex potential-energy surfaces (PES) with multiple minima and transition paths between them. In this work, we assess the performance of state-of-the-art Machine Learning (ML) models, namely sGDML, SchNet, GAP/SOAP, and BPNN for reproducing such PES, while using limited amounts of reference data. As a benchmark, we use the cis to trans thermal relaxation in an azobenzene molecule, where at least three different transition mechanisms should be considered. Although GAP/SOAP, SchNet, and sGDML models can globally achieve chemical accuracy of 1 kcal mol-1 with fewer than 1000 training points, predictions greatly depend on the ML method used as well as the local region of the PES being sampled. Within a given ML method, large differences can be found between predictions of close-to-equilibrium and transition regions, as well as for different transition mechanisms. We identify key challenges that the ML models face in learning long-range interactions and the intrinsic limitations of commonly used atom-based descriptors. All in all, our results suggest switching from learning the entire PES within a single model to using multiple local models with optimized descriptors, training sets, and architectures for different parts of complex PES.
A novel algorithm was recently presented to utilize emerging time dependent probability density data to extract molecular potential energy surfaces. This paper builds on the previous work and seeks to enhance the capabilities of the extraction algorithm: An improved method of removing the generally ill-posed nature of the inverse problem is introduced via an extended Tikhonov regularization and methods for choosing the optimal regularization parameters are discussed. Several ways to incorporate multiple data sets are investigated, including the means to optimally combine data from many experiments exploring different portions of the potential. Results are presented on the stability of the inversion procedure, including the optimal combination scheme, under the influence of data noise. The method is applied to the simulated inversion of a double well system.