Do you want to publish a course? Click here

Machine learning configuration interaction for ab initio potential energy curves

69   0   0.0 ( 0 )
 Added by Jeremy Coe
 Publication date 2019
  fields Physics
and research's language is English
 Authors J. P. Coe




Ask ChatGPT about the research

The concept of machine learning configuration interaction (MLCI) [J. Chem. Theory Comput. 2018, 14, 5739], where an artificial neural network (ANN) learns on the fly to select important configurations, is further developed so that accurate ab initio potential energy curves can be efficiently calculated. This development includes employing the artificial neural network also as a hash function for the efficient deletion of duplicates on the fly so that the singles and doubles space does not need to be stored and this barrier to scalability is removed. In addition configuration state functions are introduced into the approach so that pure spin states are guaranteed, and the transferability of data between geometries is exploited. This improved approach is demonstrated on potential energy curves for the nitrogen molecule, water, and carbon monoxide. The results are compared with full configuration interaction values, when available, and different transfer protocols are investigated. It is shown that, for all of the considered systems, accurate potential energy curves can now be efficiently computed with MLCI. For the potential curves of N$_{2}$ and CO, MLCI can achieve lower errors than stochastically selecting configurations while also using substantially less processor hours.

rate research

Read More

We introduce vibrational heat-bath configuration interaction (VHCI) as an accurate and efficient method for calculating vibrational eigenstates of anharmonic systems. Inspired by its origin in electronic structure theory, VHCI is a selected CI approach that uses a simple criterion to identify important basis states with a pre-sorted list of anharmonic force constants. Screened second-order perturbation theory and simple extrapolation techniques provide significant improvements to variational energy estimates. We benchmark VHCI on four molecules with 12 to 48 degrees of freedom and use anharmonic potential energy surfaces truncated at fourth and sixth order. For all molecules studied, VHCI produces vibrational spectra of tens or hundreds of states with sub-wavenumber accuracy at low computational cost.
We present four-dimensional ab initio potential energy surfaces for the three spin states of the NH-NH complex. The potentials are partially based on the work of Dhont et al. [J. Chem. Phys. 123, 184302 (2005)]. The surface for the quintet state is obtained at the RCCSD(T)/aug-cc-pVTZ level of theory and the energy diferences with the singlet and triplet states are calculated at the CASPTn/aug-cc-pVTZ (n = 2; 3) level of theory. The ab initio potentials are fitted to coupled spherical harmonics in the angular coordinates, and the long range is further expanded as a power series in 1/R. The RCCSD(T) potential is corrected for a size-consistency error prior to fitting. The long-range coeficients obtained from the fit are found to be in good agreement with perturbation theory calculations.
Machine Learning techniques can be used to represent high-dimensional potential energy surfaces for reactive chemical systems. Two such methods are based on a reproducing kernel Hilbert space representation or on deep neural networks. They can achieve a sub-1 kcal/mol accuracy with respect to reference data and can be used in studies of chemical dynamics. Their construction and a few typical examples are briefly summarized in the present contribution.
The recently developed semistochastic heat-bath configuration interaction (SHCI) method is a systematically improvable selected configuration interaction plus perturbation theory method capable of giving essentially exact energies for larger systems than is possible with other such methods. We compute SHCI atomization energies for 55 molecules which have been used as a test set in prior studies because their atomization energies are known from experiment. Basis sets from cc-pVDZ to cc-pV5Z are used, totaling up to 500 orbitals and a Hilbert space of $10^{32}$ Slater determinants for the largest molecules. For each basis, an extrapolated energy well within chemical accuracy (1 kcal/mol or 1.6 mHa/mol) of the exact energy for that basis is computed using only a tiny fraction of the entire Hilbert space. We also use our almost exact energies to benchmark coupled-cluster [CCSD(T)] energies. The energies are extrapolated to the complete basis set limit and compared to the experimental atomization energies. The extrapolations are done both without and with a basis-set correction based on density-functional theory. The mean absolute deviations from experiment for these extrapolations are 0.46 kcal/mol and 0.51 kcal/mol, respectively. Orbital optimization methods used to obtain improved convergence of the SHCI energies are also discussed.
The applications of machine learning techniques to chemistry and materials science become more numerous by the day. The main challenge is to devise representations of atomic systems that are at the same time complete and concise, so as to reduce the number of reference calculations that are needed to predict the properties of different types of materials reliably. This has led to a proliferation of alternative ways to convert an atomic structure into an input for a machine-learning model. We introduce an abstract definition of chemical environments that is based on a smoothed atomic density, using a bra-ket notation to emphasize basis set independence and to highlight the connections with some popular choices of representations for describing atomic systems. The correlations between the spatial distribution of atoms and their chemical identities are computed as inner products between these feature kets, which can be given an explicit representation in terms of the expansion of the atom density on orthogonal basis functions, that is equivalent to the smooth overlap of atomic positions (SOAP) power spectrum, but also in real space, corresponding to $n$-body correlations of the atom density. This formalism lays the foundations for a more systematic tuning of the behavior of the representations, by introducing operators that represent the correlations between structure, composition, and the target properties. It provides a unifying picture of recent developments in the field and indicates a way forward towards more effective and computationally affordable machine-learning schemes for molecules and materials.
comments
Fetching comments Fetching comments
Sign in to be able to follow your search criteria
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا