No Arabic abstract
Molecular-orbital-based machine learning (MOB-ML) enables the prediction of accurate correlation energies at the cost of obtaining molecular orbitals. Here, we present the derivation, implementation, and numerical demonstration of MOB-ML analytical nuclear gradients which are formulated in a general Lagrangian framework to enforce orthogonality, localization, and Brillouin constraints on the molecular orbitals. The MOB-ML gradient framework is general with respect to the regression technique (e.g., Gaussian process regression or neural networks) and the MOB feature design. We show that MOB-ML gradients are highly accurate compared to other ML methods on the ISO17 data set while only being trained on energies for hundreds of molecules compared to energies and gradients for hundreds of thousands of molecules for the other ML methods. The MOB-ML gradients are also shown to yield accurate optimized structures, at a computational cost for the gradient evaluation that is comparable to Hartree-Fock theory or hybrid DFT.
Projection-based embedding provides a simple, robust, and accurate approach for describing a small part of a chemical system at the level of a correlated wavefunction method while the remainder of the system is described at the level of density functional theory. Here, we present the derivation, implementation, and numerical demonstration of analytical nuclear gradients for projection-based wavefunction-in-density functional theory (WF-in-DFT) embedding. The gradients are formulated in the Lagrangian framework to enforce orthogonality, localization, and Brillouin constraints on the molecular orbitals. An important aspect of the gradient theory is that WF contributions to the total WF-in-DFT gradient can be simply evaluated using existing WF gradient implementations without modification. Another simplifying aspect is that Kohn-Sham (KS) DFT contributions to the projection-based embedding gradient do not require knowledge of the WF calculation beyond the relaxed WF density. Projection-based WF-in-DFT embedding gradients are thus easily generalized to any combination of WF and KS-DFT methods. We provide numerical demonstration of the method for several applications, including calculation of a minimum energy pathway for a hydride transfer in a cobalt-based molecular catalyst using the nudged-elastic-band method at the CCSD-in-DFT level of theory, which reveals large differences from the transition state geometry predicted using DFT.
We address the degree to which machine learning can be used to accurately and transferably predict post-Hartree-Fock correlation energies. Refined strategies for feature design and selection are presented, and the molecular-orbital-based machine learning (MOB-ML) method is applied to several test systems. Strikingly, for the MP2, CCSD, and CCSD(T) levels of theory, it is shown that the thermally accessible (350 K) potential energy surface for a single water molecule can be described to within 1 millihartree using a model that is trained from only a single reference calculation at a randomized geometry. To explore the breadth of chemical diversity that can be described, MOB-ML is also applied to a new dataset of thermalized (350 K) geometries of 7211 organic models with up to seven heavy atoms. In comparison with the previously reported $Delta$-ML method, MOB-ML is shown to reach chemical accuracy with three-fold fewer training geometries. Finally, a transferability test in which models trained for seven-heavy-atom systems are used to predict energies for thirteen-heavy-atom systems reveals that MOB-ML reaches chemical accuracy with 36-fold fewer training calculations than $Delta$-ML (140 versus 5000 training calculations).
Machine learning is used to approximate the kinetic energy of one dimensional diatomics as a functional of the electron density. The functional can accurately dissociate a diatomic, and can be systematically improved with training. Highly accurate self-consistent densities and molecular forces are found, indicating the possibility for ab-initio molecular dynamics simulations.
The computational investigation of photochemical processes often entails the calculation of excited state geometries, energies, and energy gradients. The nuclear-electronic orbital (NEO) approach treats specified nuclei, typically protons, quantum mechanically on the same level as the electrons, thereby including the associated nuclear quantum effects and non-Born-Oppenheimer behavior into quantum chemistry calculations. The multicomponent density functional theory (NEO-DFT) and time-dependent DFT (NEO-TDDFT) methods allow efficient calculations of ground and excited states, respectively. Herein, the analytical gradients are derived and implemented for the NEO-TDDFT method and the associated Tamm-Dancoff approximation (NEO-TDA). The programmable equations for these analytical gradients, as well as the NEO-DFT analytical Hessian, are provided. The NEO approach includes the anharmonic zero-point energy and density delocalization associated with the quantum protons, as well as vibronic mixing, in geometry optimizations and energy calculations of ground and excited states. The harmonic zero-point energy associated with the other nuclei can be computed via the NEO Hessian. This approach is used to compute the 0-0 adiabatic excitation energies for a set of nine small molecules with all protons quantized, exhibiting slight improvement over the conventional electronic approach. Geometry optimizations of two excited state intramolecular proton transfer systems are performed with one and two quantized protons, respectively. The NEO calculations for these systems produce electronically excited state geometries with stronger intramolecular hydrogen bonds and similar relative stabilities compared to conventional electronic methods. This work provides the foundation for nonadiabatic dynamics simulations of fundamental processes such as photoinduced proton transfer and proton-coupled electron transfer.
Recent synthetic studies on the organic molecules tetracene and pentacene have found certain dimers and oligomers to exhibit an intense absorption in the visible region of the spectrum which is not present in the monomer or many previously-studied dimers. In this article we combine experimental synthesis with electronic structure theory and spectral computation to show that this absorption arises from an otherwise dark charge-transfer excitation borrowing intensity from an intense UV excitation. Further, by characterizing the role of relevant monomer molecular orbitals, we arrive at a design principle that allows us to predict the presence or absence of an additional absorption based on the bonding geometry of the dimer. We find this rule correctly explains the spectra of a wide range of acene derivatives and solves an unexplained structure-spectrum phenomenon first observed seventy years ago. These results pave the way for the design of highly absorbent chromophores with applications ranging from photovoltaics to liquid crystals.