No Arabic abstract
The molecular polarizability describes the tendency of a molecule to deform or polarize in response to an applied electric field. As such, this quantity governs key intra- and inter-molecular interactions such as induction and dispersion, plays a key role in determining the spectroscopic signatures of molecules, and is an essential ingredient in polarizable force fields and other empirical models for collective interactions. Compared to other ground-state properties, an accurate and reliable prediction of the molecular polarizability is considerably more difficult as this response quantity is quite sensitive to the description of the underlying molecular electronic structure. In this work, we present state-of-the-art quantum mechanical calculations of the static dipole polarizability tensors of 7,211 small organic molecules computed using linear-response coupled-cluster singles and doubles theory (LR-CCSD). Using a symmetry-adapted machine-learning based approach, we demonstrate that it is possible to predict the molecular polarizability with LR-CCSD accuracy at a negligible computational cost. The employed model is quite robust and transferable, yielding molecular polarizabilities for a diverse set of 52 larger molecules (which includes challenging conjugated systems, carbohydrates, small drugs, amino acids, nucleobases, and hydrocarbon isomers) at an accuracy that exceeds that of hybrid density functional theory (DFT). The atom-centered decomposition implicit in our machine-learning approach offers some insight into the shortcomings of DFT in the prediction of this fundamental quantity of interest.
A variant of coupled-cluster theory is described here, wherein the degrees of freedom are fluctuations of fragments between internally correlated states. The effects of intra-fragment correlation on the inter-fragment interaction are pre-computed and permanently folded into an effective Hamiltonian, thus avoiding redundant evaluations of local relaxations associated with coupled fluctuations. A companion article shows that a low-scaling step may be used to cast the electronic Hamiltonians of real systems into the form required. Two proof-of-principle demonstrations are presented here for non-covalent interactions. One uses harmonic oscillators, for which accuracy and algorithm structure can be carefully controlled in comparisons. The other uses small electronic systems (Be atoms) to demonstrate compelling accuracy and efficiency, also when inter-fragment electron exchange and charge transfer must be handled. Since the cost of the global calculation does not depend directly on the correlation models used for the fragments, this should provide a way to incorporate difficult electronic structure problems into large systems. This framework opens a promising path for building tunable, systematically improvable methods to capture properties of systems interacting with a large number of other systems. The extension to excited states is also straightforward.
We consider the sampling of the coupled cluster expansion within stochastic coupled cluster theory. Observing the limitations of previous approaches due to the inherently non-linear behaviour of a coupled cluster wavefunction representation we propose new approaches based upon an intuitive, well-defined condition for sampling weights and on sampling the expansion in cluster operators of different excitation levels. We term these modifications even and truncated selection respectively. Utilising both approaches demonstrates dramatically improved calculation stability as well as reduced computational and memory costs. These modifications are particularly effective at higher truncation levels owing to the large number of terms within the cluster expansion that can be neglected, as demonstrated by the reduction of the number of terms to be sampled at the level of CCSDT by 77% and at CCSDTQ56 by 98%.
We present a scheme to obtain an inexpensive and reliable estimate of the uncertainty associated with the predictions of a machine-learning model of atomic and molecular properties. The scheme is based on resampling, with multiple models being generated based on sub-sampling of the same training data. The accuracy of the uncertainty prediction can be benchmarked by maximum likelihood estimation, which can also be used to correct for correlations between resampled models, and to improve the performance of the uncertainty estimation by a cross-validation procedure. In the case of sparse Gaussian Process Regression models, this resampled estimator can be evaluated at negligible cost. We demonstrate the reliability of these estimates for the prediction of molecular energetics, and for the estimation of nuclear chemical shieldings in molecular crystals. Extension to estimate the uncertainty in energy differences, forces, or other correlated predictions is straightforward. This method can be easily applied to other machine learning schemes, and will be beneficial to make data-driven predictions more reliable, and to facilitate training-set optimization and active-learning strategies.
A block-correlated coupled cluster (BCCC) method based on the generalized valence bond (GVB) wave function (GVB-BCCC in short) is proposed and implemented at the ab initio level, which represents an attractive multireference electronic structure method for strongly correlated systems. The GVB-BCCC method is demonstrated to provide accurate descriptions for multiple bond breaking in small molecules, although the GVB reference function is qualitatively wrong for the studied processes. For a challenging prototype of strongly correlated systems, tridecane with all 12 single C-C bonds at various distances, our calculations have shown that the GVB-BCCC2b method can provide highly comparable results as the density matrix renormalization group method for potential energy surfaces along simultaneous dissociation of all C-C bonds.
We present a coupled cluster and linear response theory to compute properties of many-electron systems at non-zero temperatures. For this purpose, we make use of the thermofield dynamics, which allows for a compact wavefunction representation of the thermal density matrix, and extend our recently developed framework [J. Chem. Phys. 150, 154109 (2019)] to parameterize the so-called thermal state using an exponential ansatz with cluster operators that create thermal quasiparticle excitations on a mean-field reference. As benchmark examples, we apply this method to both model (one-dimensional Hubbard and Pairing) as well as ab-initio (atomic Beryllium and molecular Hydrogen) systems, while comparing with exact results.