No Arabic abstract
An extensive analysis has been carried out of the performance of standard families of basis sets with the hierarchy of coupled cluster methods CC2, CCSD, CC3 and CCSDT in computing selected Oxygen, Carbon and Nitrogen K-edge (vertical) core excitation and ionization energies within a core-valence separated scheme in the molecules water, ammonia, and carbon monoxide. Complete basis set limits for the excitation energies have been estimated via different basis set extrapolation schemes. The importance of scalar relativistic effects has been established within the spin-free exact two-component theory in its one-electron variant (SFX2C-1e).
The traditional Gaussian basis sets used in modern quantum chemistry lack an electron-nuclear cusp, and hence struggle to accurately describe core electron properties. A recently introduced novel type of basis set, mixed ramp-Gaussians, introduce a new primitive function called a ramp function which addresses this issue. This paper introduces three new mixed ramp-Gaussian basis sets - STO-R, STO-RG and STO-R2G, made from a linear combination of ramp and Gaussian primitive functions - which are derived from the single-core-zeta Slater basis sets for the elements Li to Ne. This derivation is done in an analogous fashion to the famous STO-$n$G basis sets. The STO-RG basis functions are found to outperform the STO-3G basis functions and STO-R2G outperforms STO-6G, both in terms of wavefunction fit and other key quantities such as the one-electron energy and the electron-nuclear cusp. The second part of this paper performs preparatory investigations into how standard all-Gaussian basis sets can be converted to ramp-Gaussian basis sets through modifying the core basis functions. Using a test case of the 6-31G basis set for carbon, we determined that the second Gaussian primitive is less important when fitting a ramp-Gaussian core basis function directly to an all-Gaussian core basis function than when fitting to a Slater basis function. Further, we identified the basis sets that are single-core-zeta and thus should be most straightforward to convert to mixed ramp-Gaussian basis sets in the future.
Arguments showing that exchange-only optimized effective potential (xOEP) methods, with finite basis sets, cannot in general yield the Hartree-Fock (HF) ground state energy, but a higher one, are given. While the orbital products of a complete basis are linearly dependent, the HF ground state energy can only be obtained via a basis set xOEP scheme in the special case that all products of occupied and unoccupied orbitals emerging from the employed orbital basis set are linearly independent from each other. In this case, however, exchange potentials leading to the HF ground state energy exhibit unphysical oscillations and do not represent a Kohn-Sham (KS) exchange potential. These findings solve the seemingly paradoxical results of Staroverov, Scuseria and Davidson that certain finite basis set xOEP calculations lead to the HF ground state energy despite the fact that within a real space (or complete basis) representation the xOEP ground state energy is always higher than the HF energy. Moreover, whether or not the occupied and unoccupied orbital products are linearly independent, it is shown that basis set xOEP methods only represent exact exchange-only (EXX) KS methods, i.e., proper density-functional methods, if the orbital basis set and the auxiliary basis set representing the exchange potential are balanced to each other, i.e., if the orbital basis is comprehensive enough for a given auxiliary basis. Otherwise xOEP methods do not represent EXX KS methods and yield unphysical exchange potentials.
In optimization or machine learning problems we are given a set of items, usually points in some metric space, and the goal is to minimize or maximize an objective function over some space of candidate solutions. For example, in clustering problems, the input is a set of points in some metric space, and a common goal is to compute a set of centers in some other space (points, lines) that will minimize the sum of distances to these points. In database queries, we may need to compute such a some for a specific query set of $k$ centers. However, traditional algorithms cannot handle modern systems that require parallel real-time computations of infinite distributed streams from sensors such as GPS, audio or video that arrive to a cloud, or networks of weaker devices such as smartphones or robots. Core-set is a small data summarization of the input big data, where every possible query has approximately the same answer on both data sets. Generic techniques enable efficient coreset changed{maintenance} of streaming, distributed and dynamic data. Traditional algorithms can then be applied on these coresets to maintain the approximated optimal solutions. The challenge is to design coresets with provable tradeoff between their size and approximation error. This survey summarizes such constructions in a retrospective way, that aims to unified and simplify the state-of-the-art.
Calculations of the hyperpolarizability are typically much more difficult to converge with basis set size than the linear polarizability. In order to understand these convergence issues and hence obtain accurate ab initio values, we compare calculations of the static hyperpolarizability of the gas-phase chloroform molecule (CHCl_3) using three different kinds of basis sets: Gaussian-type orbitals, numerical basis sets, and real-space grids. Although all of these methods can yield similar results, surprisingly large, diffuse basis sets are needed to achieve convergence to comparable values. These results are interpreted in terms of local polarizability and hyperpolarizability densities. We find that the hyperpolarizability is very sensitive to the molecular structure, and we also assess the significance of vibrational contributions and frequency dispersion.
We consider the sampling of the coupled cluster expansion within stochastic coupled cluster theory. Observing the limitations of previous approaches due to the inherently non-linear behaviour of a coupled cluster wavefunction representation we propose new approaches based upon an intuitive, well-defined condition for sampling weights and on sampling the expansion in cluster operators of different excitation levels. We term these modifications even and truncated selection respectively. Utilising both approaches demonstrates dramatically improved calculation stability as well as reduced computational and memory costs. These modifications are particularly effective at higher truncation levels owing to the large number of terms within the cluster expansion that can be neglected, as demonstrated by the reduction of the number of terms to be sampled at the level of CCSDT by 77% and at CCSDTQ56 by 98%.