No Arabic abstract
Transitions between different conformational states are ubiquitous in proteins, being involved in signaling, catalysis and other fundamental activities in cells. However, modeling those processes is extremely difficult, due to the need of efficiently exploring a vast conformational space in order to seek for the actual transition path for systems whose complexity is already high in the steady states. Here we report a strategy that simplifies this task attacking the complexity on several sides. We first apply a minimalist coarse-grained model to Calmodulin, based on an empirical force field with a partial structural bias, to explore the transition paths between the apo-closed state and the Ca-bound open state of the protein. We then select representative structures along the trajectory based on a structural clustering algorithm and build a cleaned-up trajectory with them. We finally compare this trajectory with that produced by the online tool MinActionPath, by minimizing the action integral using a harmonic network model, and with that obtained by the PROMPT morphing method, based on an optimal mass transportation-type approach including physical constraints. The comparison is performed both on the structural and energetic level, using the coarse-grained and the atomistic force fields upon reconstruction. Our analysis indicates that this method returns trajectories capable of exploring intermediate states with physical meaning, retaining a very low computational cost, which can allow systematic and extensive exploration of the multi-stable proteins transition pathways.
Bottom-up coarse-grained molecular dynamics models are parameterized using complex effective Hamiltonians. These models are typically optimized to approximate high dimensional data from atomistic simulations. In contrast, human validation of these models is often limited to low dimensional statistics that do not necessarily differentiate between the CG model and said atomistic simulations. We propose that explainable machine learning can directly convey high-dimensional error to scientists and use Shapley additive explanations do so in two coarse-grained protein models.
During the last decade coarse-grained nucleotide models have emerged that allow us to DNA and RNA on unprecedented time and length scales. Among them is oxDNA, a coarse-grained, sequence-specific model that captures the hybridisation transition of DNA and many structural properties of single- and double-stranded DNA. oxDNA was previously only available as standalone software, but has now been implemented into the popular LAMMPS molecular dynamics code. This article describes the new implementation and analyses its parallel performance. Practical applications are presented that focus on single-stranded DNA, an area of research which has been so far under-investigated. The LAMMPS implementation of oxDNA lowers the entry barrier for using the oxDNA model significantly, facilitates future code development and interfacing with existing LAMMPS functionality as well as other coarse-grained and atomistic DNA models.
Numerous molecular systems, including solutions, proteins, and composite materials, can be modeled using mixed-resolution representations, of which the quantum mechanics/molecular mechanics (QM/MM) approach has become the most widely used. However, the QM/MM approach often faces a number of challenges, including the slow sampling of the large configuration space for the MM part, the high cost of repetitive QM computations for changing coordinates of atoms in the MM surroundings, and a difficulty in providing a simple, qualitative interpretation of numerical results in terms of the influence of the molecular environment upon the active QM region. In this paper, we address these issues by combining QM/MM modeling with the methodology of bottom-up coarse-graining (CG) to provide the theoretical basis for a systematic quantum-mechanical/coarse-grained molecular mechanics (QM/CG-MM) mixed resolution approach. A derivation of the method is presented based on a combination of statistical mechanics and quantum mechanics, leading to an equation for the effective Hamiltonian of the QM part, a central concept in the QM/CG-MM theory. A detailed analysis of different contributions to the effective Hamiltonian from electrostatic, induction, dispersion and exchange interactions between the QM part and the surroundings is provided, serving as a foundation for a potential hierarchy of QM/CG-MM methods varying in their accuracy and computational cost. A relationship of the QM/CG-MM methodology to other mixed resolution approaches is also discussed.
We present a protocol for the fully automated construction of quantum mechanical-(QM)-classical hybrid models by extending our previously reported approach on self-parametrizing system-focused atomistic models (SFAM) J. Chem. Theory Comput. 2020, 16, 1646]. In this QM/SFAM approach, the size and composition of the QM region is evaluated in an automated manner based on first principles so that the hybrid model describes the atomic forces in the center of the QM region accurately. This entails the automated construction and evaluation of differently sized QM regions with a bearable computational overhead that needs to be paid for automated validation procedures. Applying SFAM for the classical part of the model eliminates any dependence on pre-existing parameters due to its system-focused quantum mechanically derived parametrization. Hence, QM/SFAM is capable of delivering a high fidelity and complete automation. Furthermore, since SFAM parameters are generated for the whole system, our ansatz allows for a convenient re-definition of the QM region during a molecular exploration. For this purpose, a local re-parametrization scheme is introduced, which efficiently generates additional classical parameters on the fly when new covalent bonds are formed (or broken) and moved to the classical region.
Solvent can occupy up to ~70% of macromolecular crystals and hence having models that predict solvent distributions in periodic systems could improve in the interpretation of crystallographic data. Yet there are few implicit solvent models applicable to periodic solutes while crystallographic structures are commonly solved assuming a flat solvent model. Here we present a newly-developed periodic version of the 3D-RISM integral equation method that is able to solve for efficiently and describe accurately water and ions distributions in periodic systems; the code can compute accurate gradients that can be used in minimizations or molecular dynamics simulations. The new method includes an extension of the OZ equation needed to yield charge neutrality for charged solutes which requires an additional contribution to the excess chemical potential that has not been previously identified; this is an important consideration for nucleic acids or any other charged system where most or all of the counter- and co-ions are part of the disordered solvent. We present of several calculations of protein, RNA and small molecule crystals to show that X-ray scattering intensities and solvent structure predicted by the periodic 3D-RISM solvent model are in closer agreement with experiment than are intensities computed using the default flat solvent model in the refmac5 or phenix refinement programs, with the greatest improvement in the 2 to 4 {AA} range. Prospects for incorporating integral equation models into crystallographic refinement are discussed.