No Arabic abstract
Transport of ions and small molecules across the cell membrane against electrochemical gradients is catalyzed by integral membrane proteins that use a source of free energy to drive the energetically uphill flux of the transported substrate. Secondary active transporters couple the spontaneous influx of a driving ion such as Na+ or H+ to the flux of the substrate. The thermodynamics of such cyclical non-equilibrium systems are well understood and recent work has focused on the molecular mechanism of secondary active transport. The fact that these transporters change their conformation between an inward-facing and outward-facing conformation in a cyclical fashion, called the alternating access model, is broadly recognized as the molecular framework in which to describe transporter function. However, only with the advent of high resolution crystal structures and detailed computer simulations has it become possible to recognize common molecular-level principles between disparate transporter families. Inverted repeat symmetry in secondary active transporters has shed light on how protein structures can encode a bi-stable two-state system. More detailed analysis (based on experimental structural data and detailed molecular dynamics simulations) indicates that transporters can be understood as gated pores with at least two coupled gates. These gates are not just a convenient cartoon element to illustrate a putative mechanism but map to distinct parts of the transporter protein. Enumerating all distinct gate states naturally includes occluded states in the alternating access picture and also suggests what kind of protein conformations might be observable. By connecting the possible conformational states and ion/substrate bound states in a kinetic model, a unified picture emerges in which symporter, antiporter, and uniporter function are extremes in a continuum of functionality.
The $alpha$ and $beta$ subunits comprising the hexameric assembly of F1-ATPase share a high degree of structural identity, though low primary identity. Each subunit binds nucleotide in similar pockets, yet only $beta$ subunits are catalytically active. Why? We re-examine their internal symmetry axes and observe interesting differences. Dividing each chain into an N-terminal head region, a C-terminal foot region, and a central torso, we observe (1) that while the foot and head regions in all chains obtain high and similar mobility, the torsos obtain different mobility profiles, with the $beta$ subunits exhibiting a higher motility compared to the $alpha$ subunits, a trend supported by the crystallographic B-factors. The $beta$ subunits have greater torso mobility by having fewer distributed, nonlocal packing interactions providing a spacious and soft connectivity, and offsetting the resultant softness with local stiffness elements, including an additional $beta$ sheet. (2) A loop near the nucleotide binding-domain of the $beta$ subunits, absent in the $alpha$ subunits, swings to create a large variation in the occlusion of the nucleotide binding region. (3) A combination of the softest three eigenmodes significantly reduces the RMSD between the open and closed conformations of the $beta$ subnits. (4) Comparisons of computed and observed crystallographic B-factors suggest a suppression of a particular symmetry axis in an $alpha$ subunit. (5) Unexpectedly, the soft intra-monomer oscillations pertain to distortions that do not create inter-monomer steric clashes in the assembly, suggesting that structural optimization of the assembly evolved at all levels of complexity.
Secondary structure plays an important role in determining the function of non-coding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine-learning technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on machine-learning technologies and a tabularized summary of the most important methods in this field. The current pending issues in the field of RNA secondary structure prediction and future trends are also discussed.
RNA molecules form a sequence-specific self-pairing pattern at low temperatures. We analyze this problem using a random pairing energy model as well as a random sequence model that includes a base stacking energy in favor of helix propagation. The free energy cost for separating a chain into two equal halves offers a quantitative measure of sequence specific pairing. In the low temperature glass phase, this quantity grows quadratically with the logarithm of the chain length, but it switches to a linear behavior of entropic origin in the high temperature molten phase. Transition between the two phases is continuous, with characteristics that resemble those of a disordered elastic manifold in two dimensions. For designed sequences, however, a power-law distribution of pairing energies on a coarse-grained level may be more appropriate. Extreme value statistics arguments then predict a power-law growth of the free energy cost to break a chain, in agreement with numerical simulations. Interestingly, the distribution of pairing distances in the ground state secondary structure follows a remarkable power-law with an exponent -4/3, independent of the specific assumptions for the base pairing energies.
Our work is concerned with the generation and targeted design of RNA, a type of genetic macromolecule that can adopt complex structures which influence their cellular activities and functions. The design of large scale and complex biological structures spurs dedicated graph-based deep generative modeling techniques, which represents a key but underappreciated aspect of computational drug discovery. In this work, we investigate the principles behind representing and generating different RNA structural modalities, and propose a flexible framework to jointly embed and generate these molecular structures along with their sequence in a meaningful latent space. Equipped with a deep understanding of RNA molecular structures, our most sophisticated encoding and decoding methods operate on the molecular graph as well as the junction tree hierarchy, integrating strong inductive bias about RNA structural regularity and folding mechanism such that high structural validity, stability and diversity of generated RNAs are achieved. Also, we seek to adequately organize the latent space of RNA molecular embeddings with regard to the interaction with proteins, and targeted optimization is used to navigate in this latent space to search for desired novel RNA molecules.
For decades, dimethyl sulfate (DMS) mapping has informed manual modeling of RNA structure in vitro and in vivo. Here, we incorporate DMS data into automated secondary structure inference using a pseudo-energy framework developed for 2-OH acylation (SHAPE) mapping. On six non-coding RNAs with crystallographic models, DMS- guided modeling achieves overall false negative and false discovery rates of 9.5% and 11.6%, comparable or better than SHAPE-guided modeling; and non-parametric bootstrapping provides straightforward confidence estimates. Integrating DMS/SHAPE data and including CMCT reactivities give small additional improvements. These results establish DMS mapping - an already routine technique - as a quantitative tool for unbiased RNA structure modeling.