No Arabic abstract
Intrinsically disordered proteins (IDPs) do not possess well-defined three-dimensional structures in solution under physiological conditions. We develop all-atom, united-atom, and coarse-grained Langevin dynamics simulations for the IDP alpha-synuclein that include geometric, attractive hydrophobic, and screened electrostatic interactions and are calibrated to the inter-residue separations measured in recent smFRET experiments. We find that alpha-synuclein is disordered with conformational statistics that are intermediate between random walk and collapsed globule behavior. An advantage of calibrated molecular simulations over constraint methods is that physical forces act on all residues, not only on residue pairs that are monitored experimentally, and these simulations can be used to study oligomerization and aggregation of multiple alpha-synuclein proteins that may precede amyloid formation.
We outline recent developments in artificial intelligence (AI) and machine learning (ML) techniques for integrative structural biology of intrinsically disordered proteins (IDP) ensembles. IDPs challenge the traditional protein structure-function paradigm by adapting their conformations in response to specific binding partners leading them to mediate diverse, and often complex cellular functions such as biological signaling, self organization and compartmentalization. Obtaining mechanistic insights into their function can therefore be challenging for traditional structural determination techniques. Often, scientists have to rely on piecemeal evidence drawn from diverse experimental techniques to characterize their functional mechanisms. Multiscale simulations can help bridge critical knowledge gaps about IDP structure function relationships - however, these techniques also face challenges in resolving emergent phenomena within IDP conformational ensembles. We posit that scalable statistical inference techniques can effectively integrate information gleaned from multiple experimental techniques as well as from simulations, thus providing access to atomistic details of these emergent phenomena.
Experiments indicate that unbinding rates of proteins from DNA can depend on the concentration of proteins in nearby solution. Here we present a theory of multi-step replacement of DNA-bound proteins by solution-phase proteins. For four different kinetic scenarios we calculate the depen- dence of protein unbinding and replacement rates on solution protein concentration. We find (1) strong effects of progressive rezipping of the solution-phase protein onto DNA sites liberated by unzipping of the originally bound protein; (2) that a model in which solution-phase proteins bind non-specifically to DNA can describe experiments on exchanges between the non specific DNA- binding proteins Fis-Fis and Fis-HU; (3) that a binding specific model describes experiments on the exchange of CueR proteins on specific binding sites.
Physically, disordered ensembles of non-homopolymeric polypeptides are expected to be heterogeneous; i.e., they should differ from those homogeneous ensembles of homopolymers that harbor an essentially unique relationship between average values of end-to-end distance $R_{rm EE}$ and radius of gyration $R_{rm g}$. It was posited recently, however, that small-angle X-ray scattering (SAXS) data on conformational dimensions of disordered proteins can be rationalized almost exclusively by homopolymer ensembles. Assessing this perspective, chain-model simulations are used to evaluate the discriminatory power of SAXS-determined molecular form factors (MFFs) with regard to homogeneous versus heterogeneous ensembles. The general approach adopted here is not bound by any assumption about ensemble encodability, in that the postulated heterogeneous ensembles we evaluated are not restricted to those entailed by simple interaction schemes. Our analysis of MFFs for certain heterogeneous ensembles with more narrowly distributed $R_{rm EE}$ and $R_{rm g}$ indicates that while they deviates from MFFs of homogeneous ensembles, the differences can be rather small. Remarkably, some heterogeneous ensembles with asphericity and $R_{rm EE}$ drastically different from those of homogeneous ensembles can nonetheless exhibit practically identical MFFs, demonstrating that SAXS MFFs do not afford unique characterizations of basic properties of conformational ensembles in general. In other words, the ensemble to MFF mapping is practically many-to-one and likely non-smooth. Heteropolymeric variations of the $R_{rm EE}$--$R_{rm g}$ relationship were further showcased using an analytical perturbation theory developed here for flexible heteropolymers. Ramifications of our findings for interpretation of experimental data are discussed.
Problems of search and recognition appear over different scales in biological systems. In this review we focus on the challenges posed by interactions between proteins, in particular transcription factors, and DNA and possible mechanisms which allow for a fast and selective target location. Initially we argue that DNA-binding proteins can be classified, broadly, into three distinct classes which we illustrate using experimental data. Each class calls for a different search process and we discuss the possible application of different search mechanisms proposed over the years to each class. The main thrust of this review is a new mechanism which is based on barrier discrimination. We introduce the model and analyze in detail its consequences. It is shown that this mechanism applies to all classes of transcription factors and can lead to a fast and specific search. Moreover, it is shown that the mechanism has interesting transient features which allow for stability at the target despite rapid binding and unbinding of the transcription factor from the target.
We study the behavior of five proteins at the air-water and oil-water interfaces by all-atom molecular dynamics. The proteins are found to get distorted when pinned to the interface. This behavior is consistent with the phenomenological way of introducing the interfaces in a coarse-grained model through a force that depends on the hydropathy indices of the residues. Proteins couple to the oil-water interface stronger than to the air- water one. They diffuse slower at the oil-water interface but do not depin from it, whereas depinning events are observed at the other interface. The reduction of the disulfide bonds slows the diffusion down.