Structural entanglements in protein complexes

102 0 0.0 ( 0 )

Download Cite

Added by Yani Zhao

Publication date 2017

fields Biology

and research's language is English

Authors Yani Zhao - Mateusz Chwastyk - Marek Cieplak

Biomolecules

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We consider multi-chain protein native structures and propose a criterion that determines whether two chains in the system are entangled or not. The criterion is based on the behavior observed by pulling at both temini of each chain simultaneously in the two chains. We have identified about 900 entangled systems in the Protein Data Bank and provided a more detailed analysis for several of them. We argue that entanglement enhances the thermodynamic stability of the system but it may have other functions: burying the hydrophobic residues at the interface, and increasing the DNA or RNA binding area. We also study the folding and stretching properties of the knotted dimeric proteins MJ0366, YibK and bacteriophytochrome. These proteins have been studied theoretically in their monomer

rate research

Protein structural variation in computational models and crystallographic data

127 - Dmitry A. Kondrashov , Adam W. Van Wynsberghe , Ryan M. Bannen 2006

Normal mode analysis offers an efficient way of modeling the conformational flexibility of protein structures. Simple models defined by contact topology, known as elastic network models, have been used to model a variety of systems, but the validation is typically limited to individual modes for a single protein. We use anisotropic displacement parameters from crystallography to test the quality of prediction of both the magnitude and directionality of conformational variance. Normal modes from four simple elastic network model potentials and from the CHARMM forcefield are calculated for a data set of 83 diverse, ultrahigh resolution crystal structures. While all five potentials provide good predictions of the magnitude of flexibility, the methods that consider all atoms have a clear edge at prediction of directionality, and the CHARMM potential produces the best agreement. The low-frequency modes from different potentials are similar, but those computed from the CHARMM potential show the greatest difference from the elastic network models. This was illustrated by computing the dynamic correlation matrices from different potentials for a PDZ domain structure. Comparison of normal mode results with anisotropic temperature factors opens the possibility of using ultrahigh resolution crystallographic data as a quantitative measure of molecular flexibility. The comprehensive evaluation demonstrates the costs and benefits of using normal mode potentials of varying complexity. Comparison of the dynamic correlation matrices suggests that a combination of topological and chemical potentials may help identify residues in which chemical forces make large contributions to intramolecular coupling.

Biomolecules Quantitative Methods

Structural changes in barley protein LTP1 isoforms at air-water interfaces

99 - Yani Zhao , Marek Cieplak 2017

We use a coarse-grained model to study the conformational changes in two barley proteins, LTP1 and its ligand adduct isoform LTP1b, that result from their adsorption to the air-water interface. The model introduces the interface through hydropathy indices. We justify the model by all-atom simulations. The choice of the proteins is motivated by making attempts to understand formation and stability of foam in beer. We demonstrate that both proteins flatten out at the interface and can make a continuous stabilizing and denser film. We show that the degree of the flattening depends on the protein -- the layers of LTP1b should be denser than those of LTP1 -- and on the presence of glycation. It also depends on the number ($le 4$) of the disulfide bonds in the proteins. The geometry of the proteins is sensitive to the specificity of the absent bonds. We provide estimates of the volume of cavities of the proteins when away from the interface.

Biomolecules

Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information

201 - Seonwoo Min , Seunghyun Park , Siwon Kim 2019

Motivation: Bridging the exponentially growing gap between the number of unlabeled and labeled proteins, a couple of works have adopted semi-supervised learning for protein sequence modeling. They pre-train a model with a substantial amount of unlabeled data and transfer the learned representations to various downstream tasks. Nonetheless, the current pre-training methods mostly rely on a language modeling task and often show limited performances. Therefore, a complementary protein-specific task for pre-training is necessary to better capture the information contained within unlabeled protein sequences. Results: In this paper, we introduce a novel pre-training scheme called PLUS, which stands for Protein sequence representations Learned Using Structural information. PLUS consists of masked language modeling and a complementary protein-specific pre-training task, namely same family prediction. PLUS can be used to pre-train various model architectures. In this work, we mainly use PLUS to pre-train a recurrent neural network (RNN) and refer to the resulting model as PLUS-RNN. It advances state-of-the-art pre-training methods on six out of seven tasks, i.e., (1) three protein(-pair)-level classification, (2) two protein-level regression, and (3) two amino-acid-level classification tasks. Furthermore, we present results from our ablation studies and interpretation analyses to better understand the strengths of PLUS-RNN. Availability: The codes and pre-trained models are available at https://github.com/mswzeus/PLUS/

Biomolecules Machine Learning Genomics

Deep Learning of Protein Structural Classes: Any Evidence for an Urfold?

216 - Menuka Jaiswal , Saad Saleem , Yonghyeon Kweon 2020

Recent computational advances in the accurate prediction of protein three-dimensional (3D) structures from amino acid sequences now present a unique opportunity to decipher the interrelationships between proteins. This task entails--but is not equivalent to--a problem of 3D structure comparison and classification. Historically, protein domain classification has been a largely manual and subjective activity, relying upon various heuristics. Databases such as CATH represent significant steps towards a more systematic (and automatable) approach, yet there still remains much room for the development of more scalable and quantitative classification methods, grounded in machine learning. We suspect that re-examining these relationships via a Deep Learning (DL) approach may entail a large-scale restructuring of classification schemes, improved with respect to the interpretability of distant relationships between proteins. Here, we describe our training of DL models on protein domain structures (and their associated physicochemical properties) in order to evaluate classification properties at CATHs homologous superfamily (SF) level. To achieve this, we have devised and applied an extension of image-classification methods and image segmentation techniques, utilizing a convolutional autoencoder model architecture. Our DL architecture allows models to learn structural features that, in a sense, define different homologous SFs. We evaluate and quantify pairwise distances between SFs by building one model per SF and comparing the loss functions of the models. Hierarchical clustering on these distance matrices provides a new view of protein interrelationships--a view that extends beyond simple structural/geometric similarity, and towards the realm of structure/function properties.

Biomolecules

Void distributions reveal structural link between jammed packings and protein cores

113 - John D. Treado , Zhe Mei , Lynne Regan 2018

Dense packing of hydrophobic residues in the cores of globular proteins determines their stability. Recently, we have shown that protein cores possess packing fraction $phi approx 0.56$, which is the same as dense, random packing of amino acid-shaped particles. In this article, we compare the structural properties of protein cores and jammed packings of amino acid-shaped particles in much greater depth by measuring their local and connected void regions. We find that the distributions of surface Voronoi cell volumes and local porosities obey similar statistics in both systems. We also measure the probability that accessible, connected void regions percolate as a function of the size of a spherical probe particle and show that both systems possess the same critical probe size. By measuring the critical exponent $tau$ that characterizes the size distribution of connected void clusters at the onset of percolation, we show that void percolation in packings of amino acid-shaped particles and protein cores belong to the same universality class, which is different from that for void percolation in jammed sphere packings. We propose that the connected void regions of proteins are a defining feature of proteins and can be used to differentiate experimentally observed proteins from decoy structures that are generated using computational protein design software. This work emphasizes that jammed packings of amino acid-shaped particles can serve as structural and mechanical analogs of protein cores, and could therefore be useful in modeling the response of protein cores to cavity-expanding and -reducing mutations.

Biomolecules Soft Condensed Matter Biological Physics