No Arabic abstract
The CoHSI (Conservation of Hartley-Shannon Information) distribution is at the heart of a wide-class of discrete systems, defining (amongst other properties) the length distribution of their components. Discrete systems such as the known proteome, computer software and texts are all known to fit this distribution accurately. In a previous paper, we explored the properties of this distribution in detail. Here we will use these properties to show why the average length of components in general and proteins in particular is highly conserved, howsoever measured, demonstrating this on various aggregations of proteins taken from the UniProt database. We will go on to define departures from this equilibrium state, identifying fine structure in the average length of eukaryotic proteins that result from evolutionary processes.
The CoHSI (Conservation of Hartley-Shannon Information) distribution is at the heart of a wide-class of discrete systems, defining the length distribution of their components amongst other global properties. Discrete systems such as the known proteome where components are proteins, computer software, where components are functions and texts where components are books, are all known to fit this distribution accurately. In this short paper, we explore its solution and its resulting properties and lay the foundation for a series of papers which will demonstrate amongst other things, why the average length of components is so highly conserved and why long components occur so frequently in these systems. These properties are not amenable to local arguments such as natural selection in the case of the proteome or human volition in the case of computer software, and indeed turn out to be inevitable global properties of discrete systems devolving directly from CoHSI and shared by all. We will illustrate this using examples from the Uniprot protein database as a prelude to subsequent studies.
Statistical analysis of alignments of large numbers of protein sequences has revealed sectors of collectively coevolving amino acids in several protein families. Here, we show that selection acting on any functional property of a protein, represented by an additive trait, can give rise to such a sector. As an illustration of a selected trait, we consider the elastic energy of an important conformational change within an elastic network model, and we show that selection acting on this energy leads to correlations among residues. For this concrete example and more generally, we demonstrate that the main signature of functional sectors lies in the small-eigenvalue modes of the covariance matrix of the selected sequences. However, secondary signatures of these functional sectors also exist in the extensively-studied large-eigenvalue modes. Our simple, general model leads us to propose a principled method to identify functional sectors, along with the magnitudes of mutational effects, from sequence data. We further demonstrate the robustness of these functional sectors to various forms of selection, and the robustness of our approach to the identification of multiple selected traits.
Prediction of stable crystal structures at given pressure-temperature conditions, based only on the knowledge of the chemical composition, is a central problem of condensed matter physics. This extremely challenging problem is often termed crystal structure prediction problem, and recently developed evolutionary algorithm USPEX (Universal Structure Predictor: Evolutionary Xtallography) made an important progress in solving it, enabling efficient and reliable prediction of structures with up to ~40 atoms in the unit cell using ab initio methods. Here we review this methodology, as well as recent progress in analyzing energy landscape of solids (which also helps to analyze results of USPEX runs). We show several recent applications - (1) prediction of new high-pressure phases of CaCO3, (2) search for the structure of the polymeric phase of CO2 (phase V), (3) high-pressure phases of oxygen, (4) exploration of possible stable compounds in the Xe-C system at high pressures, (5) exotic high-pressure phases of elements boron and sodium.
In structure-based models of proteins, one often assumes that folding is accomplished when all contacts are established. This assumption may frequently lead to a conceptual problem that folding takes place in a temperature region of very low thermodynamic stability, especially when the contact map used is too sparse. We consider six different structure-based models and show that allowing for a small, but model-dependent, percentage of the native contacts not being established boosts the folding temperature substantially while affecting the time scales of folding only in a minor way. We also compare other properties of the six models. We show that the choice of the description of the backbone stiffness has a substantial effect on the values of characteristic temperatures that relate both to equilibrium and kinetic properties. Models without any backbone stiffness (like the self-organized polymer) are found to perform similar to those with the stiffness, including in the studies of stretching.
This work is a study of the inter-relationship between parameters that principally affect metal up-take in the plant. The relationships between the concentration of metal in the growth medium, Cs, the concentration of metal absorbed by the plant, Cp, and the total biomass achieved, M, all of which are factors relevant to the efficiency of phytoremediation of the plant, have been investigated via the macro-physiological response of Brassica juncea seedlings to Ni(II) stress. The factorial growth experiments treated the Ni(II) concentration in the agar gel and the diurnal light quanta (DLQ) as independently variable parameters. Observations included the evidence of light enhancement of Ni toxicity at the root as well as at the whole plant level, the shoot mass index as a possible indicator of shoot metal sequestration in B. juncea, the logarithmic variation of Cp with Cs and the power-law dependence of M on Cp. The sum total of these observations indicate that for the metal accumulator B. juncea with regard to its capacity to accumulate Ni, the overall metabolic nature of the plant is important; neither rapid biomass increase nor a high metal concentration capability favor the removal of high metal mass from the medium, but rather the plant with the moderate photosynthetically driven biomass growth and moderate metal concentrations demonstrated the ability to remove the maximum mass of metal from the medium. The implications of these observations in the context of the perceived need in phytoremediation engineering to maximize Cp and M simultaneously in the same plant, are discussed.