Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Nonlinear excitations in DNA: Aperiodic models vs actual genome sequences

177 0 0.0 ( 0 )

Download Cite

Added by Angel Sanchez

Publication date 2004

fields Biology Physics

and research's language is English

Authors Sara Cuenda - Angel Sanchez

Genomics Soft Condensed Matter Mathematical Physics

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

We study the effects of the sequence on the propagation of nonlinear excitations in simple models of DNA in which we incorporate actual DNA sequences obtained from human genome data. We show that kink propagation requires forces over a certain threshold, a phenomenon already found for aperiodic sequences [F. Domi nguez-Adame {em et al.}, Phys. Rev. E {bf 52}, 2183 (1995)]. For forces below threshold, the final stop positions are highly dependent on the specific sequence. The results of our model are consistent with the stick-slip dynamics of the unzipping process observed in experiments. We also show that the effective potential, a collective coordinate formalism introduced by Salerno and Kivshar [Phys. Lett. A {bf 193}, 263 (1994)] is a useful tool to identify key regions in DNA that control the dynamical behavior of large segments. Additionally, our results lead to further insights in the phenomenology observed in aperiodic systems.

rate research

Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

522 - Alexandre Drouin , Sebastien Gigu`ere , Vladana Sagatovich 2014

The increased affordability of whole genome sequencing has motivated its use for phenotypic studies. We address the problem of learning interpretable models for discrete phenotypes from whole genomes. We propose a general approach that relies on the Set Covering Machine and a k-mer representation of the genomes. We show results for the problem of predicting the resistance of Pseudomonas Aeruginosa, an important human pathogen, against 4 antibiotics. Our results demonstrate that extremely sparse models which are biologically relevant can be learnt using this approach.

Genomics Computational Engineering Machine Learning

Predicting genome-wide DNA methylation using methylation marks, genomic position, and DNA regulatory elements

377 - Weiwei Zhang , Tim D Spector , Panos Deloukas 2013

Background: Recent assays for individual-specific genome-wide DNA methylation profiles have enabled epigenome-wide association studies to identify specific CpG sites associated with a phenotype. Computational prediction of CpG site-specific methylation levels is important, but current approaches tackle average methylation within a genomic locus and are often limited to specific genomic regions. Results: We characterize genome-wide DNA methylation patterns, and show that correlation among CpG sites decays rapidly, making predictions solely based on neighboring sites challenging. We built a random forest classifier to predict CpG site methylation levels using as features neighboring CpG site methylation levels and genomic distance, and co-localization with coding regions, CGIs, and regulatory elements from the ENCODE project, among others. Our approach achieves 91% -- 94% prediction accuracy of genome-wide methylation levels at single CpG site precision. The accuracy increases to 98% when restricted to CpG sites within CGIs. Our classifier outperforms state-of-the-art methylation classifiers and identifies features that contribute to prediction accuracy: neighboring CpG site methylation status, CpG island status, co-localized DNase I hypersensitive sites, and specific transcription factor binding sites were found to be most predictive of methylation levels. Conclusions: Our observations of DNA methylation patterns led us to develop a classifier to predict site-specific methylation levels that achieves the best DNA methylation predictive accuracy to date. Furthermore, our method identified genomic features that interact with DNA methylation, elucidating mechanisms involved in DNA methylation modification and regulation, and linking different epigenetic processes.

Genomics

Identification of repeats in DNA sequences using nucleotide distribution uniformity

146 - Changchuan Yin 2016

Repetitive elements are important in genomic structures, functions and regulations, yet effective methods in precisely identifying repetitive elements in DNA sequences are not fully accessible, and the relationship between repetitive elements and periodicities of genomes is not clearly understood. We present an $textit{ab initio}$ method to quantitatively detect repetitive elements and infer the consensus repeat pattern in repetitive elements. The method uses the measure of the distribution uniformity of nucleotides at periodic positions in DNA sequences or genomes. It can identify periodicities, consensus repeat patterns, copy numbers and perfect levels of repetitive elements. The results of using the method on different DNA sequences and genomes demonstrate efficacy and accuracy in identifying repeat patterns and periodicities. The complexity of the method is linear with respect to the lengths of the analyzed sequences.

Genomics Computational Engineering Computer Vision and Pattern Recognition

Nonlinear molecular excitations in a completely inhomogeneous DNA chain

336 - M. Daniel , V. Vasumathi 2008

We study the nonlinear dynamics of a completely inhomogeneous DNA chain which is governed by a perturbed sine-Gordon equation. A multiple scale perturbation analysis provides perturbed kink-antikink solitons to represent open state configuration with small fluctuation. The perturbation due to inhomogeneities changes the velocity of the soliton. However, the width of the soliton remains constant.

Pattern Formation and Solitons Soft Condensed Matter

Poincare recurrences of DNA sequence

332 - K. M. Frahm , D. L. Shepelyansky 2011

We analyze the statistical properties of Poincare recurrences of Homo sapiens, mammalian and other DNA sequences taken from Ensembl Genome data base with up to fifteen billions base pairs. We show that the probability of Poincare recurrences decays in an algebraic way with the Poincare exponent $beta approx 4$ even if oscillatory dependence is well pronounced. The correlations between recurrences decay with an exponent $ u approx 0.6$ that leads to an anomalous super-diffusive walk. However, for Homo sapiens sequences, with the largest available statistics, the diffusion coefficient converges to a finite value on distances larger than million base pairs. We argue that the approach based on Poncare recurrences determines new proximity features between different species and shed a new light on their evolution history.

Genomics Statistical Mechanics Biological Physics

comments

Fetching comments

Helwan

Additional details More universities

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Nonlinear excitations in DNA: Aperiodic models vs actual genome sequences

Ask ChatGPT about the research

No Arabic abstract

Read More