Do you want to publish a course? Click here

NTRFINDER: A Software Tool to Find Nested Tandem Repeats

107   0   0.0 ( 0 )
 Added by Atheer Matroud
 Publication date 2010
  fields Biology
and research's language is English




Ask ChatGPT about the research

We introduce the software tool NTRFinder to find the complex repetitive structure in DNA we call a nested tandem repeat (NTR). An NTR is a recurrence of two or more distinct tandem motifs interspersed with each other. We propose that nested tandem repeats can be used as phylogenetic and population markers. We have tested our algorithm on both real and simulated data, and present some real nested tandem repeats of interest. We discuss how the NTR found in the ribosomal DNA of taro (Colocasia esculenta) may assist in determining the cultivation prehistory of this ancient staple food crop. NTRFinder can be downloaded from http://www.maths.otago.ac.nz/? aamatroud/.



rate research

Read More

We propose a simple tractable pair hidden Markov model for pairwise sequence alignment that accounts for the presence of short tandem repeats. Using the framework of gain functions, we design several optimization criteria for decoding this model and describe the resulting decoding algorithms, ranging from the traditional Viterbi and posterior decoding to block-based decoding algorithms specialized for our model. We compare the accuracy of individual decoding algorithms on simulated data and find our approach superior to the classical three-state pair HMM in simulations.
Untargeted metabolomic studies are revealing large numbers of naturally occurring metabolites that cannot be characterized because their chemical structures and MS/MS spectra are not available in databases. Here we present iMet, a computational tool based on experimental tandem mass spectrometry that could potentially allow the annotation of metabolites not discovered previously. iMet uses MS/MS spectra to identify metabolites structurally similar to an unknown metabolite, and gives a net atomic addition or removal that converts the known metabolite into the unknown one. We validate the algorithm with 148 metabolites, and show that for 89% of them at least one of the top four matches identified by iMet enables the proper annotation of the unknown metabolite. iMet is freely available at http://imet.seeslab.net.
Proteins are the active working horses in our body. These biomolecules perform all vital cellular functions from DNA replication and general biosynthesis to metabolic signaling and environmental sensing. While static 3D structures are now readily available, observing the functional cycle of proteins - involving conformational changes and interactions - remains very challenging, e.g., due to ensemble averaging. However, time-resolved information is crucial to gain a mechanistic understanding of protein function. Single-molecule techniques such as FRET and force spectroscopies provide answers but can be limited by the required labelling, a narrow time bandwidth, and more. Here, we describe electrical nanopore detection as a tool for probing protein dynamics. With a time bandwidth ranging from microseconds to hours, it covers an exceptionally wide range of timescales that is very relevant for protein function. First, we discuss the working principle of label-free nanopore experiments, various pore designs, instrumentation, and the characteristics of nanopore signals. In the second part, we review a few nanopore experiments that solved research questions in protein science, and we compare nanopores to other single-molecule techniques. We hope to make electrical nanopore sensing more accessible to the biochemical community, and to inspire new creative solutions to resolve a variety of protein dynamics - one molecule at a time.
Because of the recent technological advances, the key technologies needed for precision space optical astrometry are now in hand. The Microarcsecond Astrometry Probe (MAP) mission concept is designed to find 1 Earth mass planets at 1AU orbit (scaled to solar luminosity) around the nearest ~90 FGK stars. The MAP payload includes i) a single three-mirror anastigmatic telescope with a 1-m primary mirror and metrology subsystems, and ii) a camera. The camera focal plane consists of 42 detectors, providing a Nyquist sampled FOV of 0.4-deg. Its metrology subsystems ensure that MAP can achieve the 0.8 uas astrometric precision in 1 hr, which is required to detect Earth-like exoplanets in our stellar neighborhood. MAP mission could provide ~10 specific targets for a much larger coronagraphic mission that would measure its spectra. We argue for the development of the space astrometric missions capable of finding Earth-2.0. Given the current technology readiness such missions relying on precision astrometry could be flown in the next decade, perhaps in collaboration with other national space agencies.
Metabarcoding on amplicons is rapidly expanding as a method to produce molecular based inventories of microbial communities. Here, we work on freshwater diatoms, which are microalgae possibly inventoried both on a morphological and a molecular basis. We have developed an algorithm, in a program called diagno-syst, based a the notion of informative read, which carries out supervised clustering of reads by mapping them exactly one by one on all reads of a well curated and taxonomically annotated reference database. This program has been run on a HPC (and HTC) infrastructure to address computation load. We compare optical and molecular based inventories on 10 samples from Leman lake, and 30 from Swedish rivers. We track all possibilities of mismatches between both approaches, and compare the results with standard pipelines (with heuristics) like Mothur. We find that the comparison with optics is more accurate when using exact calculations, at the price of a heavier computation load. It is crucial when studying the long tail of biodiversity, which may be overestimated by pipelines or algorithms using heuristics instead (more false positive). This work supports the analysis that these methods will benefit from progress in, first, building an agreement between molecular based and morphological based systematics and, second, having as complete as possible publicly available reference databases.
comments
Fetching comments Fetching comments
Sign in to be able to follow your search criteria
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا