
Taxonomic Provenance: Two Influential Primate Classifications Logically Aligned

Published by Nico Franz
Publication date: 2014
Research field: Biology
Language: English





Classification standards such as the Mammal Species of the World (MSW) aim to unify name usages at the global scale, but may nevertheless experience significant levels of taxonomic change from one edition to the next. This circumstance challenges the biodiversity and phylogenetic data communities to develop more granular identifiers to track taxonomic congruence and incongruence in ways that both humans and machines can process, i.e., to logically represent taxonomic provenance across multiple classification hierarchies. Here we show that reasoning over taxonomic provenance is feasible for two classifications of primates corresponding to the second and third MSW editions. Our approach entails three main components: (1) individuation of name usages as taxonomic concepts, (2) articulation of concepts via human-asserted Region Connection Calculus (RCC-5) relationships, and (3) the use of an Answer Set Programming toolkit to infer and visualize logically consistent alignments of these taxonomic input constraints. Our use case entails the Primates sec. Groves (1993; MSW2 - 317 taxonomic concepts; 233 at the species level) and Primates sec. Groves (2005; MSW3 - 483 taxonomic concepts; 376 at the species level). Using 402 concept-to-concept input articulations, the reasoning process yields a single, consistent alignment, and infers 153,111 Maximally Informative Relations that constitute a comprehensive provenance resolution map for every concept pair in the Primates sec. MSW2/MSW3. The entire alignment and various partitions facilitate quantitative analyses of name/meaning dissociation, revealing that approximately one in three paired name usages across treatments is not reliable - in the sense of the same name identifying congruent taxonomic meanings. We conclude with an optimistic outlook for logic-based provenance tools in next-generation biodiversity and phylogeny data platforms.
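The five RCC-5 relationships mentioned in the abstract (congruence, proper inclusion, inverse proper inclusion, overlap, exclusion) can be illustrated with a minimal Python sketch. It rests on a simplifying assumption that is not the paper's method: each taxonomic concept is modeled as an explicit set of member identifiers, and the relationship is computed by set comparison, whereas the paper relies on human-asserted articulations and an Answer Set Programming toolkit. The names `RCC5`, `rcc5`, and `unreliable_name_fraction`, and all toy data, are hypothetical.

```python
from enum import Enum


class RCC5(Enum):
    """The five RCC-5 base relations between two non-empty regions."""
    CONGRUENT = "=="
    INCLUDES = ">"       # first properly includes second
    INCLUDED_IN = "<"    # first is properly included in second
    OVERLAPS = "><"      # partial overlap, neither contains the other
    DISJOINT = "!"       # no shared members


def rcc5(a: frozenset, b: frozenset) -> RCC5:
    """Return the RCC-5 relation between two concepts modeled as sets.

    Assumption: a concept's meaning is approximated by its set of members.
    """
    if not a or not b:
        raise ValueError("RCC-5 is defined for non-empty regions only")
    if a == b:
        return RCC5.CONGRUENT
    if a > b:
        return RCC5.INCLUDES
    if a < b:
        return RCC5.INCLUDED_IN
    if a & b:
        return RCC5.OVERLAPS
    return RCC5.DISJOINT


def unreliable_name_fraction(t1: dict, t2: dict) -> float:
    """Share of names used in both treatments whose concepts are not congruent."""
    shared = sorted(set(t1) & set(t2))
    if not shared:
        return 0.0
    bad = sum(1 for name in shared if rcc5(t1[name], t2[name]) is not RCC5.CONGRUENT)
    return bad / len(shared)


# Toy data (hypothetical names and members, not taken from MSW2/MSW3).
earlier = {
    "A": frozenset({1, 2, 3}),
    "B": frozenset({4, 5}),
    "C": frozenset({6}),
    "E": frozenset({7, 8}),
}
later = {
    "A": frozenset({1, 2}),   # name "A" kept, but one member split off
    "B": frozenset({4, 5}),
    "C": frozenset({6}),
    "D": frozenset({3}),      # new name for the split-off member
    "E": frozenset({7, 8}),
}

if __name__ == "__main__":
    for name in sorted(set(earlier) & set(later)):
        print(name, rcc5(earlier[name], later[name]).value)
    print(unreliable_name_fraction(earlier, later))
```

In this toy case the name "A" is shared by both treatments but the earlier concept properly includes the later one, so the same name does not identify congruent meanings. This is the kind of name/meaning dissociation the abstract quantifies.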




Read also

We analyze several florae (collections of plant species populating specific areas) in different geographic and climatic regions. For every list of species we produce a taxonomic classification tree and we consider its statistical properties. We find that regardless of the geographical location, the climate and the environment all species collections have universal statistical properties that we show to be also robust in time. We then compare observed data sets with simulated communities obtained by randomly sampling a large pool of species from all over the world. We find differences in the behavior of the statistical properties of the corresponding taxonomic trees. Our results suggest that it is possible to distinguish quantitatively real species assemblages from random collections and thus demonstrate the existence of correlations between species.
This paper develops a formulation of the quasispecies equations appropriate for polysomic, semiconservatively replicating genomes. This paper is an extension of previous work on the subject, which considered the case of haploid genomes. Here, we develop a more general formulation of the quasispecies equations that is applicable to diploid and even polyploid genomes. Interestingly, with an appropriate classification of population fractions, we obtain a system of equations that is formally identical to the haploid case. As with the work for haploid genomes, we consider both random and immortal DNA strand chromosome segregation mechanisms. However, in contrast to the haploid case, we have found that an analytical solution for the mean fitness is considerably more difficult to obtain for the polyploid case. Accordingly, whereas for the haploid case we obtained expressions for the mean fitness for the case of an analogue of the single-fitness-peak landscape for arbitrary lesion repair probabilities (thereby allowing for non-complementary genomes), here we solve for the mean fitness for the restricted case of perfect lesion repair.
Our understanding of the evolutionary process has come a long way since the publication, 150 years ago, of On the Origin of Species by Charles R. Darwin. The 20th century witnessed great efforts to embrace replication, mutation, and selection within the framework of a formal theory, able eventually to predict the dynamics and fate of evolving populations. However, a large body of empirical evidence collected over the last decades strongly suggests that some of the assumptions of those classical models need deep revision. The viability of organisms is not dependent on a unique and optimal genotype. The discovery of huge sets of genotypes (or neutral networks) yielding the same phenotype (ultimately, the same organism) reveals that, most likely, very different functional solutions can be found, accessed and fixed in a population through a low-cost exploration of the space of genomes. The evolution behind the curtain may be the answer to some of the current puzzles that evolutionary theory faces, like the fast speciation process that is observed in the fossil record after very long stasis periods.
Least squares trees, multi-dimensional scaling and Neighbor Nets are all different and popular ways of visualizing multi-dimensional data. The method of flexi-Weighted Least Squares (fWLS) is a powerful method of fitting phylogenetic trees, when the exact form of errors is unknown. Here, both polynomial and exponential weights are used to model errors. The exact same models are implemented for multi-dimensional scaling to yield flexi-Weighted MDS, including as special cases methods such as the Sammon Stress function. Here we apply all these methods to population genetic data looking at the relationships of Abraham's Children encompassing Arabs and now widely dispersed populations of Jews, in relation to an African outgroup and a variety of European populations. Trees, MDS and Neighbor Nets of this data are compared within a common likelihood framework and the strengths and weaknesses of each method are explored. Because the errors in this type of data can be complex, for example, due to unexpected genetic transfer, we use a residual resampling method to assess the robustness of trees and the Neighbor Net. Despite the Neighbor Net fitting best by all criteria except BIC, its structure is ill defined following residual resampling. In contrast, fWLS trees are favored by BIC and retain considerable strong internal structure following residual resampling. This structure clearly separates various European and Middle Eastern populations, yet it is clear all of the models have errors much larger than expected by sampling variance alone.
Emmanuel Tannenbaum, 2007
This paper develops simplified mathematical models describing the mutation-selection balance for the asexual and sexual replication pathways in Saccharomyces cerevisiae. We assume diploid genomes consisting of two chromosomes, and we assume that each chromosome is functional if and only if its base sequence is identical to some master sequence. The growth and replication of the yeast cells is modeled as a first-order process, with first-order growth rate constants that are determined by whether a given genome consists of zero, one, or two functional chromosomes. In the asexual pathway, we assume that a given diploid cell divides into two diploids. In the sexual pathway, we assume that a given diploid cell divides into two diploids, each of which then divides into two haploids. The resulting four haploids enter a haploid pool, where they grow and replicate until they meet another haploid with which to fuse. When the cost for sex is low, we find that the selective mating strategy leads to the highest mean fitness of the population, when compared to all of the other strategies. We also show that, at low to intermediate replication fidelities, sexual replication with random mating has a higher mean fitness than asexual replication, as long as the cost for sex is low. This is consistent with previous work suggesting that sexual replication is advantageous at high population densities, low replication rates, and intermediate replication fidelities. The results of this paper also suggest that S. cerevisiae switches from asexual to sexual replication when stressed, because stressful growth conditions provide an opportunity for the yeast to clear out deleterious mutations from their genomes.