No Arabic abstract
The distributions of the times to the first common ancestor t_mrca is numerically studied for an ecological population model, the extended Moran model. This model has a fixed population size N. The number of descendants is drawn from a beta distribution Beta(alpha, 2-alpha) for various choices of alpha. This includes also the classical Moran model (alpha->0) as well as the uniform distribution (alpha=1). Using a statistical mechanics-based large-deviation approach, the distributions can be studied over an extended range of the support, down to probabilities like 10^{-70}, which allowed us to study the change of the tails of the distribution when varying the value of alpha in [0,2]. We find exponential distributions p(t_mrca)~ delta^{t_mrca} in all cases, with systematically varying values for the base delta. Only for the cases alpha=0 and alpha=1, analytical results are known, i.e., delta=exp(-2/N^2) and delta=2/3, respectively. We recover these values, confirming the validity of our approach. Finally, we also study the correlations between t_mrca and the number of descendants.
We study the evolution of the population genealogy in the classic neutral Moran Model of finite size and in discrete time. The stochastic transformations that shape a Moran population can be realized directly on its genealogy and give rise to a process with a state space consisting of the finite set of Yule trees of a certain size. We derive a number of properties of this process, and show that they are in agreement with existing results on the infinite-population limit of the Moran Model. Most importantly, this process admits time reversal, which gives rise to another tree-valued Markov Chain and allows for a thorough investigation of the Most Recent Common Ancestor process.
We define the Sampled Moran Genealogy Process, a continuous-time Markov process on the space of genealogies with the demography of the classical Moran process, sampled through time. To do so, we begin by defining the Moran Genealogy Process using a novel representation. We then extend this process to include sampling through time. We derive exact conditional and marginal probability distributions for the sampled process under a stationarity assumption, and an exact expression for the likelihood of any sequence of genealogies it generates. This leads to some interesting observations pertinent to existing phylodynamic methods in the literature. Throughout, our proofs are original and make use of strictly forward-in-time calculations and are exact for all population sizes and sampling processes.
We reconsider the Moran model in continuous time with population size $N$, two allelic types, and selection. We introduce a new particle representation, which we call the labelled Moran model, and which has the same distribution of type frequencies as the original Moran model, provided the initial values are chosen appropriately. In the new model, individuals are labelled $1,2, dots, N$; neutral resampling events may take place between arbitrary labels, whereas selective events only occur in the direction of increasing labels. With the help of elementary methods only, we not only recover fixation probabilities, but also obtain detailed insight into the number and nature of the selective events that play a role in the fixation process forward in time.
In exponentially proliferating populations of microbes, the population typically doubles at a rate less than the average doubling time of a single-cell due to variability at the single-cell level. It is known that the distribution of generation times obtained from a single lineage is, in general, insufficient to determine a populations growth rate. Is there an explicit relationship between observables obtained from a single lineage and the population growth rate? We show that a populations growth rate can be represented in terms of averages over isolated lineages. This lineage representation is related to a large deviation principle that is a generic feature of exponentially proliferating populations. Due to the large deviation structure of growing populations, the number of lineages needed to obtain an accurate estimate of the growth rate depends exponentially on the duration of the lineages, leading to a non-monotonic convergence of the estimate, which we verify in both synthetic and experimental data sets.
Eigens quasi-species model describes viruses as ensembles of different mutants of a high fitness master genotype. Mutants are assumed to have lower fitness than the master type, yet they coexist with it forming the quasi-species. When the mutation rate is sufficiently high, the master type no longer survives and gets replaced by a wide range of mutant types, thus destroying the quasi-species. It is the so-called error catastrophe. But natural selection acts on phenotypes, not genotypes, and huge amounts of genotypes yield the same phenotype. An important consequence of this is the appearance of beneficial mutations which increase the fitness of mutants. A model has been recently proposed to describe quasi-species in the presence of beneficial mutations. This model lacks the error catastrophe of Eigens model and predicts a steady state in which the viral population grows exponentially. Extinction can only occur if the infectivity of the quasi-species is so low that this exponential is negative. In this work I investigate the transient of this model when infection is started from a small amount of low fitness virions. I prove that, beyond an initial regime where viral population decreases (and can go extinct), the growth of the population is super-exponential. Hence this population quickly becomes so huge that selection due to lack of host cells to be infected begins to act before the steady state is reached. This result suggests that viral infection may widespread before the virus has developed its optimal form.