No Arabic abstract
The two-parameter Poisson--Dirichlet diffusion, introduced in 2009 by Petrov, extends the infinitely-many-neutral-alleles diffusion model, related to Kingmans one-parameter Poisson--Dirichlet distribution and to certain Fleming--Viot processes. The additional parameter has been shown to regulate the clustering structure of the population, but is yet to be fully understood in the way it governs the reproductive process. Here we shed some light on these dynamics by formulating a $K$-allele Wright--Fisher model for a population of size $N$, involving a uniform mutation pattern and a specific state-dependent migration mechanism. Suitably scaled, this process converges in distribution to a $K$-dimensional diffusion process as $Ntoinfty$. Moreover, the descending order statistics of the $K$-dimensional diffusion converge in distribution to the two-parameter Poisson--Dirichlet diffusion as $Ktoinfty$. The choice of the migration mechanism depends on a delicate balance between reinforcement and redistributive effects. The proof of convergence to the infinite-dimensional diffusion is nontrivial because the generators do not converge on a core. Our strategy for overcoming this complication is to prove textit{a priori} that in the limit there is no loss of mass, i.e., that, for each limit point of the sequence of finite-dimensional diffusions (after a reordering of components by size), allele frequencies sum to one.
The two parameter Poisson-Dirichlet distribution $PD(alpha,theta)$ is the distribution of an infinite dimensional random discrete probability. It is a generalization of Kingmans Poisson-Dirichlet distribution. The two parameter Dirichlet process $Pi_{alpha,theta, u_0}$ is the law of a pure atomic random measure with masses following the two parameter Poisson-Dirichlet distribution. In this article we focus on the construction and the properties of the infinite dimensional symmetric diffusion processes with respective symmetric measures $PD(alpha,theta)$ and $Pi_{alpha,theta, u_0}$. The methods used come from the theory of Dirichlet forms.
The two-parameter Poisson-Dirichlet distribution is the law of a sequence of decreasing nonnegative random variables with total sum one. It can be constructed from stable and Gamma subordinators with the two-parameters, $alpha$ and $theta$, corresponding to the stable component and Gamma component respectively. The moderate deviation principles are established for the two-parameter Poisson-Dirichlet distribution and the corresponding homozygosity when $theta$ approaches infinity, and the large deviation principle is established for the two-parameter Poisson-Dirichlet distribution when both $alpha$ and $theta$ approach zero.
The two-parameter Poisson-Dirichlet diffusion is an infinite-dimensional diffusion on the ordered simplex with a two-parameter Poisson-Dirichlet (alpha, theta) stationary distribution. We derive a dual process representation for the diffusion, suggested by Feng et al. (2011)s spectral expansion of the transition density, and its rearrangement by Zhou (2015). The dual process is in terms of a line-of-descent process which tracks the evolution of non-mutant frequencies from time zero. Remarkably the line-of-descent process does not depend on alpha. Methods of proof use the sampling distribution of n points in the two-parameter Poisson-Dirichlet diffusion. We connect the sampling distribution with a generalized Blackwell and MacQueen Polya urn model.
A number of discrete time, finite population size models in genetics describing the dynamics of allele frequencies are known to converge (subject to suitable scaling) to a diffusion process in the infinite population limit, termed the Wright-Fisher diffusion. In this article we show that the diffusion is ergodic uniformly in the selection and mutation parameters, and that the measures induced by the solution to the stochastic differential equation are uniformly locally asymptotically normal. Subsequently these two results are used to analyse the statistical properties of the Maximum Likelihood and Bayesian estimators for the selection parameter, when both selection and mutation are acting on the population. In particular, it is shown that these estimators are uniformly over compact sets consistent, display uniform in the selection parameter asymptotic normality and convergence of moments over compact sets, and are asymptotically efficient for a suitable class of loss functions.
The Wright-Fisher family of diffusion processes is a widely used class of evolutionary models. However, simulation is difficult because there is no known closed-form formula for its transition function. In this article we demonstrate that it is in fact possible to simulate exactly from a broad class of Wright-Fisher diffusion processes and their bridges. For those diffusions corresponding to reversible, neutral evolution, our key idea is to exploit an eigenfunction expansion of the transition function; this approach even applies to its infinite-dimensional analogue, the Fleming-Viot process. We then develop an exact rejection algorithm for processes with more general drift functions, including those modelling natural selection, using ideas from retrospective simulation. Our approach also yields methods for exact simulation of the moment dual of the Wright-Fisher diffusion, the ancestral process of an infinite-leaf Kingman coalescent tree. We believe our new perspective on diffusion simulation holds promise for other models admitting a transition eigenfunction expansion.