253
IRUS Total
Downloads
  Altmetric

Estimating effective population size from genetic data: the past, present, and the future

File Description SizeFormat 
Hui-TJ-2017-PhD-Thesis.pdfThesis2.11 MBAdobe PDFView/Open
Title: Estimating effective population size from genetic data: the past, present, and the future
Authors: Hui, Tin-Yu Jonathan
Item Type: Thesis or dissertation
Abstract: Effective population size (𝑁𝑒) is an important statistic in conservation science and in the broader topics of evolutionary genetics. 𝑁𝑒 is often used to quantify the rate of evolutionary events such as losses in genetic diversity. Estimating and interpreting such quantity can however be challenging. Chapter 2 focuses on the change in allele frequency between two or more time points due to genetic drift. A new likelihood-based estimator 𝑁𝐡̂ for contemporary 𝑁𝑒 estimation is proposed by adopting a hidden Markov algorithm and continuous approximations. 𝑁𝐡̂ is found to be several-fold faster than the existing methods without sacrificing accuracy. It also relaxes the upper bound of 𝑁𝑒 to several million and which is currently limited to about 50000 due to computing limitations. Chapter 3 extends 𝑁𝐡̂ to handle multialleleic loci through using Dirichlet-multinomial distributions. An R package is also provided and available for download. Chapter 4 explores the signatures of linkage disequilibrium (LD) between a pair of loci induced by genetic drift as a function of recombination rate and historical population sizes. 𝐸[π‘Ÿ2] can be expressed as the weighted sum of the probability of coalescent at different time points of which information about 𝑁𝑒 is contained. This relationship is verified by computer simulation and then applied to historical 𝑁𝑒 estimation as illustrated in an example of Anopheles coluzzii population. A new likelihood-based routine Constrained ML is suggested in chapter 5 to estimate haplotype frequencies and π‘Ÿ2 from genotypes under Hardy-Weinberg Equilibrium. It is shown to be identical to existing EM algorithm under normal conditions but far less sensitive to initial conditions. A new β€œunbiased” sample size correction is also proposed to estimate π‘Ÿ2. To summarise, this work pushes the 𝑁𝑒 estimation to its current boundary and more importantly provides suitable tools to analyse the ever-growing datasets.
Content Version: Open Access
Issue Date: Jan-2017
Date Awarded: Jul-2017
URI: http://hdl.handle.net/10044/1/49250
DOI: https://doi.org/10.25560/49250
Copyright Statement: Creative Common Attribute Non-Commercial No derivative licence
Supervisor: Burt, Austin
Sponsor/Funder: Bill and Melinda Gates Foundation
Department: Life Sciences
Publisher: Imperial College London
Qualification Level: Doctoral
Qualification Name: Doctor of Philosophy (PhD)
Appears in Collections:Life Sciences PhD theses