32
IRUS TotalDownloads
Altmetric
Integrated Bayesian approach to incorporate prior biological knowledge in the investigation of genetic variants associated with lung function
File | Description | Size | Format | |
---|---|---|---|---|
Pereira-M-2018-PhD-Thesis.pdf | Thesis | 3.61 MB | Adobe PDF | View/Open |
Title: | Integrated Bayesian approach to incorporate prior biological knowledge in the investigation of genetic variants associated with lung function |
Authors: | das Neves Sousa Pereira, Miguel Maria |
Item Type: | Thesis or dissertation |
Abstract: | Genetic association studies aim to identify single-nucleotide polymorphisms (SNPs) that are associated with complex diseases and traits. These studies, especially those performed at the genome-wide level, have successfully identified many SNPs associated with several diseases but current findings explain only a small fraction of their estimated genetic component. This thesis contributes to the scientific literature by developing a novel statistical approach for the analysis of genetic association studies and by applying it to the discovery of new SNPs associated with lung function. Lung function has an estimated genetic component of ~40% and an important epidemiological impact since low lung function has been associated with all-cause mortality. First, I investigate a Bayesian hierarchical shrinkage model that performs a joint analysis of SNPs and includes biological information about the SNPs in the analysis. The approach involves creating a score of biological knowledge retrieved from online databases which is used to inform the shrinkage towards a null SNP effect. The performance of this approach was investigated in a simulation study and a real data example where the true associations were known. It consistently outperformed the standard analysis, where SNPs are analysed separately using linear regression models. Second, I apply the approach developed to data from the large UK Biobank study to investigate the effects on lung function of variants located in 403 genes related to lung development. The results from this analysis were then replicated using three independent cohorts. The combination of the standard analysis with the Bayesian approach allowed the identification of 65 new variants in 31 different genes, which nearly doubles the number of known variants associated with lung function. Finally, I present Bioshrink, a user-friendly R Shiny application that is freely accessible and allows third-party users to apply the Bayesian approach to their own genetic association studies. |
Content Version: | Open Access |
Issue Date: | May-2018 |
Date Awarded: | Oct-2018 |
URI: | http://hdl.handle.net/10044/1/64788 |
DOI: | https://doi.org/10.25560/64788 |
Supervisor: | Minelli, Cosetta Thompson, John |
Sponsor/Funder: | National Heart and Lung Institute |
Department: | National Heart & Lung Institute |
Publisher: | Imperial College London |
Qualification Level: | Doctoral |
Qualification Name: | Doctor of Philosophy (PhD) |
Appears in Collections: | National Heart and Lung Institute PhD theses |