22
IRUS TotalDownloads
Altmetric
Investigation of the gene regulatory logic encoded in genome sequences using machine learning
File | Description | Size | Format | |
---|---|---|---|---|
Steinhauser-S-2020-PhD-Thesis.pdf | Thesis | 211.02 MB | Adobe PDF | View/Open |
Title: | Investigation of the gene regulatory logic encoded in genome sequences using machine learning |
Authors: | Steinhauser, Sebastian |
Item Type: | Thesis or dissertation |
Abstract: | Spatiotemporal regulation of gene expression in multicellular organisms is mediated by cell-type specific regulatory elements called transcriptional enhancers. Their activity is regulated by the coordinated binding of transcription factors (TFs) to specific DNA sequences encoded within these elements. In this PhD thesis, we investigate transcriptional enhancers in the context of pluripotency and early neural development. In Chapter 2, we investigate how enhancer activity is encoded in the DNA sequence underlying TF binding sites. To this end, enhancer activities of NANOG binding sites in naive mouse embryonic stem cells (mESCs) were measured using ChIP STARR-seq. This revealed two distinct classes of NANOG binding sites, active enhancers and inactive binding sites. Predictive modelling of these classes uncovered sequence features which account for these differences such as the ESRRB motif. In Chapter 3, we examined differential enhancer activities of NANOG binding sites between naive and primed mESC using ChIP STARR-seq. Comprehensive analysis of various chromatin genomic features revealed chromatin accessibility changes as a major predictor of differential enhancer activity. Predictive modelling using only DNA sequence features uncovered potential transcriptional regulators that define enhancer activity in the two cellular states, such as TCF3, TCF4 and ESRRB for naive mESC and ZIC3 for primed cells. In Chapter 4, we studied the timing of neural induction during early nervous system development along the anterior-posterior axis using information about enhancer accessibilities. Changes in open-chromatin states during neural development were measured by applying ATAC-seq to an in vitro model system. Analysis of resulting data revealed that for posterior spinal cord induction, cells initially commit to a posterior regional identity before acquiring neural identity. Moreover, our analysis highlighted the role of CDX2 during this process which represses hindbrain and activates spinal cord related transcriptional programs. Together, this Chapter provides conclusively evidence for the dual origin of neural progenitor cells. |
Content Version: | Open Access |
Issue Date: | Feb-2020 |
Date Awarded: | Jul-2020 |
URI: | http://hdl.handle.net/10044/1/90822 |
DOI: | https://doi.org/10.25560/90822 |
Copyright Statement: | Creative Commons Attribution NonCommercial 4.0 International Licence |
Supervisor: | Luscombe, Nicholas Lenhard, Boris |
Department: | Institute of Clinical Sciences |
Publisher: | Imperial College London |
Qualification Level: | Doctoral |
Qualification Name: | Doctor of Philosophy (PhD) |
Appears in Collections: | Department of Clinical Sciences PhD Theses |
This item is licensed under a Creative Commons License