Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Natural Sciences
  3. Life Sciences
  4. Life Sciences PhD theses
  5. Learning large-scale gene regulatory networks from single cell transcriptomic data using multivariate information theory
 
  • Details
Learning large-scale gene regulatory networks from single cell transcriptomic data using multivariate information theory
File(s)
Chan-TE-2018-PhD-Thesis.pdf (32.4 MB)
Thesis
Author(s)
Chan, Thalia E
Type
Thesis or dissertation
Abstract
Networks provide powerful and flexible models for many biological systems. Gene regulatory networks model the complex biomolecular interactions that drive genetically identical cells to adopt distinct functional identities. Gene regulation is complicated and multifactorial, involving different biological mechanisms that are often well-characterized individually, but whose combinatorial effects on a large scale are poorly understood. Recent technological advances mean that highly-informative single cell data are being generated on an unprecedented scale, but these large, unwieldy datasets contain technical noise and biological heterogeneity, and must be analyzed carefully. Here we make use of information theory to address the complexities and nonlinearities of these data, drawing on recent developments in quantifying information within multivariate systems. We first provide a comprehensive review of single cell analysis and the multiple definitions of gene regulatory networks. Then we develop a network inference algorithm using multivariate information, and use it to learn networks from several developmental systems. We next present a method for quantifying significance in a large-scale information theoretic context, and use it to control the accuracy of our networks for different applications. Finally we infer a network from over 11,000 expressed genes and, using graph theoretic and bioinformatic analyses, characterize the biological processes occurring during early neural development. Throughout this work, we take a data-driven approach, favouring statistical methods that make minimal assumptions. Alongside our analyses, we provide efficient, reusable software, culminating in a collection of free, open source software libraries.
Version
Open Access
Date Issued
2018-07
Date Awarded
2018-12
URI
http://hdl.handle.net/10044/1/75534
DOI
https://doi.org/10.25560/75534
Copyright Statement
Creative Commons Attribution Non-Commercial No Derivatives licence.
Advisor
Stumpf, Michael
Babtie, Ann
Sponsor
Biotechnology and Biological Sciences Research Council (Great Britain)
Publisher Department
Life Sciences
Publisher Institution
Imperial College London
Qualification Level
Doctoral
Qualification Name
Doctor of Philosophy (PhD)
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback