Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Natural Sciences
  3. Cell and Molecular Biology
  4. Cell and Molecular Biology PhD theses
  5. Orthologous pair transfer and hybrid Bayes methods to predict the protein-protein interaction network of the Anopheles gambiae mosquitoes
 
  • Details
Orthologous pair transfer and hybrid Bayes methods to predict the protein-protein interaction network of the Anopheles gambiae mosquitoes
File(s)
Li-Q-2009-PhD-Thesis.pdf (3.94 MB)
Author(s)
Li, Qiuxiang
Type
Thesis or dissertation
Abstract
Based on the published protein-protein interaction maps of five organisms and other public databases for domain-domain and protein-protein interactions, two new approaches are proposed to infer the protein-protein interaction network of the Anopheles gambiae (A. gambiae) mosquitoes.
Our main contributions are: i) Adopted an orthologous protein pair transfer method that has so far not been seen in literature ii) Proposed a new hybrid Bayes method; iii) Used voting machines at two levels of the combined classifier/predictor; iv) Used heterogeneous datasets as the training data; v) And finally, used the trained classifier to predict the protein interactions maps for A. gambiae, arguably one of a few least known organisms in terms of protein interaction mechanism.
With the first method, the orthologous and in-paralogous protein clusters are extracted for both species. The relations between two peer-to-peer proteins in the two species are identified so that the interactions in the D. melanogaster protein interaction maps are transferred to pairs of interacting proteins in A. gambiae.
The second strategy, namely the hybrid Bayes, is based on the domain composition of proteins, with which we utilize a probability model to build virtual domain-domain maps by integrating large-scale protein interaction data from five organisms, namely Saccharomyces cerevisiae, Caenorhabditis elegans, Escherichia coli, Mus musculus and Drosophila melanogaster.
For the hybrid Bayes method, once the virtual domain-domain interaction maps are constructed, we propose two ways to predict the protein-protein interaction maps. These two methods are compared and then combined to form a voting machine to collectively decide a protein-pair’s candidacy. The users could adjust the weights for different methods to flexibly control the output. Parameters are chosen through running different experiments on the training data set.
While both the orthologous cluster and hybrid Bayes methods produce encouraging results the second one predicts more protein-protein interaction than the first. Yet these two data sets share a very small fraction of common interactions. We adopt a second voting machine and calibrate the parameters with the putative protein interaction data. Those parameters for the voting machine are used to predict the protein-protein interaction maps of the A. gambiae and produces reasonably good results.
Date Issued
2008
Date Awarded
2009-01
URI
http://hdl.handle.net/10044/1/4635
DOI
https://doi.org/10.25560/4635
Advisor
Crisanti, Andrea
Muggleton, Stephen
Sponsor
ORS scholarship
Creator
Li, Qiuxiang
Publisher Department
Cell and Molecular Biology
Publisher Institution
Imperial College London
Qualification Level
Doctoral
Qualification Name
Doctor of Philosophy (PhD)
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback