Title: Extending the graphical representation of four KEGG pathways for a better understanding of prostate cancer using machine learning of graphical models
Author: Aloraini, Adel Abdullah M.
ISNI: 0000 0004 2715 6187
Awarding Body: University of York
Current Institution: University of York
Date of Award: 2011
This thesis presents a novel contribution to computational biology together with newly developed machine learning methods. It shows how the graphical representation of KEGG pathways can be refined using machine learning of graphical models. The focus is mainly on a class of graphical models called Bayesian networks, and different ways of learning Bayesian networks are discussed throughout. The work is based on Affymetrix gene expression microarray profiles and penalised Gaussian linear models. Penalisation in linear models involves choosing the most important parents and estimating the associated coefficients simultaneously using L1-regression. The sparse dataset generated by Affymetrix microarray technology is the key consideration in this thesis when learning Bayesian networks. Thus, the work can be viewed as developing robust methods that avoid the overfitting usually associated with gene expression datasets, and as contributing to revealing more detail about a well-known discrepancy in KEGG pathways. The problem is therefore to learn from a large number of candidate predictors and a small number of samples (p >> n). For such a problem, the goal is to apply model selection methods that aim to achieve accurate prediction, interpretable models, and stable models. Prediction and the choice of the most powerful predictors can be improved by using methods that trade off bias against variance. Identifying which predictors are meaningful, rather than using all predictors, yields interpretable models. Finally, when the most important predictors are chosen, a small change in the data will not result in large changes in the selected subset of predictors, which gives stability to the learnt models.
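The L1-penalised parent selection mentioned in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the synthetic data, the penalty value lam, the absence of an intercept term, and the names soft_threshold and lasso_coordinate_descent are all assumptions made for illustration. The sketch treats one gene as the child node, treats every other gene as a candidate parent, and fits a lasso by cyclic coordinate descent in a p >> n setting, so that parent selection and coefficient estimation happen in a single step.

```python
import random


def soft_threshold(z, gamma):
    """Shrink z towards zero by gamma; values in [-gamma, gamma] become 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 by cyclic coordinate descent.

    No intercept is fitted, so X and y are assumed to be roughly centred.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X beta while beta is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of column j with the residual that excludes column j.
            rho = sum(X[i][j] * residual[i] for i in range(n)) / n + col_sq[j] * beta[j]
            new_beta = soft_threshold(rho, lam) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta


# Hypothetical synthetic data: 25 samples, 60 candidate parent genes (p >> n).
# The child gene truly depends only on candidates 3 and 7.
random.seed(0)
n_samples, n_genes = 25, 60
X = [[random.gauss(0.0, 1.0) for _ in range(n_genes)] for _ in range(n_samples)]
y = [2.0 * row[3] - 1.5 * row[7] + random.gauss(0.0, 0.1) for row in X]

beta = lasso_coordinate_descent(X, y, lam=0.3)  # lam is an arbitrary illustrative value
selected = [j for j, b in enumerate(beta) if b != 0.0]
print("selected parents:", selected)
```

Candidates whose coefficient is shrunk exactly to zero are dropped from the parent set. A larger lam gives a sparser, more biased model, while a smaller lam admits more parents at the cost of higher variance; in practice lam would be chosen by a model selection criterion rather than fixed by hand.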
Supervisor: Cussens, James
Sponsor: Not available
Qualification Name: Thesis (Ph.D.)
Qualification Level: Doctoral
EThOS ID:  DOI: Not available