Use this URL to cite or link to this record in EThOS:
Title: Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies
Author: Fiaschi, Linda
ISNI:       0000 0004 2709 4262
Awarding Body: University of Nottingham
Current Institution: University of Nottingham
Date of Award: 2011
Availability of Full Text:
Access from EThOS:
Access from Institution:
How genetic mutations such as Single Nucleotide Polymorphisms (SNPs) affect the risk of contracting a specific disease is still an open question for numerous different medical conditions. Two problems related to SNPs analysis are (i) the selection of computational techniques to discover possible single and multiple SNP associations; and (ii) the size of the latest datasets, which may contain millions of SNPs. In order to find associations between SNPs and diseases, two popular techniques are investigated and enhanced. Firstly, the ‘Transmission Disequilibrium Test’ for familybased analysis is considered. The fixed length of haplotypes provided by this approach represents a possible limit to the quality of the obtained results. For this reason, an adaptation is proposed to select the minimum number of SNPs that are responsible for disease predisposition. Secondly, decision tree algorithms for case-control analysis in situations of unrelated individuals are considered. The application of a single tool may lead to limited analysis of the genetic association to a specific condition. Thus, a novel consensus approach is proposed exploiting the strengths of three different algorithms, ADTree, C4.5 and Id3. Results obtained suggest the new approach achieves improved performance. The recent explosive growth in size of current SNPs databases has highlighted limitations in current techniques. An example is ‘Linkage Disequilibrium’ which identifies redundancy in multiple SNPs. Despite the high accuracies obtained by this method, it exhibits poor scalability for large datasets, which severely impacts on its performance. Therefore, a new fast scalable tool based on ‘Linkage Disequilibrium’ is developed to reduce the size through the measurement and elimination of redundancy between SNPs included in the initial dataset. Experimental evidence validates the potentially improved performance of the new method.
Supervisor: Not available Sponsor: Not available
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID:  DOI: Not available
Keywords: QA 75 Electronic computers. Computer science ; QH426 Genetics