Use this URL to cite or link to this record in EThOS: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.777822
Title: Efficient algorithms for cancer gene searching and classification (colon cancer)
Author: Al-Rajab, Murad Mustafa Jaber
ISNI: 0000 0004 7963 5948
Awarding Body: University of Huddersfield
Current Institution: University of Huddersfield
Date of Award: 2019
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS. Restricted access.
Abstract:
Cancer kills millions of people worldwide each year. It is a growing problem and one of the leading causes of death worldwide. The number of people battling cancer is growing rapidly for various reasons, such as lifestyle. Clinically, determining the cause of cancer is very challenging and often inaccurate. This research is motivated by the increasing need for efficient and accurate algorithms to detect colon cancer. Two main models are proposed, each within a case study. The first case study (model) suggests a three-phase method for examining the accuracy and time efficiency of high-performance gene selection and cancer classification algorithms applied to detecting colon cancer cells. The first and second phases examine gene/feature selection and cancer classification algorithms applied independently across the entire colon dataset. Phase three examines the performance of the first two phases combined. The accuracies and running times are then compared across algorithms. The second case study proposes a model that reports accuracy improvements using a two-stage hybrid multifilter feature selection method for colon cancer classification. This model benefits from applying gene selection prior to classification, which enhances the accuracy of cancer-cell detection. The proposed model first applies a hybrid of a genetic algorithm (GA) and information gain as the first stage of selection, followed by the minimum redundancy maximum relevance (mRMR) filter-ranking algorithm as the second stage, which refines the subset of selected genes. Thereafter, the selected genes are evaluated by a variety of machine-learning algorithms. The first case study finds that GA performs best among the gene selection algorithms examined on the colon dataset during phase 1. During phase 2, decision tree (DT) and support vector machine (SVM) classifiers achieve very good accuracy (86%-87%). During phase 3, the combination of GA as a selector and DT as a classifier outperforms the other algorithms with respect to accuracy (92%) and also shows better time efficiency. In the second case study, SVM classifiers achieve high accuracy (94%) following the proposed two-stage multifilter selection approach. When compared to methods in the literature, the proposed models yield better results.
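The two-stage selection idea described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: the genetic algorithm part of stage one is omitted and replaced here by a plain information-gain ranking, the genes are assumed to be already discretized, and all names (`toy_genes`, `toy_labels`, `information_gain_filter`, `mrmr`) are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def mutual_information(xs, ys):
    """I(X; Y) = H(X) + H(Y) - H(X, Y) for two discrete sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def information_gain_filter(features, labels, keep):
    """Stage 1 (simplified, no GA): keep the `keep` genes with the highest
    information gain, i.e. mutual information with the class label."""
    ranked = sorted(
        features,
        key=lambda g: mutual_information(features[g], labels),
        reverse=True,
    )
    return ranked[:keep]


def mrmr(features, labels, candidates, k):
    """Stage 2: greedy mRMR. At each step pick the candidate gene that
    maximises relevance minus mean redundancy with the genes already chosen."""
    selected = []
    remaining = list(candidates)
    while remaining and len(selected) < k:
        def score(g):
            relevance = mutual_information(features[g], labels)
            if not selected:
                return relevance
            redundancy = sum(
                mutual_information(features[g], features[s]) for s in selected
            ) / len(selected)
            return relevance - redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data: 8 samples, 4 discretized "genes", binary class label.
toy_labels = [0, 0, 0, 0, 1, 1, 1, 1]
toy_genes = {
    "g1": [0, 0, 0, 1, 1, 1, 1, 1],  # informative
    "g2": [0, 0, 0, 1, 1, 1, 1, 1],  # exact duplicate of g1 (fully redundant)
    "g3": [1, 0, 0, 0, 1, 1, 1, 0],  # weakly informative, little overlap with g1
    "g4": [0, 1, 0, 1, 0, 1, 0, 1],  # independent of the label
}

candidates = information_gain_filter(toy_genes, toy_labels, keep=3)
selected = mrmr(toy_genes, toy_labels, candidates, k=2)
print(candidates, selected)
```

On this toy data the first stage discards the uninformative `g4`, and the second stage keeps `g1` but prefers `g3` over the duplicate `g2`, because `g2` adds no information beyond `g1`. In a full pipeline the selected genes would then be passed to a classifier such as a DT or SVM.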
Supervisor: Lu, Joan ; Xu, Qiang
Sponsor: Not available
Qualification Name: Thesis (Ph.D.)
Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.777822
DOI: Not available
Keywords: Q Science (General) ; QH426 Genetics