Use this URL to cite or link to this record in EThOS: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.779702
Title: Discriminant analysis : a functional perspective
Author: Perez Ruiz, Diego
ISNI:       0000 0004 7965 3978
Awarding Body: University of Manchester
Current Institution: University of Manchester
Date of Award: 2019
Availability of Full Text:
Access from EThOS:
Access from Institution:
Abstract:
Functional Data Analysis (FDA) provides information about curves that vary over a continuum. In this thesis, we propose two novel methodologies to classify a functional dataset, using supervised learning. The first methodology is based on Nearest Neighbours methods for functional data and classifies based on ranks of the functional signed depth. The proposed classifier uses the simplicity of the k-Ranked Nearest Neighbours (k-RNN) and its practical efficiency, exploiting the fact that the k-RNN provides conditional probabilities where the depth of an observed curve belongs to a particular group. Using this, we develop a probabilistic classifier and construct point-wise confidence intervals using a bootstrap approach. Following a generalized additive model, we propose a classifier based on the signed depth and the distance to the mode for functional observations. By means of a simulation study, we compare the performance of the proposed classifier against other nearest neighbours and depth classifiers. We also investigate the performance of the proposed classifier under different types of outliers common to these kinds of problems; we see that our proposed method works well under these different scenarios. The second methodology we developed is based on log ratios of density estimates using Bayes' theorem. We propose a nonparametric adaptive density Bayesian classifier based on log ratios density estimates of functional principal component scores combined with different semimetrics. We study some of the main properties of the density estimator in a finite dimensional space and conduct a simulation study to investigate the performance of the proposed classifier under two semimetrics: the semimetric based on principal components scores and the semimetric based on partial least squares. We also compare the performance of the proposed classifier against different methods for simulated and real datasets. Imbalanced sample sizes appear frequently in the classification problem and present multiple issues. We propose a method to strengthen observations that are at the boundary. We study different sampling methods to strengthen observations that are more susceptible to misclassification and we generate new curves by considering a linear combination of the observations in the border and the observations closes in depth.
Supervisor: Charalambous, Christiana Sponsor: Not available
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.779702  DOI: Not available
Keywords: Principal component analysis ; Simulations ; Functional Depth ; k-ranked nearest neighbours ; Nonparametric statistics ; Kernel density estimation ; Functional Data ; Bayesian classifier
Share: