Use this URL to cite or link to this record in EThOS: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.799481
Title: Scalable Gaussian process methods for single-cell data
Author: Ahmed, Sumon
ISNI: 0000 0004 8505 0524
Awarding Body: University of Manchester
Current Institution: University of Manchester
Date of Award: 2020
Abstract:
The analysis of single-cell data creates the opportunity to examine the temporal dynamics of complex biological processes where the generation of time course experiments is challenging or technically impossible. One popular approach is to learn a lower-dimensional manifold or trajectory through the data that captures its major sources of variation. Gene expression patterns can then be aligned through different lineages in the trajectory as smooth functions of pseudotime, which promises to facilitate the identification of differentially expressed (DE) genes across trajectories. We briefly review some popular trajectory inference and downstream analysis methods along with their strengths and assumptions. We provide a brief overview of Gaussian process (GP) inference and describe how GPs can be used for dimensionality reduction and data association, which later facilitate probabilistic pseudotime estimation and downstream analysis to infer DE genes and branching times. We present a scalable implementation of the Gaussian process latent variable model (GPLVM) and develop a pseudotime estimation method that scales to large-volume droplet-based single-cell datasets and can be extended to higher-dimensional latent spaces to capture other sources of variation such as branching dynamics. The model's efficacy is evaluated on a number of datasets from different organisms collected using different protocols. The model converges significantly faster than existing methods whilst achieving comparable estimation accuracy. We reimplement an existing downstream analysis method for identifying branching dynamics from bulk time series data and apply it to single-cell data after pseudotime inference, extending the models to handle count data. We also present the limitations of a recent approach to inference of branching dynamics in single-cell data and extend the model to mitigate them. Our downstream analysis models are shown to successfully identify branching locations for individual genes when applied to simulated data and to single-cell data from mouse haematopoietic stem cells (HSCs).
Supervisor: Rattray, Magnus ; Boukouvalas, Alexis
Sponsor: Not available
Qualification Name: Thesis (Ph.D.)
Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.799481
DOI: Not available
Keywords: gene expression ; branching ; differential expression ; single-cell ; Gaussian process ; pseudotime