Use this URL to cite or link to this record in EThOS:
Title: Distributed computing of large-scale singular value decompositions
Author: Fang, Sheng
ISNI:       0000 0004 7960 0331
Awarding Body: University of Oxford
Current Institution: University of Oxford
Date of Award: 2018
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS. Please try the link below.
Access from Institution:
Big data projects increasingly make use of networks of heterogeneous computational resources for scientific computing purposes. To meet the requirements of big data applications and harness the power of modern computing architectures, novel highly parallel algorithms for numerical linear algebra computations are needed that rely on as little node synchronicity and data communication as possible. Distributed algorithms are particularly important in situations where the data itself is naturally distributed across several or many servers, or where the data collection is decentralised. The singular value decomposition (SVD) is one of the fundamental matrix decompositions and a cornerstone of numerical linear algebra. The computation of leading part SVDs plays an essential role in a variety of models and algorithms and dominates their computational costs. In this thesis, we analyse a distributed algorithm for leading part SVD computations of large-scale matrices. The algorithm is well adapted to cloud computing or computer networks, as it satisfies the loose coupling requirement (low synchronicity, low communication overhead). The algorithm was first introduced by R. Hauser and D. Goodman and was first published in the latter's DPhil thesis. We will give the geometric intuition behind the algorithm and introduce it formally, but the main part of this thesis will focus on the analysis of the distributed SVD algorithm and its experimental validation. Two different approaches will be proposed to investigate the convergence of the algorithm. The first approach utilizes classical matrix analysis tools, leading to an error bound. The second approach of analysis provides more geometric insights. The framework based on geometric observations gives bounds on the global aggregation of numerical approximation errors that occur in local computations. Bounds on these local errors are studied probabilistically under the assumption of random input. Finally, we report numerical experiments, as well as the discussion of the variants of the algorithm. Applications in matrix optimisation problems, including low-rank matrix completion and sparse principal component analysis, will also be presented, and we use these to obtain an experimental validation of the algorithm under non-random input.
Supervisor: Hauser, Raphael Sponsor: China Scholarship Council ; Engineering and Physical Sciences Research Council
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID:  DOI: Not available