Use this URL to cite or link to this record in EThOS:
Title: Automatic topic labelling and opinion summarisation
Author: Barawi, Mohamad Hardyman
ISNI:       0000 0004 8498 4421
Awarding Body: University of Aberdeen
Current Institution: University of Aberdeen
Date of Award: 2019
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS. Please try the link below.
Access from Institution:
With the global increase in online tools such as online reviews and social media platforms, individuals all around the globe have changed their way of making a decision, interacting and sharing information. This change has led researchers to explore various interests in these invaluable sources of information by using a set of statistical methods such as the topic models to discover the hidden thematic structure in a large collection of documents. As an illustration, these models learn sets of topics from words frequency that co-occurs in the document collection automatically. Topics discovered are associated with relevant documents and often represent abstract themes, i.e. Politics or Sports. As a result, these characteristics make topic models a useful tool to extract interesting topics automatically from a mass amount of data such as reviews, online expressions and ratings. The main aim of this thesis is to focus on some fundamental challenges in inter-preting topic models, making them more useful and comprehensible to humans. First, we look at the problem of labelling the topics discovered by topic models. We propose novel methods for labelling the sentiment-bearing topics automatically and show that our approaches work better than previously proposed methods. Next, we propose methods for summarising opinions in a large document collection. Our opinion summarisation approach is scalable and also provides diverse and general summaries. Finally, we look at the problem of organising large collections of opinionated articles to visualising relevant information in articles. We develop a browsing system that allows users to navigate and identify relevant information in article collections by using the topics discovered by topic models as keywords. We also propose approaches to visualise topics discovered in a quantitative way, such as the heat maps. We also show the topics visualisation discovered in the different region using the reverse geo-coding approach.
Supervisor: Lin, Chenghua ; Siddharthan, Advaith Sponsor: Not available
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID:  DOI: Not available
Keywords: Text data mining ; Probabilities