Use this URL to cite or link to this record in EThOS: | https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.489759 |
![]() |
|||||||
Title: | The automatic extraction of linguistic information from text corpora | ||||||
Author: | Mason, Oliver Jan |
ISNI:
0000 0001 2447 6909
|
|||||
Awarding Body: | University of Birmingham | ||||||
Current Institution: | University of Birmingham | ||||||
Date of Award: | 2006 | ||||||
Availability of Full Text: |
|
||||||
Abstract: | |||||||
This is a study exploring the feasibility of a fully automated analysis of linguistic data. It identifies a requirement for large-scale investigations, which cannot be done manually by a human researcher. Instead, methods from natural language processing are suggested as a way to analyse large amounts of corpus data without any human intervention. Human involvement hinders scalability and introduces a bias which prevents studies from being completely replicable. The fundamental assumption underlying this work is that linguistic analysis must be empirical, and that reliance on existing theories or even descriptive categories should be avoided as far as possible. In this thesis we report the results of a number of case studies investigating various areas of language description, lexis, grammar, and meaning. The aim of these case studies is to see how far we can automate the analysis of different aspects of language, both with data gathering and subsequent processing of the data. The outcomes of the feasibility studies demonstrate the practicability of such automated analyses.
|
|||||||
Supervisor: | Not available | Sponsor: | Not available | ||||
Qualification Name: | Thesis (Ph.D.) | Qualification Level: | Doctoral | ||||
EThOS ID: | uk.bl.ethos.489759 | DOI: | Not available | ||||
Keywords: | PE English ; P Philology. Linguistics ; QA76 Computer software | ||||||
Share: |