Use this URL to cite or link to this record in EThOS: http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.595145
Title: Semantic tagging of medical narratives using SNOMED CT
Author: Hina, Saman
Awarding Body: University of Leeds
Current Institution: University of Leeds
Date of Award: 2013
Availability of Full Text:
Access from EThOS:
Full text unavailable from EThOS. Restricted access.
Abstract:
In the medical domain, semantic analysis is critical for several research questions that are of interest not only to healthcare researchers but also to NLP researchers. Yet most of the data exists in the form of medical narratives. Semantic analysis of these narratives is needed to identify semantic information and classify it into semantic categories. This analysis is useful to both domain and non-domain users for further investigations. The main objective of this research is to develop a generic semantic tagger for medical narratives using a tag set derived from SNOMED CT®, an international healthcare terminology. Towards this objective, the key hypothesis is that it is possible to identify semantic information (paraphrases of concepts, abbreviations of concepts and complex multiword concepts) in medical narratives, and to classify it with globally known semantic categories, by analysing an authentic corpus of medical narratives and the language of SNOMED CT®. The research began with an investigation of using SNOMED CT® to identify concepts in medical narratives, which resulted in the derivation of a tag set. This tag set was then used to develop three gold standard datasets. One of these datasets required anonymization because it contained four protected health information (PHI) categories, so a separate module was developed to anonymize these PHI categories. After the anonymization, a generic annotation scheme was developed and evaluated for the annotation of the three gold standard datasets. One of the gold standard datasets was used to develop generic rule-patterns for the semantic tagger, while the other two were used to evaluate the semantic tagger. In addition to the evaluation on the gold standard datasets, the semantic tagger was compared with three systems based on different methods and was shown to outperform them.
Supervisor: Not available
Sponsor: Not available
Qualification Name: Thesis (Ph.D.)
Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.595145
DOI: Not available