Use this URL to cite or link to this record in EThOS: http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.666646
Title: Similarity in the context of the orphan drug legislation
Author: Pereira Franco, Joao Pedro
ISNI:       0000 0004 5356 0618
Awarding Body: University of Sheffield
Current Institution: University of Sheffield
Date of Award: 2015
Availability of Full Text:
Access through EThOS:
Access through Institution:
Abstract:
An orphan drug is a medicinal product intended for the treatment of a rare disease that affects only a small number of patients, e.g., five in ten thousand. According to the current European orphan drug legislation, the European Union shall not, for a period of ten years, accept another orphan medicinal product for the same therapeutic indication, in respect of a similar medicinal product. Thus far, the European Medicines Agency has used human judgments of similarity when assessing new medicines for rare diseases. The project reported here seeks to develop quantitative methods for this purpose. The project began with an analysis of the correlation between human and computed judgments of similarity for 100 pairs of molecules chosen from the Drug Bank 3.0 database. The human similarity assessments for these pairs of molecules were obtained from a total of 143 experts from Europe, Asia and the US, with the experts being asked to state whether each pair was, or was not, similar. The percentage of the experts judging a pair to be similar was then compared to the Tanimoto coefficient computed using a range of different types of descriptors (1D, 2D and 3D), with the aim of identifying those descriptors that correlated most closely with the human judgments. The following types of fingerprint were studied: ECFP4, ECFC4, Daylight, Unity, BCI, MDL as implemented in the Pipeline Pilot system; and CDK Extended, CDK Standard, Estate, PubChem, MACCS, Morgan, Feat Morgan, Atom Pair, Torsion, RDKit, Avalon, Layers, FP:TGD and FP:TGT as implemented in the KNIME system. The 3D fingerprints studied were the following: FP:TAD and FP:TAT as implemented in the KNIME protocol. 1D molecular property descriptors were also studied but these proved to be of only limited effectiveness for this application. Logistic regression models were developed for each type of descriptor, relating the Tanimoto similarity for a pair of molecules computed with the probability of the human experts regarding that pair as being similar. The resulting regression models were then validated using a separate test-set containing 100 pairs of molecules that had previously been evaluated by the European Medicines Agency in the context of the authorisation of medicines for rare diseases. The best models were able to reproduce over 95% of the human judgments. This success rate was increased to 98-99% using a simple data fusion approach in which a pair of molecules is classified as similar (or non-similar) when three or more of the individual fingerprints are in agreement. The results obtained here suggest that computed Tanimoto values based on 2D descriptors could provide a useful source of information when assessing new active substances that are being proposed for the treatment of rare diseases.
Supervisor: Willett, Peter ; Holliday, John Sponsor: Not available
Qualification Name: Thesis (Ph.D.) Qualification Level: Doctoral
EThOS ID: uk.bl.ethos.666646  DOI: Not available
Share: