Applying active learning to supervised word sense disambiguation in MEDLINE

Abstract
Objectives To assess whether active learning strategies can be integrated with supervised word sense disambiguation (WSD) methods, thus reducing the number of annotated samples while maintaining or improving the quality of disambiguation models.

Methods We developed support vector machine (SVM) classifiers to disambiguate 197 ambiguous terms and abbreviations in the MSH WSD collection. Three uncertainty sampling-based active learning algorithms were implemented with the SVM classifiers and compared with a passive learner (PL) based on random sampling. For each ambiguous term and each learning algorithm, we generated a learning curve that plots accuracy on the test set as a function of the number of annotated samples used to train the model. The area under the learning curve (ALC) was used as the primary evaluation metric.

Results Our experiments demonstrated that active learners (ALs) significantly outperformed the PL, showing better performance for 177 out of 197 (89.8%) WSD tasks. Further analysis showed that to achieve an average accuracy of 90%, the PL needed 38 annotated samples, while the ALs needed only 24, a 37% reduction in annotation effort. Moreover, we analyzed cases where the active learning algorithms did not achieve superior performance and identified three causes: (1) poor models in the early learning stage; (2) easy WSD cases; and (3) difficult WSD cases. These findings provide useful insight for future improvements.

Conclusions This study demonstrated that integrating active learning strategies with supervised WSD methods can effectively reduce annotation cost and improve disambiguation models.
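To make the experimental setup concrete, the following is a minimal sketch (not the authors' code) of one uncertainty sampling variant, margin-based querying with a linear SVM, compared against a passive learner that queries at random. It assumes scikit-learn and uses synthetic two-sense data as a stand-in for the MSH WSD feature vectors; the mean accuracy over query steps is used here as a simple proxy for the ALC metric.

```python
# Sketch of uncertainty sampling (margin-based) vs. random sampling with an SVM.
# Assumptions: scikit-learn, synthetic binary data in place of real WSD features.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=600, n_features=50, n_informative=10,
                           n_classes=2, random_state=0)
X_pool, X_test, y_pool, y_test = train_test_split(X, y, test_size=0.5,
                                                  random_state=0)

def learning_curve(select_next, n_queries=40):
    """Grow the labeled set one query at a time and record test accuracy."""
    # Seed with one labeled example per sense so the SVM can be trained.
    labeled = [int(np.where(y_pool == c)[0][0]) for c in np.unique(y_pool)]
    accs = []
    for _ in range(n_queries):
        clf = SVC(kernel="linear").fit(X_pool[labeled], y_pool[labeled])
        accs.append(clf.score(X_test, y_test))
        unlabeled = np.setdiff1d(np.arange(len(X_pool)), labeled)
        labeled.append(int(select_next(clf, unlabeled)))
    return np.array(accs)

def uncertainty_query(clf, unlabeled):
    # Active learner: query the instance closest to the decision boundary.
    margins = np.abs(clf.decision_function(X_pool[unlabeled]))
    return unlabeled[np.argmin(margins)]

def random_query(clf, unlabeled):
    # Passive learner: query a randomly chosen unlabeled instance.
    return rng.choice(unlabeled)

al_curve = learning_curve(uncertainty_query)
pl_curve = learning_curve(random_query)
# Mean accuracy across query steps, a rough stand-in for the ALC metric.
print("ALC (active, margin sampling):", al_curve.mean())
print("ALC (passive, random sampling):", pl_curve.mean())
```

The same loop can be rerun with least-confidence or entropy-based scoring in place of the margin criterion to obtain the other uncertainty sampling variants described in the study.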