Using UMLS lexical resources to disambiguate abbreviations in clinical text

AMIA Annu Symp Proc. 2011:2011:715-22. Epub 2011 Oct 22.

Abstract

Clinical text is rich in acronyms and abbreviations, and they are highly ambiguous. As a pre-processing step before subsequent NLP analysis, we are developing and evaluating clinical abbreviation disambiguation methods. The evaluation of two sequential steps, the detection and the disambiguation of abbreviations, is reported here, for various types of clinical notes. For abbreviations detection, our result indicated the SPECIALIST Lexicon LRABR needed to be revised for better abbreviation detection. Our semi-supervised method using generated training data based on expanded form matching for 12 frequent abbreviations in our clinical notes reached over 90% accuracy in five-fold cross validation and unsupervised approach produced comparable results with the semi-supervised methods.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Abbreviations as Topic*
  • Artificial Intelligence
  • Medical Records*
  • Natural Language Processing*
  • Unified Medical Language System*