Identification of methicillin-resistant Staphylococcus aureus within the nation's Veterans Affairs medical centers using natural language processing

BMC Med Inform Decis Mak. 2012 Jul 11:12:34. doi: 10.1186/1472-6947-12-34.

Abstract

Background: Accurate information is needed to direct healthcare systems' efforts to control methicillin-resistant Staphylococcus aureus (MRSA). Assembling complete and correct microbiology data is vital to understanding and addressing the multiple drug-resistant organisms in our hospitals.

Methods: Herein, we describe a system that securely gathers microbiology data from the Department of Veterans Affairs (VA) network of databases. Using natural language processing methods, we applied an information extraction process to extract organisms and susceptibilities from the free-text data. We then validated the extraction against independently derived electronic data and expert annotation.
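The abstract does not specify how the information extraction process works. As a purely illustrative sketch, pulling an organism name and its susceptibilities out of free text could look like the following. The report layout, the regular expressions, and the names `REPORT`, `extract`, and `is_mrsa` are all assumptions made for this example and do not represent the authors' system.

```python
import re

# Hypothetical free-text layout; real microbiology reports vary widely by site.
REPORT = """\
ORGANISM: STAPHYLOCOCCUS AUREUS
OXACILLIN  R
VANCOMYCIN  S
"""

# Assumed convention: an "ORGANISM:" line, then "<DRUG>  <S|I|R>" lines.
ORGANISM_RE = re.compile(r"^ORGANISM:[ \t]*(?P<name>.+?)[ \t]*$", re.MULTILINE)
SUSCEPT_RE = re.compile(
    r"^(?P<drug>[A-Z][A-Z/ -]*?)[ \t]{2,}(?P<result>[SIR])[ \t]*$", re.MULTILINE
)


def extract(report):
    """Return (organism, {drug: result}) found in a free-text report."""
    organism = ORGANISM_RE.search(report)
    results = {
        m["drug"].strip().lower(): m["result"] for m in SUSCEPT_RE.finditer(report)
    }
    return (organism["name"].lower() if organism else None), results


def is_mrsa(organism, results):
    """Flag S. aureus reported resistant to oxacillin or methicillin."""
    resistant = any(results.get(d) == "R" for d in ("oxacillin", "methicillin"))
    return organism == "staphylococcus aureus" and resistant
```

Calling `extract(REPORT)` and passing the result to `is_mrsa` flags the sample report as MRSA. A production system would also need to handle negation, misspellings, abbreviations, and site-specific formats, which is why validation against independently derived data and expert annotation matters.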

Results: We estimate that the collected microbiology data are 98.5% complete and that methicillin-resistant Staphylococcus aureus was extracted accurately 99.7% of the time.

Conclusions: Applying natural language processing methods to microbiology records appears to be a promising way to extract accurate and useful nosocomial pathogen surveillance data. Both scientific inquiry and the data's reliability will depend on the surveillance system's ability to compare data from multiple sources and circumvent systematic error. The dataset constructed and methods used for this investigation could contribute to a comprehensive infectious disease surveillance system or other pressing needs.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Bias
  • Hospitals, Veterans / statistics & numerical data*
  • Humans
  • Information Storage and Retrieval / methods*
  • Information Storage and Retrieval / standards
  • Internet / statistics & numerical data
  • Medical Records Systems, Computerized / standards*
  • Methicillin-Resistant Staphylococcus aureus* / isolation & purification
  • Microbiological Techniques / standards
  • Natural Language Processing*
  • Population Surveillance / methods
  • Quality Control
  • Reference Standards
  • Reproducibility of Results
  • Staphylococcal Infections / epidemiology
  • Staphylococcal Infections / microbiology
  • United States
  • United States Department of Veterans Affairs