Reusable Filtering Functions for Application in ICU data: a case study

AMIA Annu Symp Proc. 2017 Feb 10:2016:844-853. eCollection 2016.

Abstract

Complex medical data sometimes requires significant data preprocessing to prepare for analysis. The complexity can lead non-domain experts to apply simple filters of available data or to not use the data at all. The preprocessing choices can also have serious effects on the results of the study if incorrect decision or missteps are made. In this work, we present open-source data filters for an analysis motivated by understanding mortality in the context of sepsis- associated cardiomyopathy in the ICU. We report specific ICU filters and validations through chart review and graphs. These published filters reduce the complexity of using data in analysis by (1) encapsulating the domain expertise and feature engineering applied to the filter, by (2) providing debugged and ready code for use, and by (3) providing sensible validations. We intend these filters to evolve through pull requests and forks and serve as common starting points for specific analyses.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Cardiomyopathies / etiology*
  • Cardiomyopathies / mortality
  • Cardiomyopathies / therapy
  • Databases, Factual*
  • Echocardiography
  • Female
  • Hospital Mortality
  • Humans
  • Information Storage and Retrieval / methods*
  • Intensive Care Units / organization & administration*
  • Logistic Models
  • Male
  • Medical Records Systems, Computerized
  • Middle Aged
  • Organizational Case Studies
  • Sepsis / complications*
  • Software*