Identifying Caregiver Availability Using Medical Notes With Rule-Based Natural Language Processing: Retrospective Cohort Study

Elham Mahmoudi; Wenbo Wu; Cyrus Najarian; James Aikens; Julie Bynum; V G Vinod Vydiswaran

doi:10.2196/40241

JMIR Aging. 2022 Sep 22;5(3):e40241. doi: 10.2196/40241.

Authors

Elham Mahmoudi^#¹ ², Wenbo Wu³, Cyrus Najarian⁴, James Aikens¹, Julie Bynum⁴, V G Vinod Vydiswaran⁵

Affiliations

¹ Department of Family Medicine, Medical School, University of Michigan, Ann Arbor, MI, United States.
² Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, MI, United States.
³ Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, United States.
⁴ Medical School, University of Michigan, Ann Arbor, MI, United States.
⁵ Department of Learning Health Sciences, Medical School, University of Michigan, Ann Arbor, MI, United States.

^# Contributed equally.

PMID: 35998328
PMCID: PMC9539648
DOI: 10.2196/40241

Abstract

Background: Identifying caregiver availability, particularly for patients with dementia or a disability, is critical to informing appropriate care planning by health systems, hospitals, and providers. This information is not readily available, and there is a paucity of pragmatic approaches to automatically identifying caregiver availability and type.

Objective: Our main objective was to use medical notes to assess caregiver availability and type for hospitalized patients with dementia. Our second objective was to identify whether the patient lived at home or resided at an institution.

Methods: In this retrospective cohort study, we used 2016-2019 telephone-encounter medical notes from a single institution to develop a rule-based natural language processing (NLP) algorithm to identify the patient's caregiver availability and place of residence. Using note-level data, we compared the results of the NLP algorithm with human-conducted chart abstraction for both the training set (749/976, 77%) and the test set (227/976, 23%), covering a total of 223 adults aged 65 years and older diagnosed with dementia. Our outcomes were whether the patients (1) resided at home or in an institution, (2) had a formal caregiver, and (3) had an informal caregiver.
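To make the idea of a rule-based note classifier concrete, the following is a minimal sketch in Python. It is not the study's algorithm: the abstract does not list the rules, so the keyword patterns and the `classify_note` function below are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical keyword patterns for illustration only; the rules actually
# used in the study are not given in this abstract.
INFORMAL = re.compile(
    r"\b(daughter|son|wife|husband|spouse|niece|nephew|granddaughter|grandson)\b",
    re.IGNORECASE,
)
FORMAL = re.compile(
    r"\b(home health aide|visiting nurse|paid caregiver|home care agency)\b",
    re.IGNORECASE,
)
INSTITUTION = re.compile(
    r"\b(nursing home|assisted living|memory care|skilled nursing facility)\b",
    re.IGNORECASE,
)


def classify_note(text: str) -> dict:
    """Return one boolean flag per outcome for a single note."""
    return {
        "informal_caregiver": bool(INFORMAL.search(text)),
        "formal_caregiver": bool(FORMAL.search(text)),
        "institution": bool(INSTITUTION.search(text)),
    }


if __name__ == "__main__":
    note = "Spoke with patient's daughter about medication refill."
    print(classify_note(note))
```

A plain keyword matcher like this has no handling for misspelled facility names, negation, or past and uncertain status, so it should be read only as a starting point for the kind of rules the Methods describe.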

Results: Test set results indicated that our NLP algorithm had a high level of accuracy and reliability for identifying whether patients had an informal caregiver (F₁=0.94, accuracy=0.95, sensitivity=0.97, and specificity=0.93), but was relatively less able to identify whether the patient lived at an institution (F₁=0.64, accuracy=0.90, sensitivity=0.51, and specificity=0.98). The most common explanations for NLP misclassifications across all categories were (1) incomplete or misspelled facility names; (2) past, uncertain, or undecided status; (3) uncommon abbreviations; and (4) irregular use of templates.
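For readers less familiar with the reported metrics, the sketch below shows how F₁, accuracy, sensitivity, and specificity are conventionally computed from note-level confusion counts. The function name `binary_metrics` and the counts in the usage example are hypothetical and are not taken from the study.

```python
def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard binary classification metrics from confusion counts.

    tp/fp/tn/fn are counts of notes where the algorithm's label is
    compared with the human chart abstraction (the reference standard).
    """
    def ratio(num: int, den: int) -> float:
        # Return 0.0 rather than raising when a denominator is empty.
        return num / den if den else 0.0

    sensitivity = ratio(tp, tp + fn)   # recall: true positives found
    specificity = ratio(tn, tn + fp)   # true negatives found
    precision = ratio(tp, tp + fp)     # positive predictive value
    accuracy = ratio(tp + tn, tp + fp + tn + fn)
    pr_sum = precision + sensitivity
    f1 = 2 * precision * sensitivity / pr_sum if pr_sum else 0.0
    return {
        "f1": f1,
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(binary_metrics(tp=8, fp=2, tn=88, fn=2))
```

Because accuracy counts true negatives and F₁ does not, a classifier can score high on accuracy while missing many positive cases, which is the general pattern seen when accuracy is high but sensitivity and F₁ are lower.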

Conclusions: This innovative work was the first to use medical notes to pragmatically determine caregiver availability. Our NLP algorithm identified whether hospitalized patients with dementia had a formal or informal caregiver and, to a lesser extent, whether they lived at home or in an institutional setting. There is merit in using NLP to identify caregivers. This study serves as a proof of concept. Future work can use other approaches to further identify caregivers and the extent of their availability.

Keywords: Alzheimer; aging; algorithm; care planning; caregiver; dementia; elderly care; elderly population; health care; medical notes; natural language processing; pragmatic.

©Elham Mahmoudi, Wenbo Wu, Cyrus Najarian, James Aikens, Julie Bynum, V G Vinod Vydiswaran. Originally published in JMIR Aging (https://aging.jmir.org), 22.09.2022.
