Validation of prediction models for critical care outcomes using natural language processing of electronic health record data.

Marafino BJ, Park M, Davies JM, Thombley R, Luft HS, Sing DC, Kazi DS, DeJong C, Boscardin WJ, Dean ML, Dudley RA. JAMA Netw Open. 1(8):e185097. doi: 10.1001/jamanetworkopen.2018.5097. 2018 Dec 07

Abstract

Importance: Accurate prediction of outcomes among patients in intensive care units (ICUs) is important for clinical research and monitoring care quality. Most existing prediction models do not take full advantage of the electronic health record, using only the single worst value of laboratory tests and vital signs and largely ignoring information present in free-text notes. Whether capturing more of the available data and applying machine learning and natural language processing (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown.

Objectives: To evaluate the change in power for a mortality prediction model among patients in the ICU achieved by incorporating measures of clinical trajectory together with NLP of clinical text and to assess the generalizability of this approach.
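The abstract does not say how clinical trajectory was measured. As a purely hypothetical illustration of why trajectory can add information beyond extreme values, the sketch below compares a lowest/highest summary with a least-squares slope over time. The function names, the choice of slope and first-to-last change as trajectory measures, and the lactate readings are all assumptions made for this example, not the study's features or data.

```python
from statistics import mean


def extreme_features(values):
    """Baseline-style summary: lowest and highest observed value only."""
    return {"min": min(values), "max": max(values)}


def trajectory_features(times_h, values):
    """Hypothetical trajectory summary (an assumption, not the study's
    definition): least-squares slope per hour and first-to-last change."""
    t_bar = mean(times_h)
    v_bar = mean(values)
    sxx = sum((t - t_bar) ** 2 for t in times_h)
    sxy = sum((t - t_bar) * (v - v_bar) for t, v in zip(times_h, values))
    slope = sxy / sxx if sxx > 0 else 0.0
    return {"slope_per_h": slope, "delta": values[-1] - values[0]}


# Made-up lactate readings (mmol/L) at 6-hour intervals for two patients.
times_h = [0.0, 6.0, 12.0, 18.0, 24.0]
improving = [4.0, 3.5, 3.0, 2.5, 2.0]
worsening = [2.0, 2.5, 3.0, 3.5, 4.0]

# Both patients share the same extremes, so a min/max summary cannot
# tell them apart; the sign of the slope can.
same_extremes = extreme_features(improving) == extreme_features(worsening)
slope_improving = trajectory_features(times_h, improving)["slope_per_h"]
slope_worsening = trajectory_features(times_h, worsening)["slope_per_h"]
```

In this toy case the two series have identical minimum and maximum values but slopes of opposite sign, which is the kind of distinction a trajectory feature could expose to a model.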

Design, Setting, and Participants: This retrospective cohort study included 101,196 patients with a first-time admission to the ICU and a length of stay of at least 4 hours. Twenty ICUs at 2 academic medical centers (University of California, San Francisco [UCSF], and Beth Israel Deaconess Medical Center [BIDMC], Boston, Massachusetts) and 1 community hospital (Mills-Peninsula Medical Center [MPMC], Burlingame, California) contributed data from January 1, 2001, through June 1, 2017. Data were analyzed from July 1, 2017, through August 1, 2018.

Main Outcomes and Measures: In-hospital mortality and model discrimination as assessed by the area under the receiver operating characteristic curve (AUC) and model calibration as assessed by the modified Hosmer-Lemeshow statistic.

Results: Among 101,196 patients included in the analysis, 51.3% (n = 51,899) were male, with a mean (SD) age of 61.3 (17.1) years; their in-hospital mortality rate was 10.4% (n = 10,505). A baseline model using only the highest and lowest observed values for each laboratory test result or vital sign achieved a cross-validated AUC of 0.831 (95% CI, 0.830-0.832). In contrast, that model augmented with measures of clinical trajectory achieved an AUC of 0.899 (95% CI, 0.896-0.902; P < .001 for AUC difference). Further augmenting this model with NLP-derived terms associated with mortality further increased the AUC to 0.922 (95% CI, 0.916-0.924; P < .001). These NLP-derived terms were associated with improved model performance even when applied across sites (AUC difference for UCSF: 0.077 to 0.021; AUC difference for MPMC: 0.071 to 0.051; AUC difference for BIDMC: 0.035 to 0.043; P < .001) when augmenting with NLP at each site.
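For readers unfamiliar with the two measures named above, the following is a minimal sketch of how discrimination and calibration can be computed. The AUC is calculated as the probability that a randomly chosen death receives a higher predicted risk than a randomly chosen survivor. The calibration function implements the classic Hosmer-Lemeshow chi-square over risk-sorted groups; the abstract refers to a modified version without describing the modification, so the classic form is substituted here as an assumption. The function names and the toy labels and probabilities are invented for illustration and are not study data.

```python
def auc(y_true, y_score):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def hosmer_lemeshow(y_true, y_prob, groups=10):
    """Classic Hosmer-Lemeshow statistic (not the modified version the
    abstract mentions): sort by predicted risk, split into near-equal
    groups, and sum (observed - expected)^2 / (expected * (1 - mean risk))."""
    pairs = sorted(zip(y_prob, y_true))
    n = len(pairs)
    stat = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        if not chunk:
            continue
        observed = sum(y for _, y in chunk)
        expected = sum(p for p, _ in chunk)
        denom = expected * (1.0 - expected / len(chunk))
        if denom > 0:
            stat += (observed - expected) ** 2 / denom
    return stat


# Toy example: 1 = died in hospital, 0 = survived (made-up values).
y_true = [0, 0, 1, 0, 1, 1, 0, 1]
y_prob = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]

toy_auc = auc(y_true, y_prob)
toy_hl = hosmer_lemeshow(y_true, y_prob, groups=2)
```

A higher AUC indicates better separation of deaths from survivors, while a smaller Hosmer-Lemeshow statistic indicates that predicted and observed event counts agree more closely within risk groups.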

Conclusions and Relevance: Intensive care unit mortality prediction models incorporating measures of clinical trajectory and NLP-derived terms yielded excellent predictive performance and generalized well in this sample of hospitals. The role of these automated algorithms, particularly those using unstructured data from notes and other sources, in clinical research and quality improvement seems to merit additional investigation.

Related Publications

Comparative usability study of a newly created patient-centered tool and Medicare.gov plan finder to help Medicare beneficiaries choose prescription drug plans.

Stults CD, Fattahi S, Meehan A, Bundorf MK, Chan AS, Pun T, Tai-Seale M.
J Patient Exp. 6(1):81-86. doi: 10.1177/2374373518778343. Epub 2018 Jun 6.
2019 Mar 01

Predicting need for advanced illness or palliative care in a primary care population using electronic health record data.

Jung K, Sudat SEK, Kwon N, Stewart WF, Shah NH.
J Biomed Inform. 92:103115.
2019 Apr 01

Impact of home-based, patient-centered support for people with advanced illness in an open health system: a retrospective claims analysis of health expenditures, utilization, and quality of care at end of life.

Sudat SEK, Franco A, Pressman AR, Rosenfeld K, Gornet E, Stewart W.
Palliat Med. 2018 Feb;32(2):485-492. doi: 10.1177/0269216317711824. Epub 2017 Jun 7.
2018 Feb 01

Adherence to placebo and mortality in the Beta Blocker Evaluation of Survival Trial (BEST).

Pressman A, Avins AL, Neuhaus J, Ackerson L, Rudd P.
Contemp Clin Trials. 33(3):492-8. doi: 10.1016/j.cct.2011.12.003. Epub 2012 Jan 12.
2012 May 01

Machine-based expert recommendations and insurance choices among Medicare Part D enrollees.

Bundorf MK, Polyakova M, Stults C, Meehan A, Klimke R, Pun T, Chan AS, Tai-Seale M.
Health Aff (Millwood). 38(3):482-490. doi: 10.1377/hlthaff.2018.05017.
2019 Mar 01
