Automating Data Abstraction in a Quality Improvement Platform for Surgical and Interventional Procedures
Overview
Affiliations
Objective: This paper describes a text processing system designed to automate the manual data abstraction process in a quality improvement (QI) program. The Surgical Care and Outcomes Assessment Program (SCOAP) is a clinician-led, statewide performance benchmarking QI platform for surgical and interventional procedures. The data elements abstracted as part of this program cover a wide range of clinical information from patient medical history to details of surgical interventions.
Methods: Statistical and rule-based extractors were developed to automatically abstract data elements. A preprocessing pipeline was created to chunk free-text notes into its sections, sentences, and tokens. The information extracted in this preprocessing step was used by the statistical and rule-based extractors as features.
Findings: Performance results for 25 extractors (14 statistical, 11 rule based) are presented. The average f1-scores for 11 rule-based extractors and 14 statistical extractors are 0.785 (min=0.576,max=0.931,std-dev=0.113) and 0.812 (min=0.571,max=0.993,std-dev=0.135) respectively.
Discussion: Our error analysis revealed that most extraction errors were due either to data imbalance in the data set or the way the gold standard had been created.
Conclusion: As future work, more experiments will be conducted with a more comprehensive data set from multiple institutions contributing to the QI project.
Tamang S, Humbert-Droz M, Gianfrancesco M, Izadi Z, Schmajuk G, Yazdany J JMIR Med Inform. 2023; 11:e37805.
PMID: 36595345 PMC: 9846439. DOI: 10.2196/37805.
Zhu Y, Simon G, Wick E, Abe-Jones Y, Najafi N, Sheka A J Am Coll Surg. 2021; 232(6):963-971.e1.
PMID: 33831539 PMC: 8679130. DOI: 10.1016/j.jamcollsurg.2021.03.026.
Devine E, Van Eaton E, Zadworny M, Symons R, Devlin A, Yanez D EGEMS (Wash DC). 2018; 6(1):8.
PMID: 29881766 PMC: 5983060. DOI: 10.5334/egems.211.
Enhanced Quality Measurement Event Detection: An Application to Physician Reporting.
Tamang S, Hernandez-Boussard T, Ross E, Gaskin G, Patel M, Shah N EGEMS (Wash DC). 2018; 5(1):5.
PMID: 29881731 PMC: 5983066. DOI: 10.13063/2327-9214.1270.
Hu Z, Melton G, Moeller N, Arsoniadis E, Wang Y, Kwaan M AMIA Annu Symp Proc. 2017; 2016:1822-1831.
PMID: 28269941 PMC: 5333220.