Natural Language Processing to Convert Unstructured COVID-19 Chest-CT Reports into Structured Reports
Overview
Authors
Affiliations
Background: Structured reporting has been demonstrated to increase report completeness and to reduce error rate, also enabling data mining of radiological reports. Still, structured reporting is perceived by radiologists as a fragmented reporting style, limiting their freedom of expression.
Purpose: A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.
Methods: Two hundred-two COVID-19 chest CT were retrospectively reviewed by two experienced radiologists, who wrote for each exam a free-form text radiological report and coherently filled the template provided by the Italian Society of Medical and Interventional Radiology, used as ground-truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried-out, the first without fine-tuning, the second one performing a fine-tuning. The performance was measured using the mean accuracy and the F1 mean score. An error analysis was performed to identify errors entirely attributable to incorrect processing of the model.
Results: The algorithm achieved a mean accuracy of 93.7% and an F1 score 93.8% in the first iteration. Most of the errors were exclusively attributable to wrong inference (46%). In the second iteration the model achieved for both parameters 95,8% and percentage of errors attributable to wrong inference decreased to 26%.
Conclusions: The convolutional neural network achieved an optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming all the limitation attributed to structured reporting and finally paving the way for data mining of radiological report.
Lopez-Ubeda P, Martin-Noguerol T, Escartin J, Cabrera-Zubizarreta A, Luna A Jpn J Radiol. 2024; .
PMID: 39730936 DOI: 10.1007/s11604-024-01721-1.
Feng N, Zhao S, Wang K, Chen P, Wang Y, Gao Y Eur J Radiol Open. 2024; 13:100609.
PMID: 39554616 PMC: 11566704. DOI: 10.1016/j.ejro.2024.100609.