Can a Novel Natural Language Processing Model and Artificial Intelligence Automatically Generate Billing Codes From Spine Surgical Operative Notes?

Overview

Journal Global Spine J

Publisher Sage Publications

Date 2023 Mar 18

PMID 36932733

Authors

Bashar Zaidat

Justin Tang

Varun Arvind

Eric A Geng

Brian Cho

Akiro H Duey

Calista Dominy

Kiehyun D Riew

Samuel K Cho

Jun S Kim

Affiliations

Soon will be listed here.

Abstract

Study Design: Retrospective cohort.

Objective: Billing and coding-related administrative tasks are a major source of healthcare expenditure in the United States. We aim to show that a second-iteration Natural Language Processing (NLP) machine learning algorithm, XLNet, can automate the generation of CPT codes from operative notes in ACDF, PCDF, and CDA procedures.

Methods: We collected 922 operative notes from patients who underwent ACDF, PCDF, or CDA from 2015 to 2020 and included CPT codes generated by the billing code department. We trained XLNet, a generalized autoregressive pretraining method, on this dataset and tested its performance by calculating AUROC and AUPRC.

Results: The performance of the model approached human accuracy. Trial 1 (ACDF) achieved an AUROC of .82 (range: .48-.93), an AUPRC of .81 (range: .45-.97), and class-by-class accuracy of 77% (range: 34%-91%); trial 2 (PCDF) achieved an AUROC of .83 (.44-.94), an AUPRC of .70 (.45-.96), and class-by-class accuracy of 71% (42%-93%); trial 3 (ACDF and CDA) achieved an AUROC of .95 (.68-.99), an AUPRC of .91 (.56-.98), and class-by-class accuracy of 87% (63%-99%); trial 4 (ACDF, PCDF, CDA) achieved an AUROC of .95 (.76-.99), an AUPRC of .84 (.49-.99), and class-by-class accuracy of 88% (70%-99%).

Conclusions: We show that the XLNet model can be successfully applied to orthopedic surgeon's operative notes to generate CPT billing codes. As NLP models as a whole continue to improve, billing can be greatly augmented with artificial intelligence assisted generation of CPT billing codes which will help minimize error and promote standardization in the process.

Citing Articles

Evaluating Large Language Models for Automated CPT Code Prediction in Endovascular Neurosurgery.

Roy J, Self D, Isch E, Musmar B, Lan M, Keppetipola K J Med Syst. 2025; 49(1):15.

PMID: 39853605 DOI: 10.1007/s10916-025-02149-4.

Chatbot Demonstrates Moderate Interrater Reliability in Billing for Hand Surgery Clinic Encounters.

Latario L, Fowler J Hand (N Y). 2024; :15589447241295328.

PMID: 39548885 PMC: 11571175. DOI: 10.1177/15589447241295328.

Attitude of aspiring orthopaedic surgeons towards artificial intelligence: a multinational cross-sectional survey study.

Pawelczyk J, Kraus M, Eckl L, Nehrer S, Aurich M, Izadpanah K Arch Orthop Trauma Surg. 2024; 144(8):3541-3552.

PMID: 39127806 PMC: 11417067. DOI: 10.1007/s00402-024-05408-0.

Orthopaedic surgeons display a positive outlook towards artificial intelligence: A survey among members of the AGA Society for Arthroscopy and Joint Surgery.

Rupp M, Moser L, Hess S, Angele P, Aurich M, Dyrna F J Exp Orthop. 2024; 11(3):e12080.

PMID: 38974054 PMC: 11227606. DOI: 10.1002/jeo2.12080.

Applications of natural language processing tools in the surgical journey.

Le K, Tay S, Choy K, Verjans J, Sasanelli N, Kong J Front Surg. 2024; 11():1403540.

PMID: 38826809 PMC: 11140056. DOI: 10.3389/fsurg.2024.1403540.

References

Sakowski J, Kahn J, Kronick R, Newman J, Luft H . Peering into the black box: billing and insurance activities in a medical group. Health Aff (Millwood). 2009; 28(4):w544-54. DOI: 10.1377/hlthaff.28.4.w544. View

Burns M, Mathis M, Vandervest J, Tan X, Lu B, Colquhoun D . Classification of Current Procedural Terminology Codes from Electronic Health Record Data Using Machine Learning. Anesthesiology. 2020; 132(4):738-749. PMC: 7665375. DOI: 10.1097/ALN.0000000000003150. View

Morra D, Nicholson S, Levinson W, Gans D, Hammons T, Casalino L . US physician practices versus Canadians: spending nearly four times as much money interacting with payers. Health Aff (Millwood). 2011; 30(8):1443-50. DOI: 10.1377/hlthaff.2010.0893. View

Tseng P, Kaplan R, Richman B, Shah M, Schulman K . Administrative Costs Associated With Physician Billing and Insurance-Related Activities at an Academic Health Care System. JAMA. 2018; 319(7):691-697. PMC: 5839285. DOI: 10.1001/jama.2017.19148. View

Kahn J, Kronick R, Kreger M, Gans D . The cost of health insurance administration in California: estimates for insurers, physicians, and hospitals. Health Aff (Millwood). 2005; 24(6):1629-39. DOI: 10.1377/hlthaff.24.6.1629. View

Kim J, Vivas A, Arvind V, Lombardi J, Reidler J, Zuckerman S . Can Natural Language Processing and Artificial Intelligence Automate The Generation of Billing Codes From Operative Note Dictations?. Global Spine J. 2022; 13(7):1946-1955. PMC: 10556904. DOI: 10.1177/21925682211062831. View

Oh S, Kang M, Lee Y . Protected Health Information Recognition by Fine-Tuning a Pre-training Transformer Model. Healthc Inform Res. 2022; 28(1):16-24. PMC: 8850174. DOI: 10.4258/hir.2022.28.1.16. View

Drabiak K, Wolfson J . What Should Health Care Organizations Do to Reduce Billing Fraud and Abuse?. AMA J Ethics. 2020; 22(3):E221-231. DOI: 10.1001/amajethics.2020.221. View

Martin-Sanchez F, Verspoor K . Big data in medicine is driving big changes. Yearb Med Inform. 2014; 9:14-20. PMC: 4287083. DOI: 10.15265/IY-2014-0020. View

10.

Levy J, Vattikonda N, Haudenschild C, Christensen B, Vaickus L . Comparison of Machine-Learning Algorithms for the Prediction of Current Procedural Terminology (CPT) Codes from Pathology Reports. J Pathol Inform. 2022; 13:3. PMC: 8802304. DOI: 10.4103/jpi.jpi_52_21. View

11.

Huang J, Osorio C, Sy L . An empirical evaluation of deep learning for ICD-9 code assignment using MIMIC-III clinical notes. Comput Methods Programs Biomed. 2019; 177:141-153. DOI: 10.1016/j.cmpb.2019.05.024. View

12.

Soleymani M, Yaseri M, Farzadfar F, Mohammadpour A, Sharifi F, Kabir M . Detecting medical prescriptions suspected of fraud using an unsupervised data mining algorithm. Daru. 2018; 26(2):209-214. PMC: 6279664. DOI: 10.1007/s40199-018-0227-z. View

13.

Chernew M, Mintz H . Administrative Expenses in the US Health Care System: Why So High?. JAMA. 2021; 326(17):1679-1680. DOI: 10.1001/jama.2021.17318. View