Distinguishing Academic Science Writing from Humans or ChatGPT with over 99% Accuracy Using Off-the-shelf Machine Learning Tools

Overview
Publisher Cell Press
Date 2023 Jul 10
PMID 37426542
Abstract

ChatGPT has enabled access to artificial intelligence (AI)-generated writing for the masses, initiating a culture shift in the way people work, learn, and write. The need to discriminate human writing from AI is now both critical and urgent. Addressing this need, we report a method for discriminating text generated by ChatGPT from (human) academic scientists, relying on prevalent and accessible supervised classification methods. The approach uses new features for discriminating (these) humans from AI; as examples, scientists write long paragraphs and have a penchant for equivocal language, frequently using words like "but," "however," and "although." With a set of 20 features, we built a model that assigns the author, as human or AI, at over 99% accuracy. This strategy could be further adapted and developed by others with basic skills in supervised classification, enabling access to many highly accurate and targeted models for detecting AI usage in academic writing and beyond.
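
The kind of features the abstract mentions (paragraph length and the frequency of equivocal words such as "but," "however," and "although") can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not list the 20 features, so the function name `paragraph_features`, the feature names, and the sentence-splitting heuristic are assumptions made for illustration only.

```python
import re

# Hypothetical feature set: the abstract names only paragraph length and
# equivocal words as examples; the full 20 features are not listed there.
EQUIVOCAL_WORDS = ("but", "however", "although")


def paragraph_features(paragraph: str) -> dict:
    """Return simple stylometric counts for one paragraph."""
    # Lowercased word tokens (letters and apostrophes only).
    words = re.findall(r"[a-z']+", paragraph.lower())
    # Crude sentence split: break on whitespace that follows ., ! or ?
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]
    features = {
        "n_words": len(words),
        "n_sentences": len(sentences),
    }
    # One count per equivocal word, matched as a whole token.
    for word in EQUIVOCAL_WORDS:
        features[f"count_{word}"] = words.count(word)
    return features


example = "The signal was strong. However, the control also responded, although weakly."
print(paragraph_features(example))
```

Feature dictionaries of this shape, computed for paragraphs of known authorship, could then be passed to any off-the-shelf supervised classifier, which is the general strategy the abstract describes.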

Citing Articles

ChatGPT and exercise prescription: Human vs. machine or human plus machine?

Cavazzotto T, Dantas D, Queiroga M J Sport Health Sci. 2024; .

PMID: 39492473 PMC: 11282324. DOI: 10.1016/j.jshs.2023.10.008.


Analysis of ChatGPT Responses to Ophthalmic Cases: Can ChatGPT Think like an Ophthalmologist?

Chen J, Reddy A, Al-Sharif E, Shoji M, Kalaw F, Eslani M Ophthalmol Sci. 2024; 5(1):100600.

PMID: 39346575 PMC: 11437840. DOI: 10.1016/j.xops.2024.100600.


Exploring the molecular mechanisms and shared potential drugs between rheumatoid arthritis and arthrofibrosis based on large language model and synovial microenvironment analysis.

Wei Z, Chen X, Sun Y, Zhang Y, Dong R, Wang X Sci Rep. 2024; 14(1):18939.

PMID: 39147768 PMC: 11327321. DOI: 10.1038/s41598-024-69080-5.


Unveiling ChatGPT text using writing style.

Berriche L, Larabi-Marie-Sainte S Heliyon. 2024; 10(12):e32976.

PMID: 38984302 PMC: 11231544. DOI: 10.1016/j.heliyon.2024.e32976.


How to fight fake papers: a review on important information sources and steps towards solution of the problem.

Wittau J, Seifert R Naunyn Schmiedebergs Arch Pharmacol. 2024; 397(12):9281-9294.

PMID: 38970685 PMC: 11582211. DOI: 10.1007/s00210-024-03272-8.

