Real-time Classification of Auditory Sentences Using Evoked Cortical Activity in Humans
Overview
Neurology
Affiliations
Objective: Recent research has characterized the anatomical and functional basis of speech perception in the human auditory cortex. These advances have made it possible to decode speech information from activity in brain regions like the superior temporal gyrus, but no published work has demonstrated this ability in real-time, which is necessary for neuroprosthetic brain-computer interfaces.
Approach: Here, we introduce a real-time neural speech recognition (rtNSR) software package, which was used to classify spoken input from high-resolution electrocorticography signals in real-time. We tested the system with two human subjects implanted with electrode arrays over the lateral brain surface. Subjects listened to multiple repetitions of ten sentences, and rtNSR classified what was heard in real-time from neural activity patterns using direct sentence-level and HMM-based phoneme-level classification schemes.
Main Results: We observed single-trial sentence classification accuracies of [Formula: see text] or higher for each subject with less than 7 minutes of training data, demonstrating the ability of rtNSR to use cortical recordings to perform accurate real-time speech decoding in a limited vocabulary setting.
Significance: Further development and testing of the package with different speech paradigms could influence the design of future speech neuroprosthetic applications.
Silva A, Liu J, Metzger S, Bhaya-Grossman I, Dougherty M, Seaton M Nat Biomed Eng. 2024; 8(8):977-991.
PMID: 38769157 PMC: 11554235. DOI: 10.1038/s41551-024-01207-5.
A high-performance neuroprosthesis for speech decoding and avatar control.
Metzger S, Littlejohn K, Silva A, Moses D, Seaton M, Wang R Nature. 2023; 620(7976):1037-1046.
PMID: 37612505 PMC: 10826467. DOI: 10.1038/s41586-023-06443-4.
Wingfield C, Zhang C, Devereux B, Fonteneau E, Thwaites A, Liu X Front Comput Neurosci. 2023; 16:1057439.
PMID: 36618270 PMC: 9811675. DOI: 10.3389/fncom.2022.1057439.
Metzger S, Liu J, Moses D, Dougherty M, Seaton M, Littlejohn K Nat Commun. 2022; 13(1):6510.
PMID: 36347863 PMC: 9643551. DOI: 10.1038/s41467-022-33611-3.
Dataset of Speech Production in intracranial.Electroencephalography.
Verwoert M, Ottenhoff M, Goulis S, Colon A, Wagner L, Tousseyn S Sci Data. 2022; 9(1):434.
PMID: 35869138 PMC: 9307753. DOI: 10.1038/s41597-022-01542-9.