» Articles » PMID: 29378977

Real-time Classification of Auditory Sentences Using Evoked Cortical Activity in Humans

Overview
Journal J Neural Eng
Date 2018 Jan 31
PMID 29378977
Citations 18
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: Recent research has characterized the anatomical and functional basis of speech perception in the human auditory cortex. These advances have made it possible to decode speech information from activity in brain regions like the superior temporal gyrus, but no published work has demonstrated this ability in real-time, which is necessary for neuroprosthetic brain-computer interfaces.

Approach: Here, we introduce a real-time neural speech recognition (rtNSR) software package, which was used to classify spoken input from high-resolution electrocorticography signals in real-time. We tested the system with two human subjects implanted with electrode arrays over the lateral brain surface. Subjects listened to multiple repetitions of ten sentences, and rtNSR classified what was heard in real-time from neural activity patterns using direct sentence-level and HMM-based phoneme-level classification schemes.

Main Results: We observed single-trial sentence classification accuracies of [Formula: see text] or higher for each subject with less than 7 minutes of training data, demonstrating the ability of rtNSR to use cortical recordings to perform accurate real-time speech decoding in a limited vocabulary setting.

Significance: Further development and testing of the package with different speech paradigms could influence the design of future speech neuroprosthetic applications.

Citing Articles

A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.

Silva A, Liu J, Metzger S, Bhaya-Grossman I, Dougherty M, Seaton M Nat Biomed Eng. 2024; 8(8):977-991.

PMID: 38769157 PMC: 11554235. DOI: 10.1038/s41551-024-01207-5.


A high-performance neuroprosthesis for speech decoding and avatar control.

Metzger S, Littlejohn K, Silva A, Moses D, Seaton M, Wang R Nature. 2023; 620(7976):1037-1046.

PMID: 37612505 PMC: 10826467. DOI: 10.1038/s41586-023-06443-4.


On the similarities of representations in artificial and brain neural networks for speech recognition.

Wingfield C, Zhang C, Devereux B, Fonteneau E, Thwaites A, Liu X Front Comput Neurosci. 2023; 16:1057439.

PMID: 36618270 PMC: 9811675. DOI: 10.3389/fncom.2022.1057439.


Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis.

Metzger S, Liu J, Moses D, Dougherty M, Seaton M, Littlejohn K Nat Commun. 2022; 13(1):6510.

PMID: 36347863 PMC: 9643551. DOI: 10.1038/s41467-022-33611-3.


Dataset of Speech Production in intracranial.Electroencephalography.

Verwoert M, Ottenhoff M, Goulis S, Colon A, Wagner L, Tousseyn S Sci Data. 2022; 9(1):434.

PMID: 35869138 PMC: 9307753. DOI: 10.1038/s41597-022-01542-9.


References
1.
Leonard M, Baud M, Sjerps M, Chang E . Perceptual restoration of masked speech in human cortex. Nat Commun. 2016; 7:13619. PMC: 5187421. DOI: 10.1038/ncomms13619. View

2.
Rauschecker J, Scott S . Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci. 2009; 12(6):718-24. PMC: 2846110. DOI: 10.1038/nn.2331. View

3.
Leuthardt E, Miller K, Schalk G, Rao R, Ojemann J . Electrocorticography-based brain computer interface--the Seattle experience. IEEE Trans Neural Syst Rehabil Eng. 2006; 14(2):194-8. DOI: 10.1109/TNSRE.2006.875536. View

4.
. Recommendations for use of uniform nomenclature pertinent to patients with severe alterations in consciousness. American Congress of Rehabilitation Medicine. Arch Phys Med Rehabil. 1995; 76(2):205-9. DOI: 10.1016/s0003-9993(95)80031-x. View

5.
Mugler E, Patton J, Flint R, Wright Z, Schuele S, Rosenow J . Direct classification of all American English phonemes using signals from functional speech motor cortex. J Neural Eng. 2014; 11(3):035015. PMC: 4097188. DOI: 10.1088/1741-2560/11/3/035015. View