» Articles » PMID: 38951520

Shared Functional Specialization in Transformer-based Language Models and the Human Brain

Overview
Journal Nat Commun
Specialty Biology
Date 2024 Jul 1
PMID 38951520
Authors
Affiliations
Soon will be listed here.
Abstract

When processing language, the brain is thought to deploy specialized computations to construct meaning from complex linguistic structures. Recently, artificial neural networks based on the Transformer architecture have revolutionized the field of natural language processing. Transformers integrate contextual information across words via structured circuit computations. Prior work has focused on the internal representations ("embeddings") generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into the functionally-specialized "transformations" that integrate contextual information across words. Using functional MRI data acquired while participants listened to naturalistic stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent computations performed by individual, functionally-specialized "attention heads" differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers and context lengths in a low-dimensional cortical space.

Citing Articles

Linguistic coupling between neural systems for speech production and comprehension during real-time dyadic conversations.

Zada Z, Nastase S, Speer S, Mwilambwe-Tshilobo L, Tsoi L, Burns S bioRxiv. 2025; .

PMID: 39990465 PMC: 11844503. DOI: 10.1101/2025.02.14.638276.


Approximating the semantic space: word embedding techniques in psychiatric speech analysis.

Palominos C, He R, Frohlich K, Mulfarth R, Seuffert S, Sommer I Schizophrenia (Heidelb). 2024; 10(1):114.

PMID: 39622800 PMC: 11612388. DOI: 10.1038/s41537-024-00524-7.


A shared model-based linguistic space for transmitting our thoughts from brain to brain in natural conversations.

Zada Z, Goldstein A, Michelmann S, Simony E, Price A, Hasenfratz L Neuron. 2024; 112(18):3211-3222.e5.

PMID: 39096896 PMC: 11427153. DOI: 10.1016/j.neuron.2024.06.025.

References
1.
Wehbe L, Murphy B, Talukdar P, Fyshe A, Ramdas A, Mitchell T . Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses. PLoS One. 2014; 9(11):e112575. PMC: 4245107. DOI: 10.1371/journal.pone.0112575. View

2.
Vigneau M, Beaucousin V, Herve P, Duffau H, Crivello F, Houde O . Meta-analyzing left hemisphere language areas: phonology, semantics, and sentence processing. Neuroimage. 2006; 30(4):1414-32. DOI: 10.1016/j.neuroimage.2005.11.002. View

3.
Kuperberg G, McGuire P, Bullmore E, Brammer M, Rabe-Hesketh S, Wright I . Common and distinct neural substrates for pragmatic, semantic, and syntactic processing of spoken sentences: an fMRI study. J Cogn Neurosci. 2000; 12(2):321-41. DOI: 10.1162/089892900562138. View

4.
Fedorenko E, Behr M, Kanwisher N . Functional specificity for high-level linguistic processing in the human brain. Proc Natl Acad Sci U S A. 2011; 108(39):16428-33. PMC: 3182706. DOI: 10.1073/pnas.1112937108. View

5.
Bornkessel I, Zysset S, Friederici A, von Cramon D, Schlesewsky M . Who did what to whom? The neural basis of argument hierarchies during language comprehension. Neuroimage. 2005; 26(1):221-33. DOI: 10.1016/j.neuroimage.2005.01.032. View