Essentials: The Science of Learning & Speaking Languages | Dr. Eddie Chang
Neurosurgeon Dr. Eddie Chang discusses the neuroscience of speech and language, his groundbreaking brain-machine interface work enabling paralyzed patients to communicate, and broader topics including stuttering, augmentation ethics, and avatar-based communication. The conversation highlights the distinction between speech (the physical production of sound) and language (meaning, syntax, semantics), and showcases how decoding cortical activity can restore communication for locked-in patients.
Summary
Dr. Eddie Chang, a neurosurgeon at UCSF, joins Andrew Huberman to explore the neuroscience of speech and language. He begins by distinguishing the two: speech refers to the physical motor act of producing sound through the vocal tract, while language encompasses broader cognitive functions including semantics (meaning), syntax (grammar), and pragmatics (contextual understanding). Chang explains the mechanics of vocalization — how the larynx brings vocal folds together during exhalation to produce voicing around 100 Hz in men and 200 Hz in women, and how the pharynx, oral cavity, tongue, and lips then shape that raw sound into intelligible consonants and vowels. He notes that primitive vocalizations like crying and moaning are governed by evolutionarily older brain areas distinct from speech and language regions, which is why brainstem stroke patients can still moan but not speak.
The conversation then moves to Chang's clinical research on brain-machine interfaces (BMI) for paralyzed patients. He describes conditions like brainstem stroke and ALS that result in 'locked-in syndrome,' where cognitive function is fully intact but voluntary movement and speech are impossible. Chang describes the BRAVO clinical trial, in which electrodes were implanted over speech motor cortex areas in a patient named Pancho — a man paralyzed for 15 years following a brainstem stroke after a car accident. The system intercepts cortical signals as the patient attempts to speak, converts them via machine learning algorithms into decoded words, and uses autocorrect-style language models (trained on 50-word vocabularies) to improve accuracy. This represented the first time a paralyzed individual's intended speech was decoded directly from brain activity into words and sentences.
Chang also discusses the future of this technology, including the development of computer-animated avatars that can reproduce not only decoded speech but also facial expressions and mouth movements, making communication more natural and holistic. He envisions a near-term future where social interactions increasingly occur in digital/virtual spaces, making such avatar-based communication tools essential for disabled individuals. He connects this to broader consumer trends, suggesting that avatar-speech will become a mainstream communication format.
On the topic of cognitive augmentation via neurotechnology, Chang acknowledges that the field is progressing but argues that current commercial technologies are far from matching the bandwidth and sophistication of naturally evolved neural speech systems. He raises important ethical concerns about who gets access to augmentation technologies, whether society has thought through the implications, and notes that augmentation itself is not a new human behavior — caffeine, medications, and smartphones already augment cognition.
Finally, Chang discusses stuttering as a speech (not language) condition involving a breakdown in the precise motor coordination required for fluent speech. He explains that anxiety can exacerbate stuttering but does not cause it, and that auditory feedback — hearing oneself speak — plays a critical role, offering clues about the neural mechanisms involved. Treatment typically involves speech therapy focused on creating conditions for fluent initiation, and early intervention during periods of high neuroplasticity is generally beneficial.
Key Insights
- Chang argues that speech and language are neurologically distinct: speech is the motor production of sound via the vocal tract, while language encompasses semantics, syntax, and pragmatics processed in higher cortical areas.
- Chang explains that primitive vocalizations like moaning and crying are controlled by evolutionarily older brain regions shared with non-human primates, which is why brainstem stroke patients can still vocalize but cannot produce intelligible speech.
- Chang's BRAVO trial demonstrated for the first time that intended speech could be decoded directly from cortical activity in a paralyzed patient, using a 50-word vocabulary combined with autocorrect-style language modeling to produce sentences on a screen.
- Chang argues that current commercial brain-machine interface technologies do not come close to matching the bandwidth of naturally evolved neural speech systems, which are supported by millions of neurons refined over millions of years of evolution.
- Chang contends that while augmentation via neurotechnology raises serious ethical questions — particularly around access and societal implications — the pursuit of cognitive enhancement is not new to humans, as caffeine, medications, and smartphones already serve augmentative functions.
- Chang claims that stuttering is a breakdown in the precise motor coordination required for fluent speech, not a language problem, and that anxiety does not cause stuttering but can trigger or worsen it in predisposed individuals.
- Chang identifies auditory feedback — hearing oneself speak in real time — as a critical and underappreciated component of fluent speech production, and suggests that disruptions to this feedback loop may be involved in the neural mechanism of stuttering.
- Chang envisions that avatar-based speech neuroprosthetics — which decode not just words but also facial expressions and mouth movements — will allow paralyzed individuals to participate in the increasingly digital and virtual social spaces that most people already inhabit.
Topics
Full transcript available for MurmurCast members
Sign Up to Access