Physical and perceptual factors shape the neural mechanisms that integrate audiovisual signals in speech comprehension

Hweeling Lee*, Uta Noppeney, Uta Noppeney

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

38 Citations (Scopus)


Face-to-face communication challenges the human brain to integrate information from auditory and visual senses with linguistic representations. Yet the role of bottom-up physical (spectrotemporal structure) input and top-down linguistic constraints in shaping the neural mechanisms specialized for integrating audiovisual speech signals are currently unknown. Participants were presented with speech and sinewave speech analogs in visual, auditory, and audiovisual modalities. Before the fMRI study, they were trained to perceive physically identical sinewave speech analogs as speech (SWS-S) or nonspeech (SWS-N). Comparing audiovisual integration (interactions) of speech, SWS-S, and SWS-N revealed a posterior-anterior processing gradient within the left superior temporal sulcus/gyrus (STS/STG): Bilateral posterior STS/STG integrated audiovisual inputs regardless of spectrotemporal structure or speech percept; in left mid-STS, the integration profile was primarily determined by the spectrotemporal structure of the signals; more anterior STS regions discarded spectrotemporal structure and integrated audiovisual signals constrained by stimulus intelligibility and the availability of linguistic representations. In addition to this "ventral" processing stream, a "dorsal" circuitry encompassing posterior STS/STG and left inferior frontal gyrus differentially integrated audiovisual speech andSWSsignals. Indeed, dynamic causal modeling and Bayesian model comparison provided strong evidence for a parallel processing structure encompassing a ventral and a dorsal stream with speech intelligibility training enhancing the connectivity between posterior and anterior STS/STG. In conclusion, audiovisual speech comprehension emerges in an interactive process with the integration of auditory and visual signals being progressively constrained by stimulus intelligibility along the STS and spectrotemporal structure in a dorsal fronto-temporal circuitry.

Original languageEnglish
Pages (from-to)11338-11350
Number of pages13
JournalThe Journal of Neuroscience
Issue number31
Publication statusPublished - 3 Aug 2011

ASJC Scopus subject areas

  • General Neuroscience


Dive into the research topics of 'Physical and perceptual factors shape the neural mechanisms that integrate audiovisual signals in speech comprehension'. Together they form a unique fingerprint.

Cite this