Audiovisual asynchrony detection in human speech

Joost X Maier, Massimiliano Di Luca, Uta Noppeney

Research output: Contribution to journal, Article, peer-review

40 Citations (Scopus)


Combining information from the visual and auditory senses can greatly enhance intelligibility of natural speech. Integration of audiovisual speech signals is robust even when temporal offsets are present between the component signals. In the present study, we characterized the temporal integration window for speech and nonspeech stimuli with similar spectrotemporal structure to investigate to what extent humans have adapted to the specific characteristics of natural audiovisual speech. We manipulated spectrotemporal structure of the auditory signal, stimulus length, and task context. Results indicate that the temporal integration window is narrower and more asymmetric for speech than for nonspeech signals. When perceiving audiovisual speech, subjects tolerate visual leading asynchronies, but are nevertheless very sensitive to auditory leading asynchronies that are less likely to occur in natural speech. Thus, speech perception may be fine-tuned to the natural statistics of audiovisual speech, where facial movements always occur before acoustic speech articulation.
Original language: English
Pages (from-to): 245-256
Number of pages: 12
Journal: Journal of Experimental Psychology: Human Perception and Performance
Issue number: 1
Publication status: Published - 2011

