Roles of voice onset time and F0 in stop consonant voicing perception

Effects of masking noise and low-pass filtering

Matthew B. Winn, Monita Chatterjee, William J. Idsardi

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Purpose: The contributions of voice onset time (VOT) and fundamental frequency (F0) were evaluated for the perception of voicing in syllable-initial stop consonants in words that were low-pass filtered and/or masked by speech-shaped noise. It was expected that listeners would rely less on VOT and more on F0 in these degraded conditions. Method: Twenty young listeners with normal hearing identified modified natural speech tokens that varied by VOT and F0 in several conditions of low-pass filtering and masking noise. Stimuli included /b/-/p/ and /d/-/t/ continua that were presented in separate blocks. Identification results were modeled using mixed-effects logistic regression. Results: When speech was filtered and/or masked by noise, listeners' voicing perceptions were driven less by VOT and more by F0. Speech-shaped masking noise exerted greater effects on the /b/-/p/ contrast, while low-pass filtering exerted greater effects on the /d/-/t/ contrast, consistent with the acoustics of these contrasts. Conclusion: Listeners can adjust their use of acoustic-phonetic cues in a dynamic way that is appropriate for challenging listening conditions; cues that are less influential in ideal conditions can gain priority in challenging conditions.

Original languageEnglish (US)
Pages (from-to)1097-1107
Number of pages11
JournalJournal of Speech, Language, and Hearing Research
Volume56
Issue number4
DOIs
StatePublished - Aug 1 2013

Fingerprint

listener
Noise
Acoustics
acoustics
Cues
Phonetics
phonetics
Hearing
stimulus
Logistic Models
logistics
regression
time
Voice Onset Time
Masking
Listeners
Stop Consonants
Voicing

Keywords

  • Bandwidth
  • Noise
  • Speech perception
  • Voicing contrast

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Speech and Hearing

Cite this

Roles of voice onset time and F0 in stop consonant voicing perception : Effects of masking noise and low-pass filtering. / Winn, Matthew B.; Chatterjee, Monita; Idsardi, William J.

In: Journal of Speech, Language, and Hearing Research, Vol. 56, No. 4, 01.08.2013, p. 1097-1107.

Research output: Contribution to journalArticle

@article{d1b166b282ca473099602a19599b61cc,
title = "Roles of voice onset time and F0 in stop consonant voicing perception: Effects of masking noise and low-pass filtering",
abstract = "Purpose: The contributions of voice onset time (VOT) and fundamental frequency (F0) were evaluated for the perception of voicing in syllable-initial stop consonants in words that were low-pass filtered and/or masked by speech-shaped noise. It was expected that listeners would rely less on VOT and more on F0 in these degraded conditions. Method: Twenty young listeners with normal hearing identified modified natural speech tokens that varied by VOT and F0 in several conditions of low-pass filtering and masking noise. Stimuli included /b/-/p/ and /d/-/t/ continua that were presented in separate blocks. Identification results were modeled using mixed-effects logistic regression. Results: When speech was filtered and/or masked by noise, listeners' voicing perceptions were driven less by VOT and more by F0. Speech-shaped masking noise exerted greater effects on the /b/-/p/ contrast, while low-pass filtering exerted greater effects on the /d/-/t/ contrast, consistent with the acoustics of these contrasts. Conclusion: Listeners can adjust their use of acoustic-phonetic cues in a dynamic way that is appropriate for challenging listening conditions; cues that are less influential in ideal conditions can gain priority in challenging conditions.",
keywords = "Bandwidth, Noise, Speech perception, Voicing contrast",
author = "Winn, {Matthew B.} and Monita Chatterjee and Idsardi, {William J.}",
year = "2013",
month = "8",
day = "1",
doi = "10.1044/1092-4388(2012/12-0086)",
language = "English (US)",
volume = "56",
pages = "1097--1107",
journal = "Journal of Speech, Language, and Hearing Research",
issn = "1092-4388",
publisher = "American Speech-Language-Hearing Association (ASHA)",
number = "4",

}

TY - JOUR

T1 - Roles of voice onset time and F0 in stop consonant voicing perception

T2 - Effects of masking noise and low-pass filtering

AU - Winn, Matthew B.

AU - Chatterjee, Monita

AU - Idsardi, William J.

PY - 2013/8/1

Y1 - 2013/8/1

N2 - Purpose: The contributions of voice onset time (VOT) and fundamental frequency (F0) were evaluated for the perception of voicing in syllable-initial stop consonants in words that were low-pass filtered and/or masked by speech-shaped noise. It was expected that listeners would rely less on VOT and more on F0 in these degraded conditions. Method: Twenty young listeners with normal hearing identified modified natural speech tokens that varied by VOT and F0 in several conditions of low-pass filtering and masking noise. Stimuli included /b/-/p/ and /d/-/t/ continua that were presented in separate blocks. Identification results were modeled using mixed-effects logistic regression. Results: When speech was filtered and/or masked by noise, listeners' voicing perceptions were driven less by VOT and more by F0. Speech-shaped masking noise exerted greater effects on the /b/-/p/ contrast, while low-pass filtering exerted greater effects on the /d/-/t/ contrast, consistent with the acoustics of these contrasts. Conclusion: Listeners can adjust their use of acoustic-phonetic cues in a dynamic way that is appropriate for challenging listening conditions; cues that are less influential in ideal conditions can gain priority in challenging conditions.

AB - Purpose: The contributions of voice onset time (VOT) and fundamental frequency (F0) were evaluated for the perception of voicing in syllable-initial stop consonants in words that were low-pass filtered and/or masked by speech-shaped noise. It was expected that listeners would rely less on VOT and more on F0 in these degraded conditions. Method: Twenty young listeners with normal hearing identified modified natural speech tokens that varied by VOT and F0 in several conditions of low-pass filtering and masking noise. Stimuli included /b/-/p/ and /d/-/t/ continua that were presented in separate blocks. Identification results were modeled using mixed-effects logistic regression. Results: When speech was filtered and/or masked by noise, listeners' voicing perceptions were driven less by VOT and more by F0. Speech-shaped masking noise exerted greater effects on the /b/-/p/ contrast, while low-pass filtering exerted greater effects on the /d/-/t/ contrast, consistent with the acoustics of these contrasts. Conclusion: Listeners can adjust their use of acoustic-phonetic cues in a dynamic way that is appropriate for challenging listening conditions; cues that are less influential in ideal conditions can gain priority in challenging conditions.

KW - Bandwidth

KW - Noise

KW - Speech perception

KW - Voicing contrast

UR - http://www.scopus.com/inward/record.url?scp=84881271059&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84881271059&partnerID=8YFLogxK

U2 - 10.1044/1092-4388(2012/12-0086)

DO - 10.1044/1092-4388(2012/12-0086)

M3 - Article

VL - 56

SP - 1097

EP - 1107

JO - Journal of Speech, Language, and Hearing Research

JF - Journal of Speech, Language, and Hearing Research

SN - 1092-4388

IS - 4

ER -