Phase effects in masking by harmonic complexes

Speech recognition

Mickael L D Deroche, John F. Culling, Monita Chatterjee

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Harmonic complexes that generate highly modulated temporal envelopes on the basilar membrane (BM) mask a tone less effectively than complexes that generate relatively flat temporal envelopes, because the non-linear active gain of the BM selectively amplifies a low-level tone in the dips of a modulated masker envelope. The present study examines a similar effect in speech recognition. Speech reception thresholds (SRTs) were measured for a voice masked by harmonic complexes with partials in sine phase (SP) or in random phase (RP). The masker's fundamental frequency (F0) was 50, 100 or 200Hz. SRTs were considerably lower for SP than for RP maskers at 50-Hz F0, but the two converged at 100-Hz F0, while at 200-Hz F0, SRTs were a little higher for SP than RP maskers. The results were similar whether the target voice was male or female and whether the masker's spectral profile was flat or speech-shaped. Although listening in the masker dips has been shown to play a large role for artificial stimuli such as Schroeder-phase complexes at high levels, it contributes weakly to speech recognition in the presence of harmonic maskers with different crest factors at more moderate sound levels (65dB SPL).

Original languageEnglish (US)
Pages (from-to)54-62
Number of pages9
JournalHearing Research
Volume306
DOIs
StatePublished - Dec 1 2013

Fingerprint

Basilar Membrane
Masks

ASJC Scopus subject areas

  • Sensory Systems

Cite this

Phase effects in masking by harmonic complexes : Speech recognition. / Deroche, Mickael L D; Culling, John F.; Chatterjee, Monita.

In: Hearing Research, Vol. 306, 01.12.2013, p. 54-62.

Research output: Contribution to journalArticle

Deroche, Mickael L D ; Culling, John F. ; Chatterjee, Monita. / Phase effects in masking by harmonic complexes : Speech recognition. In: Hearing Research. 2013 ; Vol. 306. pp. 54-62.
@article{aeb65688a378482793ff51a4f7c0e224,
title = "Phase effects in masking by harmonic complexes: Speech recognition",
abstract = "Harmonic complexes that generate highly modulated temporal envelopes on the basilar membrane (BM) mask a tone less effectively than complexes that generate relatively flat temporal envelopes, because the non-linear active gain of the BM selectively amplifies a low-level tone in the dips of a modulated masker envelope. The present study examines a similar effect in speech recognition. Speech reception thresholds (SRTs) were measured for a voice masked by harmonic complexes with partials in sine phase (SP) or in random phase (RP). The masker's fundamental frequency (F0) was 50, 100 or 200Hz. SRTs were considerably lower for SP than for RP maskers at 50-Hz F0, but the two converged at 100-Hz F0, while at 200-Hz F0, SRTs were a little higher for SP than RP maskers. The results were similar whether the target voice was male or female and whether the masker's spectral profile was flat or speech-shaped. Although listening in the masker dips has been shown to play a large role for artificial stimuli such as Schroeder-phase complexes at high levels, it contributes weakly to speech recognition in the presence of harmonic maskers with different crest factors at more moderate sound levels (65dB SPL).",
author = "Deroche, {Mickael L D} and Culling, {John F.} and Monita Chatterjee",
year = "2013",
month = "12",
day = "1",
doi = "10.1016/j.heares.2013.09.008",
language = "English (US)",
volume = "306",
pages = "54--62",
journal = "Hearing Research",
issn = "0378-5955",
publisher = "Elsevier",

}

TY - JOUR

T1 - Phase effects in masking by harmonic complexes

T2 - Speech recognition

AU - Deroche, Mickael L D

AU - Culling, John F.

AU - Chatterjee, Monita

PY - 2013/12/1

Y1 - 2013/12/1

N2 - Harmonic complexes that generate highly modulated temporal envelopes on the basilar membrane (BM) mask a tone less effectively than complexes that generate relatively flat temporal envelopes, because the non-linear active gain of the BM selectively amplifies a low-level tone in the dips of a modulated masker envelope. The present study examines a similar effect in speech recognition. Speech reception thresholds (SRTs) were measured for a voice masked by harmonic complexes with partials in sine phase (SP) or in random phase (RP). The masker's fundamental frequency (F0) was 50, 100 or 200Hz. SRTs were considerably lower for SP than for RP maskers at 50-Hz F0, but the two converged at 100-Hz F0, while at 200-Hz F0, SRTs were a little higher for SP than RP maskers. The results were similar whether the target voice was male or female and whether the masker's spectral profile was flat or speech-shaped. Although listening in the masker dips has been shown to play a large role for artificial stimuli such as Schroeder-phase complexes at high levels, it contributes weakly to speech recognition in the presence of harmonic maskers with different crest factors at more moderate sound levels (65dB SPL).

AB - Harmonic complexes that generate highly modulated temporal envelopes on the basilar membrane (BM) mask a tone less effectively than complexes that generate relatively flat temporal envelopes, because the non-linear active gain of the BM selectively amplifies a low-level tone in the dips of a modulated masker envelope. The present study examines a similar effect in speech recognition. Speech reception thresholds (SRTs) were measured for a voice masked by harmonic complexes with partials in sine phase (SP) or in random phase (RP). The masker's fundamental frequency (F0) was 50, 100 or 200Hz. SRTs were considerably lower for SP than for RP maskers at 50-Hz F0, but the two converged at 100-Hz F0, while at 200-Hz F0, SRTs were a little higher for SP than RP maskers. The results were similar whether the target voice was male or female and whether the masker's spectral profile was flat or speech-shaped. Although listening in the masker dips has been shown to play a large role for artificial stimuli such as Schroeder-phase complexes at high levels, it contributes weakly to speech recognition in the presence of harmonic maskers with different crest factors at more moderate sound levels (65dB SPL).

UR - http://www.scopus.com/inward/record.url?scp=84885030798&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84885030798&partnerID=8YFLogxK

U2 - 10.1016/j.heares.2013.09.008

DO - 10.1016/j.heares.2013.09.008

M3 - Article

VL - 306

SP - 54

EP - 62

JO - Hearing Research

JF - Hearing Research

SN - 0378-5955

ER -