Memory disruption by irrelevant noise-vocoded speech: Effects of native language and the number of frequency bands

Wolfgang Ellermeier, Florian Kattner, Kazuo Ueda, Kana Doumoto, Yoshitaka Nakajima

Research output: Contribution to journalArticle

21 Citations (Scopus)

Abstract

To investigate the mechanisms by which unattended speech impairs short-term memory performance, speech samples were systematically degraded by means of a noise vocoder. For experiment 1, recordings of German and Japanese sentences were passed through a filter bank dividing the spectrum between 50 and 7000 Hz into 20 critical-band channels or combinations of those, yielding 20, 4, 2, or just 1 channel(s) of noise-vocoded speech. Listening tests conducted with native speakers of both languages showed a monotonic decrease in speech intelligibility as the number of frequency channels was reduced. For experiment 2, 40 native German and 40 native Japanese participants were exposed to speech processed in the same manner while trying to memorize visually presented sequences of digits in the correct order. Half of each sample received the German, the other half received the Japanese speech samples. The results show large irrelevant-speech effects increasing in magnitude with the number of frequency channels. The effects are slightly larger when subjects are exposed to their own native language. The results are neither predicted very well by the speech transmission index, nor by psychoacoustical fluctuation strength, most likely, since both metrics fail to disentangle amplitude and frequency modulations in the signals.

Original languageEnglish
Pages (from-to)1561-1569
Number of pages9
JournalThe Journal of the Acoustical Society of America
Volume138
Issue number3
DOIs
Publication statusPublished - Sep 1 2015

Fingerprint

sentences
intelligibility
Native Language
Disruption
digits
frequency modulation
recording
filters
Experiment
Short-term Memory
Fluctuations
Filter
Modulation
Speech Intelligibility
Native Speaker
Language

All Science Journal Classification (ASJC) codes

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Cite this

Memory disruption by irrelevant noise-vocoded speech : Effects of native language and the number of frequency bands. / Ellermeier, Wolfgang; Kattner, Florian; Ueda, Kazuo; Doumoto, Kana; Nakajima, Yoshitaka.

In: The Journal of the Acoustical Society of America, Vol. 138, No. 3, 01.09.2015, p. 1561-1569.

Research output: Contribution to journalArticle

@article{9e765500b3ec4a428ca1eaf9c168879b,
title = "Memory disruption by irrelevant noise-vocoded speech: Effects of native language and the number of frequency bands",
abstract = "To investigate the mechanisms by which unattended speech impairs short-term memory performance, speech samples were systematically degraded by means of a noise vocoder. For experiment 1, recordings of German and Japanese sentences were passed through a filter bank dividing the spectrum between 50 and 7000 Hz into 20 critical-band channels or combinations of those, yielding 20, 4, 2, or just 1 channel(s) of noise-vocoded speech. Listening tests conducted with native speakers of both languages showed a monotonic decrease in speech intelligibility as the number of frequency channels was reduced. For experiment 2, 40 native German and 40 native Japanese participants were exposed to speech processed in the same manner while trying to memorize visually presented sequences of digits in the correct order. Half of each sample received the German, the other half received the Japanese speech samples. The results show large irrelevant-speech effects increasing in magnitude with the number of frequency channels. The effects are slightly larger when subjects are exposed to their own native language. The results are neither predicted very well by the speech transmission index, nor by psychoacoustical fluctuation strength, most likely, since both metrics fail to disentangle amplitude and frequency modulations in the signals.",
author = "Wolfgang Ellermeier and Florian Kattner and Kazuo Ueda and Kana Doumoto and Yoshitaka Nakajima",
year = "2015",
month = "9",
day = "1",
doi = "10.1121/1.4928954",
language = "English",
volume = "138",
pages = "1561--1569",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "3",

}

TY - JOUR

T1 - Memory disruption by irrelevant noise-vocoded speech

T2 - Effects of native language and the number of frequency bands

AU - Ellermeier, Wolfgang

AU - Kattner, Florian

AU - Ueda, Kazuo

AU - Doumoto, Kana

AU - Nakajima, Yoshitaka

PY - 2015/9/1

Y1 - 2015/9/1

N2 - To investigate the mechanisms by which unattended speech impairs short-term memory performance, speech samples were systematically degraded by means of a noise vocoder. For experiment 1, recordings of German and Japanese sentences were passed through a filter bank dividing the spectrum between 50 and 7000 Hz into 20 critical-band channels or combinations of those, yielding 20, 4, 2, or just 1 channel(s) of noise-vocoded speech. Listening tests conducted with native speakers of both languages showed a monotonic decrease in speech intelligibility as the number of frequency channels was reduced. For experiment 2, 40 native German and 40 native Japanese participants were exposed to speech processed in the same manner while trying to memorize visually presented sequences of digits in the correct order. Half of each sample received the German, the other half received the Japanese speech samples. The results show large irrelevant-speech effects increasing in magnitude with the number of frequency channels. The effects are slightly larger when subjects are exposed to their own native language. The results are neither predicted very well by the speech transmission index, nor by psychoacoustical fluctuation strength, most likely, since both metrics fail to disentangle amplitude and frequency modulations in the signals.

AB - To investigate the mechanisms by which unattended speech impairs short-term memory performance, speech samples were systematically degraded by means of a noise vocoder. For experiment 1, recordings of German and Japanese sentences were passed through a filter bank dividing the spectrum between 50 and 7000 Hz into 20 critical-band channels or combinations of those, yielding 20, 4, 2, or just 1 channel(s) of noise-vocoded speech. Listening tests conducted with native speakers of both languages showed a monotonic decrease in speech intelligibility as the number of frequency channels was reduced. For experiment 2, 40 native German and 40 native Japanese participants were exposed to speech processed in the same manner while trying to memorize visually presented sequences of digits in the correct order. Half of each sample received the German, the other half received the Japanese speech samples. The results show large irrelevant-speech effects increasing in magnitude with the number of frequency channels. The effects are slightly larger when subjects are exposed to their own native language. The results are neither predicted very well by the speech transmission index, nor by psychoacoustical fluctuation strength, most likely, since both metrics fail to disentangle amplitude and frequency modulations in the signals.

UR - http://www.scopus.com/inward/record.url?scp=84974533321&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84974533321&partnerID=8YFLogxK

U2 - 10.1121/1.4928954

DO - 10.1121/1.4928954

M3 - Article

C2 - 26428793

AN - SCOPUS:84974533321

VL - 138

SP - 1561

EP - 1569

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 3

ER -