The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Ueda and Nakajima [Trans. Tech. Comm. Psychol. Physiol. Acoust., 38, 771-776, (2008); 39, 211-216, (2009)] found a consistent clustering of frequency bands common to different languages through factor analyses applied to power fluctuations of critical-band filtered speech sounds. One of the factors exhibited a characteristic shape of two peaks, which implied a correlation between a pair of distant frequency bands. The present study examined how amplitude envelope independence across frequency bands affected perception of Japanese noise-vocoded speech. The results indicated that the 20- and 4-band-synthesis conditions exhibited nearly perfect performances without any systematic training or feedback, and that the conditions in which the lowest and the next lowest frequency band blended, keeping a long-term spectrum energy distribution (sharpness) constant, yielded low mora accuracy. Those results indicated that noise-vocoded speech synthesized with the 4 frequency bands contained enough information for speech perception.

Original languageEnglish
Title of host publication8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics
Volume31
EditionPART 3
Publication statusPublished - 2009
Event8th European Conference on Noise Control 2009, EURONOISE 2009 - Edinburgh, United Kingdom
Duration: Oct 26 2009Oct 28 2009

Other

Other8th European Conference on Noise Control 2009, EURONOISE 2009
CountryUnited Kingdom
CityEdinburgh
Period10/26/0910/28/09

Fingerprint

envelopes
sharpness
energy distribution
education
low frequencies
acoustics
synthesis

All Science Journal Classification (ASJC) codes

  • Acoustics and Ultrasonics

Cite this

Ueda, K., Araki, T., & Nakajima, Y. (2009). The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech. In 8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics (PART 3 ed., Vol. 31)

The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech. / Ueda, Kazuo; Araki, Tomoya; Nakajima, Yoshitaka.

8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics. Vol. 31 PART 3. ed. 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ueda, K, Araki, T & Nakajima, Y 2009, The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech. in 8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics. PART 3 edn, vol. 31, 8th European Conference on Noise Control 2009, EURONOISE 2009, Edinburgh, United Kingdom, 10/26/09.
Ueda K, Araki T, Nakajima Y. The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech. In 8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics. PART 3 ed. Vol. 31. 2009
Ueda, Kazuo ; Araki, Tomoya ; Nakajima, Yoshitaka. / The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech. 8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics. Vol. 31 PART 3. ed. 2009.
@inproceedings{c1053cbf22fd4a738b2dd32d24a1735b,
title = "The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech",
abstract = "Ueda and Nakajima [Trans. Tech. Comm. Psychol. Physiol. Acoust., 38, 771-776, (2008); 39, 211-216, (2009)] found a consistent clustering of frequency bands common to different languages through factor analyses applied to power fluctuations of critical-band filtered speech sounds. One of the factors exhibited a characteristic shape of two peaks, which implied a correlation between a pair of distant frequency bands. The present study examined how amplitude envelope independence across frequency bands affected perception of Japanese noise-vocoded speech. The results indicated that the 20- and 4-band-synthesis conditions exhibited nearly perfect performances without any systematic training or feedback, and that the conditions in which the lowest and the next lowest frequency band blended, keeping a long-term spectrum energy distribution (sharpness) constant, yielded low mora accuracy. Those results indicated that noise-vocoded speech synthesized with the 4 frequency bands contained enough information for speech perception.",
author = "Kazuo Ueda and Tomoya Araki and Yoshitaka Nakajima",
year = "2009",
language = "English",
isbn = "9781615676804",
volume = "31",
booktitle = "8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics",
edition = "PART 3",

}

TY - GEN

T1 - The effect of amplitude envelope blending across frequency bands on the quality of noise-vocoded speech

AU - Ueda, Kazuo

AU - Araki, Tomoya

AU - Nakajima, Yoshitaka

PY - 2009

Y1 - 2009

N2 - Ueda and Nakajima [Trans. Tech. Comm. Psychol. Physiol. Acoust., 38, 771-776, (2008); 39, 211-216, (2009)] found a consistent clustering of frequency bands common to different languages through factor analyses applied to power fluctuations of critical-band filtered speech sounds. One of the factors exhibited a characteristic shape of two peaks, which implied a correlation between a pair of distant frequency bands. The present study examined how amplitude envelope independence across frequency bands affected perception of Japanese noise-vocoded speech. The results indicated that the 20- and 4-band-synthesis conditions exhibited nearly perfect performances without any systematic training or feedback, and that the conditions in which the lowest and the next lowest frequency band blended, keeping a long-term spectrum energy distribution (sharpness) constant, yielded low mora accuracy. Those results indicated that noise-vocoded speech synthesized with the 4 frequency bands contained enough information for speech perception.

AB - Ueda and Nakajima [Trans. Tech. Comm. Psychol. Physiol. Acoust., 38, 771-776, (2008); 39, 211-216, (2009)] found a consistent clustering of frequency bands common to different languages through factor analyses applied to power fluctuations of critical-band filtered speech sounds. One of the factors exhibited a characteristic shape of two peaks, which implied a correlation between a pair of distant frequency bands. The present study examined how amplitude envelope independence across frequency bands affected perception of Japanese noise-vocoded speech. The results indicated that the 20- and 4-band-synthesis conditions exhibited nearly perfect performances without any systematic training or feedback, and that the conditions in which the lowest and the next lowest frequency band blended, keeping a long-term spectrum energy distribution (sharpness) constant, yielded low mora accuracy. Those results indicated that noise-vocoded speech synthesized with the 4 frequency bands contained enough information for speech perception.

UR - http://www.scopus.com/inward/record.url?scp=84864717126&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84864717126&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781615676804

VL - 31

BT - 8th European Conference on Noise Control 2009, EURONOISE 2009 - Proceedings of the Institute of Acoustics

ER -