An acoustic key to eight languages/dialects

Factor analyses of critical-band-filtered speech

Research output: Contribution to journalArticle

6 Citations (Scopus)

Abstract

The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands - covering approximately 50-540, 540-1,700, 1,700-3,300, and above 3,300 Hz - from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated - the low &mid-high factor related to the two separate frequency ranges of 50-540 and 1,700-3,300 Hz, the mid-low factor the range of 540-1,700 Hz, and the high factor the range above 3,300 Hz - in these different languages/dialects, suggesting a language universal.

Original languageEnglish
JournalScientific reports
Volume7
Issue number42468
DOIs
Publication statusPublished - Feb 15 2017

Fingerprint

Acoustics
Frequency bands
Speech communication
Bandpass filters

All Science Journal Classification (ASJC) codes

  • General

Cite this

@article{e06d3fa08fd34e3bb21cee5fd37b9f2a,
title = "An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech",
abstract = "The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands - covering approximately 50-540, 540-1,700, 1,700-3,300, and above 3,300 Hz - from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated - the low &mid-high factor related to the two separate frequency ranges of 50-540 and 1,700-3,300 Hz, the mid-low factor the range of 540-1,700 Hz, and the high factor the range above 3,300 Hz - in these different languages/dialects, suggesting a language universal.",
author = "Kazuo Ueda and Yoshitaka Nakajima",
year = "2017",
month = "2",
day = "15",
doi = "10.1038/srep42468",
language = "English",
volume = "7",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "Nature Publishing Group",
number = "42468",

}

TY - JOUR

T1 - An acoustic key to eight languages/dialects

T2 - Factor analyses of critical-band-filtered speech

AU - Ueda, Kazuo

AU - Nakajima, Yoshitaka

PY - 2017/2/15

Y1 - 2017/2/15

N2 - The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands - covering approximately 50-540, 540-1,700, 1,700-3,300, and above 3,300 Hz - from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated - the low &mid-high factor related to the two separate frequency ranges of 50-540 and 1,700-3,300 Hz, the mid-low factor the range of 540-1,700 Hz, and the high factor the range above 3,300 Hz - in these different languages/dialects, suggesting a language universal.

AB - The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands - covering approximately 50-540, 540-1,700, 1,700-3,300, and above 3,300 Hz - from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated - the low &mid-high factor related to the two separate frequency ranges of 50-540 and 1,700-3,300 Hz, the mid-low factor the range of 540-1,700 Hz, and the high factor the range above 3,300 Hz - in these different languages/dialects, suggesting a language universal.

UR - http://www.scopus.com/inward/record.url?scp=85013020595&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85013020595&partnerID=8YFLogxK

U2 - 10.1038/srep42468

DO - 10.1038/srep42468

M3 - Article

VL - 7

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

IS - 42468

ER -