Morphological and acoustic analysis of the vocal tract using a multi-speaker volumetric MRI dataset

Research output: Contribution to journalConference article

2 Citations (Scopus)

Abstract

The shape of the vocal tract was analyzed from both morphological and acoustic perspectives for ten male speakers of Japanese. A volumetric MRI (magnetic resonance imaging) measurement was performed while each speaker uttered each of the five Japanese vowels. The cross-sectional vocal-tract area function was computed from the MRI dataset and the resulting 50 vocal-tract shapes were analyzed statistically to determine the principal deformation patterns. A perturbation of the vocaltract shape was then given for each vowel to examine the effect on the first and second formant frequencies. When the perturbation was given by changing the coefficient values of the first and second principal modes, a local region on the coefficient plane was observed where the formant change was small. In other words, this region was acoustically insensitive to the perturbation of the vocal-tract shape. When the vocal-tract shapes of the ten speakers were marked on the same plot, it was also found that marked vocal-tract shapes were located in the vicinity of the acoustically insensitive region. From these numerical investigations, it was considered how the individual differences in the vocal-tract shape can be resolved to generate phonetically relevant speech sounds.

Original languageEnglish
Pages (from-to)379-383
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2015-January
Publication statusPublished - Jan 1 2015
Event16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: Sep 6 2015Sep 10 2015

Fingerprint

Magnetic Resonance Imaging
Magnetic resonance
Acoustics
acoustics
Imaging techniques
Acoustic waves
perturbation
Perturbation
Individual Differences
Coefficient
analysis
Vocal Tract
Morphological Analysis
Acoustic Analysis
Numerical Investigation

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Cite this

@article{22eb7db8b9844300b122af49b157aec1,
title = "Morphological and acoustic analysis of the vocal tract using a multi-speaker volumetric MRI dataset",
abstract = "The shape of the vocal tract was analyzed from both morphological and acoustic perspectives for ten male speakers of Japanese. A volumetric MRI (magnetic resonance imaging) measurement was performed while each speaker uttered each of the five Japanese vowels. The cross-sectional vocal-tract area function was computed from the MRI dataset and the resulting 50 vocal-tract shapes were analyzed statistically to determine the principal deformation patterns. A perturbation of the vocaltract shape was then given for each vowel to examine the effect on the first and second formant frequencies. When the perturbation was given by changing the coefficient values of the first and second principal modes, a local region on the coefficient plane was observed where the formant change was small. In other words, this region was acoustically insensitive to the perturbation of the vocal-tract shape. When the vocal-tract shapes of the ten speakers were marked on the same plot, it was also found that marked vocal-tract shapes were located in the vicinity of the acoustically insensitive region. From these numerical investigations, it was considered how the individual differences in the vocal-tract shape can be resolved to generate phonetically relevant speech sounds.",
author = "Tokihiko Kaburagi",
year = "2015",
month = "1",
day = "1",
language = "English",
volume = "2015-January",
pages = "379--383",
journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
issn = "2308-457X",

}

TY - JOUR

T1 - Morphological and acoustic analysis of the vocal tract using a multi-speaker volumetric MRI dataset

AU - Kaburagi, Tokihiko

PY - 2015/1/1

Y1 - 2015/1/1

N2 - The shape of the vocal tract was analyzed from both morphological and acoustic perspectives for ten male speakers of Japanese. A volumetric MRI (magnetic resonance imaging) measurement was performed while each speaker uttered each of the five Japanese vowels. The cross-sectional vocal-tract area function was computed from the MRI dataset and the resulting 50 vocal-tract shapes were analyzed statistically to determine the principal deformation patterns. A perturbation of the vocaltract shape was then given for each vowel to examine the effect on the first and second formant frequencies. When the perturbation was given by changing the coefficient values of the first and second principal modes, a local region on the coefficient plane was observed where the formant change was small. In other words, this region was acoustically insensitive to the perturbation of the vocal-tract shape. When the vocal-tract shapes of the ten speakers were marked on the same plot, it was also found that marked vocal-tract shapes were located in the vicinity of the acoustically insensitive region. From these numerical investigations, it was considered how the individual differences in the vocal-tract shape can be resolved to generate phonetically relevant speech sounds.

AB - The shape of the vocal tract was analyzed from both morphological and acoustic perspectives for ten male speakers of Japanese. A volumetric MRI (magnetic resonance imaging) measurement was performed while each speaker uttered each of the five Japanese vowels. The cross-sectional vocal-tract area function was computed from the MRI dataset and the resulting 50 vocal-tract shapes were analyzed statistically to determine the principal deformation patterns. A perturbation of the vocaltract shape was then given for each vowel to examine the effect on the first and second formant frequencies. When the perturbation was given by changing the coefficient values of the first and second principal modes, a local region on the coefficient plane was observed where the formant change was small. In other words, this region was acoustically insensitive to the perturbation of the vocal-tract shape. When the vocal-tract shapes of the ten speakers were marked on the same plot, it was also found that marked vocal-tract shapes were located in the vicinity of the acoustically insensitive region. From these numerical investigations, it was considered how the individual differences in the vocal-tract shape can be resolved to generate phonetically relevant speech sounds.

UR - http://www.scopus.com/inward/record.url?scp=84959119435&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959119435&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84959119435

VL - 2015-January

SP - 379

EP - 383

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

SN - 2308-457X

ER -