Estimating the vocal-tract area function from formants using a sensitivity function and least square

Tokihiko Kaburagi, Tetsuro Takano, Yuki Sakamoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a method for estimating the vocal-tract area function from specified formant frequencies. The method extends the work of Story (J.A.S.A., 119, 715-718, 1996) based on a sensitivity function representing the change in the formant frequency due to a small perturbation of the cross-sectional area of the vocal tract. Our method estimates the vocal-tract shape through an iterative procedure in which the sensitivity function is used as the basis function to gradually optimize the cross-sectional area that produces the target formant frequencies. In addition, the summing weight of sensitivity functions is determined by minimizing an objective function representing the relative frequency error of every format. We conducted numerical experiments using area function data of English vowels. Results showed that our method can estimate the vocal-tract shape with satisfactory accuracy. In addition, the number of iterative calculations is significantly lower than with Story's original method.

Original languageEnglish
Title of host publication13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages2191-2194
Number of pages4
Publication statusPublished - Dec 1 2012
Event13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: Sep 9 2012Sep 13 2012

Publication series

Name13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Volume3

Other

Other13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
CountryUnited States
CityPortland, OR
Period9/9/129/13/12

Fingerprint

experiment
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Communication

Cite this

Kaburagi, T., Takano, T., & Sakamoto, Y. (2012). Estimating the vocal-tract area function from formants using a sensitivity function and least square. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (pp. 2191-2194). (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Vol. 3).

Estimating the vocal-tract area function from formants using a sensitivity function and least square. / Kaburagi, Tokihiko; Takano, Tetsuro; Sakamoto, Yuki.

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 2012. p. 2191-2194 (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kaburagi, T, Takano, T & Sakamoto, Y 2012, Estimating the vocal-tract area function from formants using a sensitivity function and least square. in 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, vol. 3, pp. 2191-2194, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Portland, OR, United States, 9/9/12.
Kaburagi T, Takano T, Sakamoto Y. Estimating the vocal-tract area function from formants using a sensitivity function and least square. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 2012. p. 2191-2194. (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012).
Kaburagi, Tokihiko ; Takano, Tetsuro ; Sakamoto, Yuki. / Estimating the vocal-tract area function from formants using a sensitivity function and least square. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. 2012. pp. 2191-2194 (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012).
@inproceedings{81418ef54b85445385f52950a2fa360e,
title = "Estimating the vocal-tract area function from formants using a sensitivity function and least square",
abstract = "We present a method for estimating the vocal-tract area function from specified formant frequencies. The method extends the work of Story (J.A.S.A., 119, 715-718, 1996) based on a sensitivity function representing the change in the formant frequency due to a small perturbation of the cross-sectional area of the vocal tract. Our method estimates the vocal-tract shape through an iterative procedure in which the sensitivity function is used as the basis function to gradually optimize the cross-sectional area that produces the target formant frequencies. In addition, the summing weight of sensitivity functions is determined by minimizing an objective function representing the relative frequency error of every format. We conducted numerical experiments using area function data of English vowels. Results showed that our method can estimate the vocal-tract shape with satisfactory accuracy. In addition, the number of iterative calculations is significantly lower than with Story's original method.",
author = "Tokihiko Kaburagi and Tetsuro Takano and Yuki Sakamoto",
year = "2012",
month = "12",
day = "1",
language = "English",
isbn = "9781622767595",
series = "13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012",
pages = "2191--2194",
booktitle = "13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012",

}

TY - GEN

T1 - Estimating the vocal-tract area function from formants using a sensitivity function and least square

AU - Kaburagi, Tokihiko

AU - Takano, Tetsuro

AU - Sakamoto, Yuki

PY - 2012/12/1

Y1 - 2012/12/1

N2 - We present a method for estimating the vocal-tract area function from specified formant frequencies. The method extends the work of Story (J.A.S.A., 119, 715-718, 1996) based on a sensitivity function representing the change in the formant frequency due to a small perturbation of the cross-sectional area of the vocal tract. Our method estimates the vocal-tract shape through an iterative procedure in which the sensitivity function is used as the basis function to gradually optimize the cross-sectional area that produces the target formant frequencies. In addition, the summing weight of sensitivity functions is determined by minimizing an objective function representing the relative frequency error of every format. We conducted numerical experiments using area function data of English vowels. Results showed that our method can estimate the vocal-tract shape with satisfactory accuracy. In addition, the number of iterative calculations is significantly lower than with Story's original method.

AB - We present a method for estimating the vocal-tract area function from specified formant frequencies. The method extends the work of Story (J.A.S.A., 119, 715-718, 1996) based on a sensitivity function representing the change in the formant frequency due to a small perturbation of the cross-sectional area of the vocal tract. Our method estimates the vocal-tract shape through an iterative procedure in which the sensitivity function is used as the basis function to gradually optimize the cross-sectional area that produces the target formant frequencies. In addition, the summing weight of sensitivity functions is determined by minimizing an objective function representing the relative frequency error of every format. We conducted numerical experiments using area function data of English vowels. Results showed that our method can estimate the vocal-tract shape with satisfactory accuracy. In addition, the number of iterative calculations is significantly lower than with Story's original method.

UR - http://www.scopus.com/inward/record.url?scp=84878549163&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878549163&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84878549163

SN - 9781622767595

T3 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

SP - 2191

EP - 2194

BT - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

ER -