Estimating area function of the vocal tract from formants using a sensitivity function and least-squares

Tokihiko Kaburagi, Tetsuro Takano, Yuki Sakamoto

Research output: Journal article

5 citations (Scopus)

Abstract

We present methods for estimating the cross-sectional area function of the vocal tract from formant frequencies. They extend the work of Story (J. Acoust. Soc. Am., 119, 715-718, 2006) based on a sensitivity function representing the change in the formant frequency due to a perturbation of the cross-sectional area. In Method I, the area function is estimated through an iterative procedure that uses the sensitivity function as the basis function to optimize the area function that produces the target frequencies. In Method II, a mode function linearly expands the area function. The estimation is performed by optimizing the value of each mode coefficient, where the sensitivity function is used as a constraint in the optimization. As a specific feature, the summing weight of sensitivity functions in Method I and mode functions in Method II is determined by minimizing an objective function representing the frequency error of every formant. By using existing area function data for English vowels, we compare the performance of each method with respect to the estimation accuracy and convergence speed. The results show that our methods can effectively reduce degrees of freedom of the area function and quickly obtain the optimal solution with fair accuracy.
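The idea behind Method I can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the forward model is a toy lossless tube (closed at the glottis, ideally open at the lips), the sensitivity function is obtained by finite differences, and a backtracking step is added for robustness. All names (`formants`, `sensitivity`, `estimate_area`, `C_SOUND`) are hypothetical. The area update is a weighted sum of sensitivity functions, with the weights chosen by least squares on a linearized formant error. The sketch also assumes that the requested number of formants lies below `f_max`.

```python
import numpy as np

C_SOUND = 35000.0  # speed of sound in cm/s (assumed value)


def _d_term(freq, areas, seg):
    """D element of the glottis-to-lips chain matrix of a lossless tube."""
    theta = 2.0 * np.pi * np.asarray(freq, dtype=float) * seg / C_SOUND
    c, s = np.cos(theta), np.sin(theta)
    # Propagate the second row of the product of per-section matrices
    # [[c, -s/a], [a*s, c]] (real-valued form, constant factors dropped).
    r0, r1 = np.zeros_like(theta), np.ones_like(theta)
    for a in areas:
        r0, r1 = r0 * c + r1 * a * s, -r0 * s / a + r1 * c
    return r1


def formants(areas, length_cm, n_formants=3, f_max=6000.0, step=10.0):
    """Lowest resonances in Hz, found as zeros of the D element."""
    areas = np.asarray(areas, dtype=float)
    seg = length_cm / len(areas)
    grid = np.arange(step / 2.0, f_max, step)
    d = _d_term(grid, areas, seg)
    idx = np.nonzero(d[:-1] * d[1:] < 0.0)[0][:n_formants]
    # Refine all bracketed roots at once by bisection.
    lo, hi, d_lo = grid[idx], grid[idx + 1], d[idx]
    for _ in range(40):
        mid = 0.5 * (lo + hi)
        d_mid = _d_term(mid, areas, seg)
        left = d_lo * d_mid <= 0.0  # root lies in [lo, mid]
        hi = np.where(left, mid, hi)
        lo = np.where(left, lo, mid)
        d_lo = np.where(left, d_lo, d_mid)
    return 0.5 * (lo + hi)


def sensitivity(areas, length_cm, n_formants=3, rel_step=1e-3):
    """S[n, i] = (dF_n / F_n) / (dA_i / A_i), by forward differences."""
    areas = np.asarray(areas, dtype=float)
    base = formants(areas, length_cm, n_formants)
    sens = np.empty((n_formants, len(areas)))
    for i in range(len(areas)):
        bumped = areas.copy()
        bumped[i] *= 1.0 + rel_step
        shifted = formants(bumped, length_cm, n_formants)
        sens[:, i] = (shifted - base) / (base * rel_step)
    return base, sens


def estimate_area(target, start, length_cm, n_iter=15, min_area=0.1):
    """Iteratively add a least-squares weighted sum of sensitivity functions."""
    target = np.asarray(target, dtype=float)
    areas = np.asarray(start, dtype=float).copy()
    for _ in range(n_iter):
        current, sens = sensitivity(areas, length_cm, len(target))
        err = target - current
        # Linearization: F(areas + sens.T @ w) ~ current + jac @ sens.T @ w
        jac = sens * current[:, None] / areas[None, :]
        w, *_ = np.linalg.lstsq(jac @ sens.T, err, rcond=None)
        step = 1.0
        while step > 1e-3:  # backtracking: accept only if the error drops
            trial = np.maximum(areas + step * (sens.T @ w), min_area)
            trial_err = target - formants(trial, length_cm, len(target))
            if np.linalg.norm(trial_err) < np.linalg.norm(err):
                areas = trial
                break
            step *= 0.5
    return areas
```

A call such as `estimate_area(target, np.full(10, 3.0), 17.0)` starts from a uniform 17 cm tube with ten sections. Because only as many weights as formants are optimized per iteration, the search space is far smaller than the number of tube sections, which is the reduction in degrees of freedom the abstract refers to. The solution is generally not unique, so the estimated areas need not match the shape that generated the target formants.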

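Original language: English
Pages (from-to): 301-310
Number of pages: 10
Journal: Acoustical Science and Technology
Volume: 34
Issue number: 5
DOI: 10.1250/ast.34.301
Publication status: Published - 16 Sep 2013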

Fingerprint

estimating
sensitivity
vowels
degrees of freedom
perturbation
optimization

All Science Journal Classification (ASJC) codes

  • Acoustics and Ultrasonics

Cite this

Estimating area function of the vocal tract from formants using a sensitivity function and least-squares. / Kaburagi, Tokihiko; Takano, Tetsuro; Sakamoto, Yuki.

In: Acoustical Science and Technology, Vol. 34, No. 5, 16.09.2013, p. 301-310.

@article{6f22c225971a44349eb133a2c76fce03,
title = "Estimating area function of the vocal tract from formants using a sensitivity function and least-squares",
abstract = "We present methods for estimating the cross-sectional area function of the vocal tract from formant frequencies. They extend the work of Story (J. Acoust. Soc. Am., 119, 715-718, 2006) based on a sensitivity function representing the change in the formant frequency due to a perturbation of the cross-sectional area. In Method I, the area function is estimated through an iterative procedure that uses the sensitivity function as the basis function to optimize the area function that produces the target frequencies. In Method II, a mode function linearly expands the area function. The estimation is performed by optimizing the value of each mode coefficient, where the sensitivity function is used as a constraint in the optimization. As a specific feature, the summing weight of sensitivity functions in Method I and mode functions in Method II is determined by minimizing an objective function representing the frequency error of every formant. By using existing area function data for English vowels, we compare the performance of each method with respect to the estimation accuracy and convergence speed. The results show that our methods can effectively reduce degrees of freedom of the area function and quickly obtain the optimal solution with fair accuracy.",
author = "Tokihiko Kaburagi and Tetsuro Takano and Yuki Sakamoto",
year = "2013",
month = "9",
day = "16",
doi = "10.1250/ast.34.301",
language = "English",
volume = "34",
pages = "301--310",
journal = "Acoustical Science and Technology",
issn = "1346-3969",
publisher = "The Acoustical Society of Japan (ASJ)",
number = "5",

}

TY - JOUR

T1 - Estimating area function of the vocal tract from formants using a sensitivity function and least-squares

AU - Kaburagi, Tokihiko

AU - Takano, Tetsuro

AU - Sakamoto, Yuki

PY - 2013/9/16

Y1 - 2013/9/16

N2 - We present methods for estimating the cross-sectional area function of the vocal tract from formant frequencies. They extend the work of Story (J. Acoust. Soc. Am., 119, 715-718, 2006) based on a sensitivity function representing the change in the formant frequency due to a perturbation of the cross-sectional area. In Method I, the area function is estimated through an iterative procedure that uses the sensitivity function as the basis function to optimize the area function that produces the target frequencies. In Method II, a mode function linearly expands the area function. The estimation is performed by optimizing the value of each mode coefficient, where the sensitivity function is used as a constraint in the optimization. As a specific feature, the summing weight of sensitivity functions in Method I and mode functions in Method II is determined by minimizing an objective function representing the frequency error of every formant. By using existing area function data for English vowels, we compare the performance of each method with respect to the estimation accuracy and convergence speed. The results show that our methods can effectively reduce degrees of freedom of the area function and quickly obtain the optimal solution with fair accuracy.

UR - http://www.scopus.com/inward/record.url?scp=84883703693&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84883703693&partnerID=8YFLogxK

U2 - 10.1250/ast.34.301

DO - 10.1250/ast.34.301

M3 - Article

VL - 34

SP - 301

EP - 310

JO - Acoustical Science and Technology

JF - Acoustical Science and Technology

SN - 1346-3969

IS - 5

ER -