Analysis of voice source characteristics using a constrained polynomial model

Tokihiko Kaburagi, Koji Kawai

Research output: Contribution to conferencePaper

2 Citations (Scopus)

Abstract

This paper presents an analysis method of voice source characteristics from speech by simultaneously employing models of the vocal tract and voice source signal. The vocal tract is represented as a linear filter based on the conventional all-pole assumption. On the other hand, the voice source signal is represented by linearly overlapping multiple number of base signals obtained from a generalization of the Rosenberg model. The resulting voice source model is a polynomial function of time and has lesser degrees-of-freedom than the polynomial order. By virtue of the linearity of both models, the optimal values of their parameters can be jointly determined when the instants of the glottal opening and closing are given for each pitch period. We also present a temporal search method of these glottal events using the dynamic programming technique. Finally, experimental results are presented to reveal the applicability of the proposed method for several phonation conditions.

Original languageEnglish
Pages461-464
Number of pages4
Publication statusPublished - Jan 1 2003
Event8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: Sep 1 2003Sep 4 2003

Other

Other8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
CountrySwitzerland
CityGeneva
Period9/1/039/4/03

Fingerprint

Polynomials
Pole
Dynamic programming
programming
Poles
Statistical Models
event
time

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Software
  • Linguistics and Language
  • Communication

Cite this

Kaburagi, T., & Kawai, K. (2003). Analysis of voice source characteristics using a constrained polynomial model. 461-464. Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland.

Analysis of voice source characteristics using a constrained polynomial model. / Kaburagi, Tokihiko; Kawai, Koji.

2003. 461-464 Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland.

Research output: Contribution to conferencePaper

Kaburagi, T & Kawai, K 2003, 'Analysis of voice source characteristics using a constrained polynomial model', Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland, 9/1/03 - 9/4/03 pp. 461-464.
Kaburagi T, Kawai K. Analysis of voice source characteristics using a constrained polynomial model. 2003. Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland.
Kaburagi, Tokihiko ; Kawai, Koji. / Analysis of voice source characteristics using a constrained polynomial model. Paper presented at 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, Geneva, Switzerland.4 p.
@conference{98fa7a0ba5b04a3f95a45d54bfcfc58a,
title = "Analysis of voice source characteristics using a constrained polynomial model",
abstract = "This paper presents an analysis method of voice source characteristics from speech by simultaneously employing models of the vocal tract and voice source signal. The vocal tract is represented as a linear filter based on the conventional all-pole assumption. On the other hand, the voice source signal is represented by linearly overlapping multiple number of base signals obtained from a generalization of the Rosenberg model. The resulting voice source model is a polynomial function of time and has lesser degrees-of-freedom than the polynomial order. By virtue of the linearity of both models, the optimal values of their parameters can be jointly determined when the instants of the glottal opening and closing are given for each pitch period. We also present a temporal search method of these glottal events using the dynamic programming technique. Finally, experimental results are presented to reveal the applicability of the proposed method for several phonation conditions.",
author = "Tokihiko Kaburagi and Koji Kawai",
year = "2003",
month = "1",
day = "1",
language = "English",
pages = "461--464",
note = "8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 ; Conference date: 01-09-2003 Through 04-09-2003",

}

TY - CONF

T1 - Analysis of voice source characteristics using a constrained polynomial model

AU - Kaburagi, Tokihiko

AU - Kawai, Koji

PY - 2003/1/1

Y1 - 2003/1/1

N2 - This paper presents an analysis method of voice source characteristics from speech by simultaneously employing models of the vocal tract and voice source signal. The vocal tract is represented as a linear filter based on the conventional all-pole assumption. On the other hand, the voice source signal is represented by linearly overlapping multiple number of base signals obtained from a generalization of the Rosenberg model. The resulting voice source model is a polynomial function of time and has lesser degrees-of-freedom than the polynomial order. By virtue of the linearity of both models, the optimal values of their parameters can be jointly determined when the instants of the glottal opening and closing are given for each pitch period. We also present a temporal search method of these glottal events using the dynamic programming technique. Finally, experimental results are presented to reveal the applicability of the proposed method for several phonation conditions.

AB - This paper presents an analysis method of voice source characteristics from speech by simultaneously employing models of the vocal tract and voice source signal. The vocal tract is represented as a linear filter based on the conventional all-pole assumption. On the other hand, the voice source signal is represented by linearly overlapping multiple number of base signals obtained from a generalization of the Rosenberg model. The resulting voice source model is a polynomial function of time and has lesser degrees-of-freedom than the polynomial order. By virtue of the linearity of both models, the optimal values of their parameters can be jointly determined when the instants of the glottal opening and closing are given for each pitch period. We also present a temporal search method of these glottal events using the dynamic programming technique. Finally, experimental results are presented to reveal the applicability of the proposed method for several phonation conditions.

UR - http://www.scopus.com/inward/record.url?scp=85009160799&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009160799&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85009160799

SP - 461

EP - 464

ER -