Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract

Kazuya Yokota, Satoshi Ishikawa, Shinya Kijimoto, Yosuke Koba, Shohei Sugiki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Laryngeal cancer and voice disturbance share the initial symptom of hoarseness of voice. By analyzing changes in the voice, these diseases might be diagnosed. Because it is important to detect these diseases as early as possible, there is demand for a simple and highly accurate diagnostic method. The GRBAS (grade, roughness, breathiness, asthenia, strain) scale is used as an acoustic diagnosis method for these disorders but its objectivity is not well established. Instead, more accurate diagnosis is possible by capturing the waveform of the flow velocity at the vocal cords. The aim of this study was to enable diagnosis of laryngeal cancer and voice disturbance by identifying the sound-source waveform from voice measurements. For acoustic analysis of the vocal tract, we modeled the air in the vocal tract as concentrated masses connected by linear springs and dampers. The vocal tract shape was identified by making the natural frequencies of the analytical model correspond to the measured formant frequencies. The sound-source waveform was calculated from the measured voice waveform. To assess the validity of the model, we measured actual voices and used the model to identify the vocal tract shapes and corresponding sound-source waveforms. The identified waveforms were similar to the Rosenberg wave, which is an approximation of the actual vocal sound-source waveform. Because of the influence of local solutions, multiple vocal tract shapes could be identified from a single sample. However, mathematical analysis showed that these differed in amplitude of sound-source waveform only, which does not affect the shape of the waveform. From this, we conclude that our proposed methods are valid.

Original languageEnglish
Title of host publication25th International Congress on Sound and Vibration 2018, ICSV 2018
Subtitle of host publicationHiroshima Calling
PublisherInternational Institute of Acoustics and Vibration, IIAV
Pages3578-3585
Number of pages8
ISBN (Electronic)9781510868458
Publication statusPublished - Jan 1 2018
Event25th International Congress on Sound and Vibration 2018: Hiroshima Calling, ICSV 2018 - Hiroshima, Japan
Duration: Jul 8 2018Jul 12 2018

Publication series

Name25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling
Volume6

Conference

Conference25th International Congress on Sound and Vibration 2018: Hiroshima Calling, ICSV 2018
CountryJapan
CityHiroshima
Period7/8/187/12/18

Fingerprint

waveforms
disturbances
acoustics
vocal cords
cancer
applications of mathematics
dampers
resonant frequencies
grade
roughness
flow velocity
disorders
air
approximation

All Science Journal Classification (ASJC) codes

  • Acoustics and Ultrasonics

Cite this

Yokota, K., Ishikawa, S., Kijimoto, S., Koba, Y., & Sugiki, S. (2018). Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract. In 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling (pp. 3578-3585). (25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling; Vol. 6). International Institute of Acoustics and Vibration, IIAV.

Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract. / Yokota, Kazuya; Ishikawa, Satoshi; Kijimoto, Shinya; Koba, Yosuke; Sugiki, Shohei.

25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling. International Institute of Acoustics and Vibration, IIAV, 2018. p. 3578-3585 (25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling; Vol. 6).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yokota, K, Ishikawa, S, Kijimoto, S, Koba, Y & Sugiki, S 2018, Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract. in 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling. 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling, vol. 6, International Institute of Acoustics and Vibration, IIAV, pp. 3578-3585, 25th International Congress on Sound and Vibration 2018: Hiroshima Calling, ICSV 2018, Hiroshima, Japan, 7/8/18.
Yokota K, Ishikawa S, Kijimoto S, Koba Y, Sugiki S. Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract. In 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling. International Institute of Acoustics and Vibration, IIAV. 2018. p. 3578-3585. (25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling).
Yokota, Kazuya ; Ishikawa, Satoshi ; Kijimoto, Shinya ; Koba, Yosuke ; Sugiki, Shohei. / Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract. 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling. International Institute of Acoustics and Vibration, IIAV, 2018. pp. 3578-3585 (25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling).
@inproceedings{a900f6be53bc45c9a8d43e570d416cac,
title = "Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract",
abstract = "Laryngeal cancer and voice disturbance share the initial symptom of hoarseness of voice. By analyzing changes in the voice, these diseases might be diagnosed. Because it is important to detect these diseases as early as possible, there is demand for a simple and highly accurate diagnostic method. The GRBAS (grade, roughness, breathiness, asthenia, strain) scale is used as an acoustic diagnosis method for these disorders but its objectivity is not well established. Instead, more accurate diagnosis is possible by capturing the waveform of the flow velocity at the vocal cords. The aim of this study was to enable diagnosis of laryngeal cancer and voice disturbance by identifying the sound-source waveform from voice measurements. For acoustic analysis of the vocal tract, we modeled the air in the vocal tract as concentrated masses connected by linear springs and dampers. The vocal tract shape was identified by making the natural frequencies of the analytical model correspond to the measured formant frequencies. The sound-source waveform was calculated from the measured voice waveform. To assess the validity of the model, we measured actual voices and used the model to identify the vocal tract shapes and corresponding sound-source waveforms. The identified waveforms were similar to the Rosenberg wave, which is an approximation of the actual vocal sound-source waveform. Because of the influence of local solutions, multiple vocal tract shapes could be identified from a single sample. However, mathematical analysis showed that these differed in amplitude of sound-source waveform only, which does not affect the shape of the waveform. From this, we conclude that our proposed methods are valid.",
author = "Kazuya Yokota and Satoshi Ishikawa and Shinya Kijimoto and Yosuke Koba and Shohei Sugiki",
year = "2018",
month = "1",
day = "1",
language = "English",
series = "25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling",
publisher = "International Institute of Acoustics and Vibration, IIAV",
pages = "3578--3585",
booktitle = "25th International Congress on Sound and Vibration 2018, ICSV 2018",

}

TY - GEN

T1 - Acoustic diagnosis of voice disturbance using an analytical model of the vocal tract

AU - Yokota, Kazuya

AU - Ishikawa, Satoshi

AU - Kijimoto, Shinya

AU - Koba, Yosuke

AU - Sugiki, Shohei

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Laryngeal cancer and voice disturbance share the initial symptom of hoarseness of voice. By analyzing changes in the voice, these diseases might be diagnosed. Because it is important to detect these diseases as early as possible, there is demand for a simple and highly accurate diagnostic method. The GRBAS (grade, roughness, breathiness, asthenia, strain) scale is used as an acoustic diagnosis method for these disorders but its objectivity is not well established. Instead, more accurate diagnosis is possible by capturing the waveform of the flow velocity at the vocal cords. The aim of this study was to enable diagnosis of laryngeal cancer and voice disturbance by identifying the sound-source waveform from voice measurements. For acoustic analysis of the vocal tract, we modeled the air in the vocal tract as concentrated masses connected by linear springs and dampers. The vocal tract shape was identified by making the natural frequencies of the analytical model correspond to the measured formant frequencies. The sound-source waveform was calculated from the measured voice waveform. To assess the validity of the model, we measured actual voices and used the model to identify the vocal tract shapes and corresponding sound-source waveforms. The identified waveforms were similar to the Rosenberg wave, which is an approximation of the actual vocal sound-source waveform. Because of the influence of local solutions, multiple vocal tract shapes could be identified from a single sample. However, mathematical analysis showed that these differed in amplitude of sound-source waveform only, which does not affect the shape of the waveform. From this, we conclude that our proposed methods are valid.

AB - Laryngeal cancer and voice disturbance share the initial symptom of hoarseness of voice. By analyzing changes in the voice, these diseases might be diagnosed. Because it is important to detect these diseases as early as possible, there is demand for a simple and highly accurate diagnostic method. The GRBAS (grade, roughness, breathiness, asthenia, strain) scale is used as an acoustic diagnosis method for these disorders but its objectivity is not well established. Instead, more accurate diagnosis is possible by capturing the waveform of the flow velocity at the vocal cords. The aim of this study was to enable diagnosis of laryngeal cancer and voice disturbance by identifying the sound-source waveform from voice measurements. For acoustic analysis of the vocal tract, we modeled the air in the vocal tract as concentrated masses connected by linear springs and dampers. The vocal tract shape was identified by making the natural frequencies of the analytical model correspond to the measured formant frequencies. The sound-source waveform was calculated from the measured voice waveform. To assess the validity of the model, we measured actual voices and used the model to identify the vocal tract shapes and corresponding sound-source waveforms. The identified waveforms were similar to the Rosenberg wave, which is an approximation of the actual vocal sound-source waveform. Because of the influence of local solutions, multiple vocal tract shapes could be identified from a single sample. However, mathematical analysis showed that these differed in amplitude of sound-source waveform only, which does not affect the shape of the waveform. From this, we conclude that our proposed methods are valid.

UR - http://www.scopus.com/inward/record.url?scp=85058800188&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058800188&partnerID=8YFLogxK

M3 - Conference contribution

T3 - 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling

SP - 3578

EP - 3585

BT - 25th International Congress on Sound and Vibration 2018, ICSV 2018

PB - International Institute of Acoustics and Vibration, IIAV

ER -