Speaker normalization based on piecewise linear frequency warping

Kei Yamada, Seiichi Uchida, Hiroaki Sakoe

Research output: Contribution to journalArticle

Abstract

An efficient algorithm for speaker-independent spoken word recognition is presented. This algorithm is based on the time-frequency warping with inter-frame consistency, where each frame of an input pattern is mapped to a reference pattern by controlling the mapping of several points (pivots) on the frame. The mapping of non-pivot points is given by linear interpolation between mapping of two consecutive pivots. The optimal mapping is obtained by using a dynamic programming based algorithm. The computational complexity of the algorithm is reduced to less than that of the previous time-frequency warping algorithm with inter-frame consistency. Experimental results show advantageous characteristics of the present algorithm.

Original languageEnglish
Pages (from-to)91-92
Number of pages2
JournalResearch Reports on Information Science and Electrical Engineering of Kyushu University
Volume6
Issue number1
Publication statusPublished - Jan 1 2001

Fingerprint

Dynamic programming
Computational complexity
Interpolation

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
  • Electrical and Electronic Engineering

Cite this

Speaker normalization based on piecewise linear frequency warping. / Yamada, Kei; Uchida, Seiichi; Sakoe, Hiroaki.

In: Research Reports on Information Science and Electrical Engineering of Kyushu University, Vol. 6, No. 1, 01.01.2001, p. 91-92.

Research output: Contribution to journalArticle

@article{31fe32d2ec154a69a215df1c7d16d2ad,
title = "Speaker normalization based on piecewise linear frequency warping",
abstract = "An efficient algorithm for speaker-independent spoken word recognition is presented. This algorithm is based on the time-frequency warping with inter-frame consistency, where each frame of an input pattern is mapped to a reference pattern by controlling the mapping of several points (pivots) on the frame. The mapping of non-pivot points is given by linear interpolation between mapping of two consecutive pivots. The optimal mapping is obtained by using a dynamic programming based algorithm. The computational complexity of the algorithm is reduced to less than that of the previous time-frequency warping algorithm with inter-frame consistency. Experimental results show advantageous characteristics of the present algorithm.",
author = "Kei Yamada and Seiichi Uchida and Hiroaki Sakoe",
year = "2001",
month = "1",
day = "1",
language = "English",
volume = "6",
pages = "91--92",
journal = "Research Reports on Information Science and Electrical Engineering of Kyushu University",
issn = "1342-3819",
publisher = "Kyushu University, Faculty of Science",
number = "1",

}

TY - JOUR

T1 - Speaker normalization based on piecewise linear frequency warping

AU - Yamada, Kei

AU - Uchida, Seiichi

AU - Sakoe, Hiroaki

PY - 2001/1/1

Y1 - 2001/1/1

N2 - An efficient algorithm for speaker-independent spoken word recognition is presented. This algorithm is based on the time-frequency warping with inter-frame consistency, where each frame of an input pattern is mapped to a reference pattern by controlling the mapping of several points (pivots) on the frame. The mapping of non-pivot points is given by linear interpolation between mapping of two consecutive pivots. The optimal mapping is obtained by using a dynamic programming based algorithm. The computational complexity of the algorithm is reduced to less than that of the previous time-frequency warping algorithm with inter-frame consistency. Experimental results show advantageous characteristics of the present algorithm.

AB - An efficient algorithm for speaker-independent spoken word recognition is presented. This algorithm is based on the time-frequency warping with inter-frame consistency, where each frame of an input pattern is mapped to a reference pattern by controlling the mapping of several points (pivots) on the frame. The mapping of non-pivot points is given by linear interpolation between mapping of two consecutive pivots. The optimal mapping is obtained by using a dynamic programming based algorithm. The computational complexity of the algorithm is reduced to less than that of the previous time-frequency warping algorithm with inter-frame consistency. Experimental results show advantageous characteristics of the present algorithm.

UR - http://www.scopus.com/inward/record.url?scp=0035270745&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035270745&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0035270745

VL - 6

SP - 91

EP - 92

JO - Research Reports on Information Science and Electrical Engineering of Kyushu University

JF - Research Reports on Information Science and Electrical Engineering of Kyushu University

SN - 1342-3819

IS - 1

ER -