Sound source detection using multiple noise models

Shoichi Matsunaga, Masahide Yamaguchi, Katsuya Yamauchi, Masaru Yamashita

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

1 引用 (Scopus)

抜粋

This paper describes a sound source detection approach based on elaborate noise-modeling techniques for audio indexing. For accurate detection, we devised two methods to generate multiple-noise models through clustering techniques. One method is based on frame-wise data similarity, and the other is based on noise source similarity. The former method employs K-means clustering and a smoothing technique to avoid inaccurate segmentation. The latter method involves noise modeling based on a tree data structure generated by the progressive merging of noise clusters. The classification experiments show that by using these proposed methods, audio sources can be detected with better accuracy than that achieved by a conventional method. When four noise models generated by the latter method were used, the noise detection performance increased by 3.9% for the periods in which the sound sources did not overlap. With regard to the experiments for an audio stream that included overlapped segments, the noise detection performance increased by 1.2% without a decrease in the speech detection performance.

元の言語英語
ホスト出版物のタイトル2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
ページ2025-2028
ページ数4
DOI
出版物ステータス出版済み - 9 16 2008
外部発表Yes
イベント2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP - Las Vegas, NV, 米国
継続期間: 3 31 20084 4 2008

出版物シリーズ

名前ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷物)1520-6149

その他

その他2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
米国
Las Vegas, NV
期間3/31/084/4/08

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

フィンガープリント Sound source detection using multiple noise models' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Matsunaga, S., Yamaguchi, M., Yamauchi, K., & Yamashita, M. (2008). Sound source detection using multiple noise models. : 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP (pp. 2025-2028). [4518037] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). https://doi.org/10.1109/ICASSP.2008.4518037