Boosting based on divide and merge

Eiji Takimoto, Syuhei Koya, Akira Maruoka

研究成果: ジャーナルへの寄稿会議記事査読

1 被引用数 (Scopus)

抄録

InfoBoost is a boosting algorithm that improves the performance of the master hypothesis whenever each weak hypothesis brings non-zero mutual information about the target. We give a somewhat surprising observation that InfoBoost can be viewed as an algorithm for growing a branching program that divides and merges the domain repeatedly. We generalize the merging process and propose a new class of boosting algorithms called BP.InfoBoost with various merging schema. BP.InfoBoost assigns to each node a weight as well as a weak hypothesis and the master hypothesis is a threshold function of the sum of the weights over the path induced by a given instance. InfoBoost is a BP.InfoBoost with an extreme scheme that merges all nodes in each round. The other extreme that merges no nodes yields an algorithm for growing a decision tree. We call this particular version DT.InfoBoost. We give an evidence that DT.InfoBoost improves the master hypothesis very efficiently, but it has a risk of overfitting because the size of the master hypothesis may grow exponentially. We propose a merging scheme between these extremes that improves the master hypothesis nearly as fast as the one without merge while keeping the branching program in a moderate size.

本文言語英語
ページ(範囲)127-141
ページ数15
ジャーナルLecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
3244
DOI
出版ステータス出版済み - 2004
外部発表はい
イベント15th International Conference ALT 2004: Algorithmic Learning Theory - Padova, イタリア
継続期間: 10月 2 200410月 5 2004

!!!All Science Journal Classification (ASJC) codes

  • 理論的コンピュータサイエンス
  • コンピュータ サイエンス(全般)

フィンガープリント

「Boosting based on divide and merge」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル