Abstract
We consider the problem of dynamically apportioning resources among a set of options in a worst-case online framework. The model we investigate is a generalization of the well-studied online learning model. In particular, we allow the learner to observe, as additional information, how high the risk of each option is. This assumption is natural in many applications such as horse-race betting, where gamblers know the odds for all options before placing bets. We apply Vovk's Aggregating Algorithm to this problem and give a tight performance bound. The results support our intuition that it is safe to bet more on low-risk options. Surprisingly, the loss bound of the algorithm does not depend on the values of relatively small risks.
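For orientation, the baseline online allocation setting that the abstract generalizes can be sketched with a standard exponential-weights (Hedge-style) learner. This is only an illustrative sketch and not the Aggregating Algorithm variant analyzed in the paper: it ignores risk information entirely, and the function names, the learning rate `eta`, and the assumption that losses lie in [0, 1] are choices made for the example.

```python
import math


def hedge_allocate(weights):
    """Turn non-negative weights into an allocation (fractions summing to 1)."""
    total = sum(weights)
    return [w / total for w in weights]


def hedge_update(weights, losses, eta):
    """Multiplicative update: shrink each weight by exp(-eta * loss)."""
    return [w * math.exp(-eta * loss) for w, loss in zip(weights, losses)]


def run_hedge(loss_rounds, eta=0.5):
    """Run the learner over a sequence of per-option loss vectors.

    loss_rounds: list of rounds, each a list of losses in [0, 1] (assumed range).
    Returns the learner's cumulative loss and its final allocation.
    """
    n_options = len(loss_rounds[0])
    weights = [1.0] * n_options  # start from a uniform allocation
    total_loss = 0.0
    for losses in loss_rounds:
        allocation = hedge_allocate(weights)
        # The learner suffers the allocation-weighted average loss this round.
        total_loss += sum(p * loss for p, loss in zip(allocation, losses))
        weights = hedge_update(weights, losses, eta)
    return total_loss, hedge_allocate(weights)


if __name__ == "__main__":
    # Option 0 never loses, option 1 always loses.
    rounds = [[0.0, 1.0]] * 3
    loss, final_allocation = run_hedge(rounds)
    print(loss, final_allocation)
```

In the setting described by the abstract, the learner would additionally see each option's risk before allocating, which this baseline has no way to exploit.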
Original language | English |
---|---|
Pages (from-to) | 2340-2347 |
Number of pages | 8 |
Journal | IEICE Transactions on Information and Systems |
Volume | E89-D |
Issue number | 8 |
Publication status | Published - Aug 2006 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Software
- Hardware and Architecture
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering
- Artificial Intelligence