4018

Author(s): 

  • Nazin A.V.

Number of authors: 

1

Publication details

Publication type: 

Conference abstract

Title: 

Solution to Multi-Armed Bandit Problems via Mirror Descent Algorithms

Conference name: 

  • Annual Meeting on Mathematical Statistics 2007, CIRM, Luminy, France

Source name: 

  • Matériaux Annual Meeting on Mathematical Statistics 2007, CIRM, Luminy, France

Volume designation and number: 

http://172.16.7.77/conf//2007/cirm-2/index_welcome.html

City: 

  • Luminy

Publisher: 

  • CIRM, France

Year of publication: 

2007

Pages: 

24-24

Abstract
In this talk the stochastic multi-armed bandit problem with unknown horizon is considered. A randomized decision strategy is presented, which is based on updating a probability distribution through a stochastic mirror descent type algorithm.
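
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one common way such a strategy can look, not the method presented in the talk. It assumes Bernoulli losses, an entropic mirror map on the probability simplex (in dual-averaging form), importance-weighted loss estimates, and a step size that depends only on the current round so that no horizon has to be fixed in advance. The function name `mirror_descent_bandit` and all parameters are hypothetical.

```python
import math
import random


def mirror_descent_bandit(arm_means, rounds, seed=0):
    """Illustrative entropic mirror descent (dual-averaging form) for a bandit.

    arm_means: assumed Bernoulli reward probabilities, unknown to the learner.
    Returns the final distribution over arms and the pull counts.
    """
    rng = random.Random(seed)
    n = len(arm_means)
    cum_est = [0.0] * n      # cumulative importance-weighted loss estimates
    probs = [1.0 / n] * n    # current probability distribution over arms
    pulls = [0] * n

    for t in range(1, rounds + 1):
        # Step size uses only the current round t (assumed schedule),
        # so the total number of rounds need not be known.
        eta = math.sqrt(math.log(n) / (n * t))

        # Entropic mirror step: p_i proportional to exp(-eta * cum_est_i).
        shift = min(cum_est)  # subtract the minimum for numerical stability
        weights = [math.exp(-eta * (g - shift)) for g in cum_est]
        total = sum(weights)
        probs = [w / total for w in weights]

        # Randomized decision: draw an arm from the current distribution.
        arm = rng.choices(range(n), weights=probs)[0]
        pulls[arm] += 1

        # Simulated Bernoulli loss: 1 with probability 1 - arm_means[arm].
        loss = 1.0 if rng.random() > arm_means[arm] else 0.0

        # Only the played arm is observed; dividing by its probability
        # makes the estimate of the full loss vector unbiased.
        cum_est[arm] += loss / probs[arm]

    return probs, pulls


if __name__ == "__main__":
    final_probs, pull_counts = mirror_descent_bandit([0.2, 0.5, 0.9], rounds=5000)
    print("final distribution:", final_probs)
    print("pull counts:", pull_counts)
```

In this sketch the distribution concentrates on the arm with the smallest estimated cumulative loss, while the randomization keeps every arm's loss estimate unbiased.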

Bibliographic reference: 

Nazin A.V. Solution to Multi-Armed Bandit Problems via Mirror Descent Algorithms / Matériaux Annual Meeting on Mathematical Statistics 2007, CIRM, Luminy, France. Luminy: CIRM, France, 2007. http://172.16.7.77/conf//2007/cirm-2/index_welcome.html. P. 24-24.