Библиографическая ссылка:
Назин А.В., Миллер Б.М. On Effectiveness of the Mirror Decent Algorithm for a Stochastic Multi-Armed Bandit Governed by a Stationary Finite Markov Chain / Proceedings of the 3rd Australian Control Conference (AUCC2013, Perth, Western Australia). Perth, Australia: Engineers Australia, 2013. С. 244-250.