Библиографическая ссылка:
Назин А.В., Миллер Б.М. Robust Mirror Decent Algorithm for a Multi-Armed Bandit Governed by a Stationary Finite Markov Chain / Proceedings of the 7th IFAC Conference on Manufacturing Modelling, Management, and Control (MIM`2013, Saint Petersburg). Saint Petersburg: Saint Petersburg State University and Saint Petersburg National Research University of Information Technologies, Mechanics, and Optics, 2013. С. 939-943.