کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
434062 689675 2014 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Weighted last-step min–max algorithm with improved sub-logarithmic regret
ترجمه فارسی عنوان
الگوریتم مینا با حداکثر وزن با کمترین انعکاس زیر لگاریتمی
کلمات کلیدی
یادگیری آنلاین، پسرفت، یادگیری حداقل
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی

In online learning the performance of an algorithm is typically compared to the performance of a fixed function from some class, with a quantity called regret. Forster [12] proposed a last-step min–max algorithm which was somewhat simpler than the algorithm of Vovk [26], yet with the same regret. In fact the algorithm he analyzed assumed that the choices of the adversary are bounded, yielding artificially only the two extreme cases. We fix this problem by weighing the examples in such a way that the min–max problem will be well defined, and provide analysis with logarithmic regret that may have better multiplicative factor than both bounds of Forster [12] and Vovk [26]. We also derive a new bound that may be sub-logarithmic, as a recent bound of Orabona et al. [21], but may have better multiplicative factor. Finally, we analyze the algorithm in a weak-type of non-stationary setting, and show a bound that is sublinear if the non-stationarity is sub-linear as well.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 558, 13 November 2014, Pages 107–124
نویسندگان
, ,