| Article ID | Journal | Published Year | Pages | File Type |
|---|---|---|---|---|
| 708807 | IFAC-PapersOnLine | 2016 | 6 Pages | |
Abstract
We consider the two-armed bandit problem as applied to data processing, where there are two alternative processing methods with different, a priori unknown efficiencies. The goal is to determine the more efficient method and to ensure that it is predominantly applied. The normal two-armed bandit is a generalization that allows data to be processed in parallel almost without loss of control performance, i.e. without increasing the minimax risk. However, it requires the methods to have close efficiencies. Below we propose an adaptive modification of the algorithm that works properly with methods whose efficiencies are not necessarily close.
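To make the setting concrete, the following is a minimal Python sketch of batch-wise selection between two processing methods. It is not the adaptive algorithm proposed in the paper: the threshold rule, the unit-variance Gaussian incomes, and all names (`run_batch_bandit`, `means`, `threshold`, and so on) are assumptions introduced purely for illustration. Each batch is handled entirely by one method, which is what would allow the items inside a batch to be processed in parallel.

```python
import random


def run_batch_bandit(means, n_batches=50, batch_size=100, threshold=3.0, seed=0):
    """Illustrative batch strategy for two methods (hypothetical, not the paper's).

    means      -- true mean incomes of the two methods (unknown to the strategy)
    threshold  -- assumed cut-off on the standardized income difference
    Returns (committed_method_or_None, items_processed_per_method).
    """
    rng = random.Random(seed)
    totals = [0.0, 0.0]  # cumulative income of each method
    counts = [0, 0]      # number of items processed by each method
    committed = None     # index of the method chosen for good, if any

    for b in range(n_batches):
        # Alternate the methods batch by batch until one is committed to.
        arm = b % 2 if committed is None else committed

        # One whole batch goes to a single method; the items are independent,
        # so in a real system they could be processed in parallel.
        income = sum(rng.gauss(means[arm], 1.0) for _ in range(batch_size))
        totals[arm] += income
        counts[arm] += batch_size

        # Compare the methods only after both have seen equally many items.
        if committed is None and counts[0] == counts[1]:
            n = counts[0]
            # With unit variances the difference of totals has variance 2n.
            z = (totals[0] - totals[1]) / (2 * n) ** 0.5
            if abs(z) > threshold:
                committed = 0 if z > 0 else 1

    return committed, counts


if __name__ == "__main__":
    chosen, processed = run_batch_bandit(means=(0.0, 0.5))
    print("committed method:", chosen, "items per method:", processed)
```

When the two means are far apart, this rule commits after very few batches; when they are close, it keeps alternating for longer. That is only a rough illustration of why a strategy's behaviour can depend on how close the efficiencies are.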
Related Topics
Physical Sciences and Engineering
Engineering
Computational Mechanics
Authors
Alexander V. Kolnogorov