Search
Now showing items 1-9 of 9
Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems
Year: 2017
Abstract:
The Multi Arm Bandit (MAB) problem is a wellknown
decision making problem, where the gambler (operator),
seeks the highest value and best choice among arms with
different reward distributions. In recent years, many effective...
Sequential optimistic ad-hoc methods for nonstationary multi_armed bandit problem
Year: 2009
Abstract:
One of the common ways for showing the trade_off
between exploration_exploitation in reinforcement learning
problems is the multi_armed bandit problem. In this paper
we consider the MABP in a nonstationary environment which...
Study on the POP ceramic package multilayer board design
Publisher: IEEE
Year: 2014
Hollow-core fibre frequency standard
Publisher: IEEE
Year: 2014
Small signal modelling of GaN HEMT at 70GHz
Publisher: IEEE
Year: 2014
On Optimality of Myopic Policy for Opportunistic Access With Nonidentical Channels and Imperfect Sensing
Publisher: IEEE
Year: 2014
BER performance of switched diversity receivers over к-μ and η-μ fading channels
Publisher: IEEE
Year: 2014
Effects of Visual Elements into the Touch Interaction during the Drag Operation
Publisher: IEEE
Year: 2014
Sequential Testing for Sparse Recovery
Publisher: IEEE
Year: 2014