Search

Now showing items 1-9 of 9

Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems

Type: Conference Paper

Author : کامبیز شجاعی قندشتنی; حبیب رجبی مشهدی; kambiz shojaee ghandeshtani; Habib Rajabi Mashhadi

Year: 2017

Abstract:

The Multi Arm Bandit (MAB) problem is a wellknown

decision making problem, where the gambler (operator),

seeks the highest value and best choice among arms with

different reward distributions. In recent years, many effective...

Sequential optimistic ad-hoc methods for nonstationary multi_armed bandit problem

Type: Conference Paper

Author : مجید مازوچی; فرزانه تاتاری; محمدباقر نقیبی سیستانی; Majid Mazouchi; Farzaneh Tatari; Mohammad Bagher Naghibi Sistani

Year: 2009

Abstract:

One of the common ways for showing the trade_off

between exploration_exploitation in reinforcement learning

problems is the multi_armed bandit problem. In this paper

we consider the MABP in a nonstationary environment which...

Study on the POP ceramic package multilayer board design

Type: Conference Paper

Author : Zhang Jie; Wang Ke

Publisher: IEEE

Year: 2014

Hollow-core fibre frequency standard

Type: Conference Paper

Publisher: IEEE

Year: 2014

Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems

Sequential optimistic ad-hoc methods for nonstationary multi_armed bandit problem

Study on the POP ceramic package multilayer board design

Hollow-core fibre frequency standard

Small signal modelling of GaN HEMT at 70GHz

On Optimality of Myopic Policy for Opportunistic Access With Nonidentical Channels and Imperfect Sensing

BER performance of switched diversity receivers over к-μ and η-μ fading channels

Effects of Visual Elements into the Touch Interaction during the Drag Operation

Sequential Testing for Sparse Recovery

Author

Publisher

Year

Keywords

Type

Language (ISO)

Content Type

Publication Title

Search

Filters

Optimistic Initial Value Analysis in a Greedy Selection Approach to MAB Problems

Sequential optimistic ad-hoc methods for nonstationary multi_armed bandit problem

Study on the POP ceramic package multilayer board design

Hollow-core fibre frequency standard

Small signal modelling of GaN HEMT at 70GHz

On Optimality of Myopic Policy for Opportunistic Access With Nonidentical Channels and Imperfect Sensing

BER performance of switched diversity receivers over &#x043A;-&#x03BC; and &#x03B7;-&#x03BC; fading channels

Effects of Visual Elements into the Touch Interaction during the Drag Operation

Sequential Testing for Sparse Recovery

BER performance of switched diversity receivers over к-μ and η-μ fading channels