Sequential optimistic ad-hoc methods for nonstationary multi-armed bandit problem
Year: 2009
Abstract:
ε-greedy and softmax action selection rules, the
probability matching method and, finally, the adaptive pursuit
method. To produce near-optimal results, we change the
ad hoc methods to sequential optimistic ad hoc methods,
which provide us...
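
For orientation, the two action selection rules named first in the abstract can be sketched as below. This is only a minimal illustration of the textbook forms of ε-greedy and softmax (Boltzmann) selection, not the paper's sequential optimistic variants, which the truncated abstract does not describe. The function names, the `values` list of per-arm reward estimates, and the `epsilon` and `temperature` parameters are all assumptions made for this sketch.

```python
import math
import random


def epsilon_greedy(values, epsilon, rng):
    """With probability epsilon pick a uniformly random arm, else the best-estimated arm."""
    if rng.random() < epsilon:
        return rng.randrange(len(values))
    return max(range(len(values)), key=lambda arm: values[arm])


def softmax_probabilities(values, temperature):
    """Boltzmann probabilities over arms; subtracting the max keeps exp() stable."""
    top = max(values)
    weights = [math.exp((v - top) / temperature) for v in values]
    total = sum(weights)
    return [w / total for w in weights]


def softmax_select(values, temperature, rng):
    """Sample one arm according to its softmax probability."""
    probs = softmax_probabilities(values, temperature)
    return rng.choices(range(len(values)), weights=probs, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)           # seeded only to make the demo repeatable
    estimates = [0.1, 0.9, 0.3]      # hypothetical reward estimates for three arms
    print(epsilon_greedy(estimates, 0.1, rng))
    print(softmax_probabilities(estimates, 0.5))
    print(softmax_select(estimates, 0.5, rng))
```

In a nonstationary setting the estimates in `values` would typically be updated with a constant step size so that recent rewards count more, but that choice is likewise an assumption here and not something the abstract states.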