Search
نمایش تعداد 1-1 از 1
Sequential optimistic ad-hoc methods for nonstationary multi_armed bandit problem
سال: 2009
خلاصه:
-greedy and softmax action selection rules, the
probability matching method and finally the adaptive pursuit
method. For producing near optimal results we change the
ad hoc methods to sequential optimistic ad hoc methods
which provide us...