Search
نمایش تعداد 1-10 از 158
One armed bandit process with a covariate
ناشر: Springer
سال: 2013
The learning of longitudinal human driving behavior and driver assistance strategies
ناشر: Elsevier Science
سال: 2013
Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems
سال: 2009
خلاصه:
Markov Decision Process (MDP) has enormous applications in science, engineering, economics and management. Most of decision processes have Markov property and can be modeled as MDP. Reinforcement Learning (RL) is an approach to deal with MDPs. RL...
How Behavior Trees modularize robustness and safety in hybrid systems
ناشر: IEEE
سال: 2014
A compact cylindrical dielectric resonator antenna for MIMO applications
ناشر: IEEE
سال: 2014
Task-Based Decomposition of Factored POMDPs
ناشر: IEEE
سال: 2014
Reducing the analog-digital productivity gap using time-mode signal processing
ناشر: IEEE
سال: 2014