Search

نمایش تعداد 1-10 از 158

Learning Concepts from a Sequence of Experiences by Reinforcement Learning Agents

نوع: Conference Paper

نویسنده : Farzad, Rastegar; Majid, Nili Ahmadabadi

Request PDF

One armed bandit process with a covariate

نوع: Journal Paper

نویسنده : Liang, Y. - Wang, X. - Yi, Y.

ناشر: Springer

سال: 2013

The learning of longitudinal human driving behavior and driver assistance strategies

نوع: Journal Paper

ناشر: Elsevier Science

سال: 2013

Reinforcement learning based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

نوع: Journal Paper

ناشر: Elsevier Science

سال: 2014

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

نوع: Conference Paper

نویسنده : سیّدمصطفی کلامی هریس; محمدباقر نقیبی سیستانی; ناصر پریز; Seyyed Mostapha Kalami; Mohammad Bagher Naghibi Sistani; Naser Pariz

سال: 2009

خلاصه:

Markov Decision Process (MDP) has enormous applications in science, engineering, economics and management. Most of decision processes have Markov property and can be modeled as MDP. Reinforcement Learning (RL) is an approach to deal with MDPs. RL...

How Behavior Trees modularize robustness and safety in hybrid systems

نوع: Conference Paper

نویسنده : Colledanchise, M.; Ogren, P.

ناشر: IEEE

سال: 2014

A compact cylindrical dielectric resonator antenna for MIMO applications

نوع: Conference Paper

ناشر: IEEE

سال: 2014

A fast filtering algorithm using the transmission mechanism of human auditory information and its application on quadruped robot speed tracking

نوع: Conference Paper

ناشر: IEEE

سال: 2014

Task-Based Decomposition of Factored POMDPs

نوع: Journal Paper

نویسنده : Shani, Guy

ناشر: IEEE

سال: 2014

Reducing the analog-digital productivity gap using time-mode signal processing

نوع: Conference Paper

نویسنده : Roberts, G.W.

ناشر: IEEE

سال: 2014

1
2
3
4
. . .
16

Search

Learning Concepts from a Sequence of Experiences by Reinforcement Learning Agents

One armed bandit process with a covariate

The learning of longitudinal human driving behavior and driver assistance strategies

Reinforcement learning based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

How Behavior Trees modularize robustness and safety in hybrid systems

A compact cylindrical dielectric resonator antenna for MIMO applications

A fast filtering algorithm using the transmission mechanism of human auditory information and its application on quadruped robot speed tracking

Task-Based Decomposition of Factored POMDPs

Reducing the analog-digital productivity gap using time-mode signal processing

نویسنده

ناشر

سال

کلیدواژه

نوع

زبان

نوع محتوا

عنوان ناشر

Search

Filters

Learning Concepts from a Sequence of Experiences by Reinforcement Learning Agents

One armed bandit process with a covariate

The learning of longitudinal human driving behavior and driver assistance strategies

Reinforcement learning based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

How Behavior Trees modularize robustness and safety in hybrid systems

A compact cylindrical dielectric resonator antenna for MIMO applications

A fast filtering algorithm using the transmission mechanism of human auditory information and its application on quadruped robot speed tracking

Task-Based Decomposition of Factored POMDPs

Reducing the analog-digital productivity gap using time-mode signal processing