Search

Now showing items 1-10 of 158

Learning Concepts from a Sequence of Experiences by Reinforcement Learning Agents

Type: Conference Paper

Author : Rastegar, Farzad; Nili Ahmadabadi, Majid

One armed bandit process with a covariate

Type: Journal Paper

Author : Liang, Y.; Wang, X.; Yi, Y.

Publisher: Springer

Year: 2013

The learning of longitudinal human driving behavior and driver assistance strategies

Type: Journal Paper

Publisher: Elsevier Science

Year: 2013

Reinforcement learning based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

Type: Journal Paper

Publisher: Elsevier Science

Year: 2014

Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

Type: Conference Paper

Author : Seyyed Mostapha Kalami; Mohammad Bagher Naghibi Sistani; Naser Pariz

Year: 2009

Abstract:

The Markov Decision Process (MDP) has numerous applications in science, engineering, economics, and management. Most decision processes have the Markov property and can be modeled as MDPs. Reinforcement Learning (RL) is an approach to solving MDPs. RL...

How Behavior Trees modularize robustness and safety in hybrid systems

Type: Conference Paper

Author : Colledanchise, M.; Ogren, P.

Publisher: IEEE

Year: 2014

A compact cylindrical dielectric resonator antenna for MIMO applications

Type: Conference Paper

Publisher: IEEE

Year: 2014

A fast filtering algorithm using the transmission mechanism of human auditory information and its application on quadruped robot speed tracking

Type: Conference Paper

Publisher: IEEE

Year: 2014

Task-Based Decomposition of Factored POMDPs

Type: Journal Paper

Author : Shani, Guy

Publisher: IEEE

Year: 2014

Reducing the analog-digital productivity gap using time-mode signal processing

Type: Conference Paper

Author : Roberts, G.W.

Publisher: IEEE

Year: 2014

