Search Sciweavers | Sciweavers

1234 search results - page 6 / 247

» Multi-criteria Reinforcement Learning

225

click to vote

ICML
2010
IEEE

188views Machine Learning» more ICML 2010»

Constructing States for Reinforcement Learning

15 years 5 months ago

Download www.icml2010.org

POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...

M. M. Hassan Mahmud

claim paper

Read More »

218

Voted

ICRA
2010
IEEE

137views Robotics» more ICRA 2010»

Robot reinforcement learning using EEG-based reward signals

15 years 6 months ago

Download webdiis.unizar.es

Abstract— Reinforcement learning algorithms have been successfully applied in robotics to learn how to solve tasks based on reward signals obtained during task execution. These r...

Iñaki Iturrate, Luis Montesano, Javier Ming...

claim paper

Read More »

221

click to vote

JDCTA
2010

160views more JDCTA 2010»

Learning and Decision Making in Human During a Game of Matching Pennies

15 years 2 months ago

Download www.aicit.org

To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...

Jianfeng Hu, Xiaofeng Li, Jinghai Yin

claim paper

Read More »

189

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

211

Voted

JAIR
2000

131views more JAIR 2000»

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

15 years 7 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker

claim paper

Read More »

« Prev « First page 6 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers