Search Sciweavers | Sciweavers

513 search results - page 4 / 103

» Metric learning for reinforcement learning agents

CORR
2011
Springer

136 views · Education

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

12 years 11 months ago

Download www.aaai.org

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using ...

Enric Celaya, Josep M. Porta

ICML
2008
IEEE

135 views · Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 8 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

ICMLA
2010

207 views · Machine Learning

Multi-Agent Inverse Reinforcement Learning

13 years 5 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

NIPS
2000

150 views · Information Technology

Programmable Reinforcement Learning Agents

13 years 8 months ago

Download reference.kfupm.edu.sa

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process. The language includes s...

David Andre, Stuart J. Russell

AAAI
2010

171 views · Intelligent Agents

Reinforcement Learning via AIXI Approximation

13 years 9 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...
