Search Sciweavers | Sciweavers

93 search results - page 9 / 19

» Learning action models for multi-agent planning

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

13 years 8 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

click to vote

AAAI
2010

140views Intelligent Agents» more AAAI 2010»

The Model-Based Approach to Autonomous Behavior: A Personal View

13 years 9 months ago

Download www.dtic.upf.edu

The selection of the action to do next is one of the central problems faced by autonomous agents. In AI, three approaches have been used to address this problem: the programming-b...

Hector Geffner

claim paper

Read More »

click to vote

ICML
1990
IEEE

106views Machine Learning» more ICML 1990»

Explanations of Empirically Derived Reactive Plans

13 years 11 months ago

Download www.cs.uwyo.edu

Given an adequate simulation model of the task environment and payoff function that measures the quality of partially successful plans, competition-based heuristics such as geneti...

Diana F. Gordon, John J. Grefenstette

claim paper

Read More »

click to vote

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

13 years 8 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 8 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

« Prev « First page 9 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers