Search Sciweavers | Sciweavers

262 search results - page 34 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

click to vote

CONNECTION
2008

178views more CONNECTION 2008»

Spoken language interaction with model uncertainty: an adaptive human-robot interaction system

13 years 10 months ago

Download people.csail.mit.edu

Spoken language is one of the most intuitive forms of interaction between humans and agents. Unfortunately, agents that interact with people using natural language often experienc...

Finale Doshi, Nicholas Roy

claim paper

Read More »

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

14 years 4 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

14 years 4 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

click to vote

ICMLA
2009

181views Machine Learning» more ICMLA 2009»

Sensitivity Analysis of POMDP Value Functions

13 years 7 months ago

Download www.cs.cmu.edu

In sequential decision making under uncertainty, as in many other modeling endeavors, researchers observe a dynamical system and collect data measuring its behavior over time. The...

Stéphane Ross, Masoumeh T. Izadi, Mark Merc...

claim paper

Read More »

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

13 years 8 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

« Prev « First page 34 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers