Search Sciweavers | Sciweavers

231 search results - page 22 / 47

» Active Learning in Partially Observable Markov Decision Proc...

AROBOTS
2008

User-adapted plan recognition and user-adapted shared control: A Bayesian approach to semi-autonomous wheelchair driving

15 years 6 months ago

Download www.mech.kuleuven.be

Abstract. Many elderly and physically impaired people experience difficulties when maneuvering a powered wheelchair. In order to provide improved maneuvering, powered wheelchairs ha...

Eric Demeester, Alexander Hüntemann, Dirk Van...

ACL
2000

Spoken Dialogue Management Using Probabilistic Reasoning

15 years 8 months ago

Download web.mit.edu

Spoken dialogue managers have benefited from using stochastic planners such as Markov Decision Processes (MDPs). However, so far, MDPs do not handle well noisy and ambiguous speec...

Nicholas Roy, Joelle Pineau, Sebastian Thrun

FOCS
2007
IEEE

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

16 years 1 month ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

ICANN
2007
Springer

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 1 month ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

ICML
2006
ACM

An analytic solution to discrete Bayesian reinforcement learning

16 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...
