Sciweavers

231 search results - page 22 / 47
» Active Learning in Partially Observable Markov Decision Proc...
AROBOTS
2008
13 years 7 months ago
User-adapted plan recognition and user-adapted shared control: A Bayesian approach to semi-autonomous wheelchair driving
Many elderly and physically impaired people experience difficulties when maneuvering a powered wheelchair. In order to provide improved maneuvering, powered wheelchairs ha...
Eric Demeester, Alexander Hüntemann, Dirk Van...
ACL
2000
13 years 10 months ago
Spoken Dialogue Management Using Probabilistic Reasoning
Spoken dialogue managers have benefited from using stochastic planners such as Markov Decision Processes (MDPs). However, so far, MDPs do not handle well noisy and ambiguous speec...
Nicholas Roy, Joelle Pineau, Sebastian Thrun
FOCS
2007
IEEE
14 years 3 months ago
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards
We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...
Sudipto Guha, Kamesh Munagala
ICANN
2007
Springer
14 years 2 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
ICML
2006
IEEE
14 years 9 months ago
An analytic solution to discrete Bayesian reinforcement learning
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...
Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...