Search Sciweavers | Sciweavers

262 search results - page 38 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 10 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

AIED
2011
Springer

243views Artificial Intelligence» more AIED 2011»

Faster Teaching by POMDP Planning

13 years 2 months ago

Download louisville.edu

Both human and automated tutors must infer what a student knows and plan future actions to maximize learning. Though substantial research has been done on tracking and modeling stu...

Anna N. Rafferty, Emma Brunskill, Thomas L. Griffi...

claim paper

Read More »

click to vote

AAAI
2006

157views Intelligent Agents» more AAAI 2006»

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning

14 years 13 days ago

Download www.aaai.org

Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...

Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 5 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

AI
2006
Springer

167views Artificial Intelligence» more AI 2006»

Belief Selection in Point-Based Planning Algorithms for POMDPs

14 years 2 months ago

Download www.cs.mcgill.ca

Abstract. Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value funct...

Masoumeh T. Izadi, Doina Precup, Danielle Azar

claim paper

Read More »

« Prev « First page 38 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers