Search Sciweavers | Sciweavers

32 search results - page 2 / 7

» Learning Policies for Partially Observable Environments: Sca...

click to vote

UAI
2000

106views Artificial Intelligence» more UAI 2000»

Learning to Cooperate via Policy Search

13 years 8 months ago

Download reference.kfupm.edu.sa

Cooperative games are those in which both agents share the same payoff structure. Valuebased reinforcement-learning algorithms, such as variants of Q-learning, have been applied t...

Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Les...

claim paper

Read More »

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

14 years 8 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

click to vote

CORR
2011
Springer

161views Education» more CORR 2011»

Doubly Robust Policy Evaluation and Learning

12 years 11 months ago

Download www.icml-2011.org

We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...

Miroslav Dudík, John Langford, Lihong Li

claim paper

Read More »

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

13 years 9 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

14 years 1 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

« Prev « First page 2 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers