Search Sciweavers | Sciweavers

32 search results - page 1 / 7

ICML
1995
IEEE

213 views, Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

14 years 8 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

AAAI
2007

131 views, Intelligent Agents

Scaling Up: Solving POMDPs through Value Based Clustering

13 years 9 months ago

Download www.aaai.org

Partially Observable Markov Decision Processes (POMDPs) provide an appropriately rich model for agents operating under partial knowledge of the environment. Since finding an opti...

Yan Virin, Guy Shani, Solomon Eyal Shimony, Ronen ...

ICMLA
2008

130 views, Machine Learning

A Predictive Model for Imitation Learning in Partially Observable Environments

13 years 9 months ago

Download www.damas.ift.ulaval.ca

Learning by imitation has been shown to be a powerful paradigm for automated learning in autonomous robots. This paper presents a general framework of learning by imitation for stochas...

Abdeslam Boularias

ATAL
2008
Springer

99 views, Intelligent Agents

Not all agents are equal: scaling up distributed POMDPs for agent networks

13 years 9 months ago

Download teamcore.usc.edu

Many applications of networks of agents, including mobile sensor networks, unmanned air vehicles, autonomous underwater vehicles, involve 100s of agents acting collaboratively und...

Janusz Marecki, Tapana Gupta, Pradeep Varakantham,...

ATAL
2009
Springer

198 views, Intelligent Agents

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

14 years 2 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh
