Search Sciweavers | Sciweavers

90 search results - page 12 / 18

» On the hardness of finding symmetries in Markov decision pro...

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

14 years 8 months ago

Download www.cis.upenn.edu

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

AIPS
2008

151 views, Artificial Intelligence

Criticality Metrics for Distributed Plan and Schedule Management

13 years 9 months ago

Download www.aaai.org

We address the problem of coordinating the plans and schedules for a team of agents in an uncertain and dynamic environment. Bounded rationality, bounded communication, subjectivi...

Rajiv T. Maheswaran, Pedro A. Szekely

AAAI
2004

167 views, Intelligent Agents

Dynamic Programming for Partially Observable Stochastic Games

13 years 8 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...

AAAI
2006

157 views, Intelligent Agents

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning

13 years 8 months ago

Download www.aaai.org

Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...

Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...

ATAL
2011
Springer

169 views, Intelligent Agents

Towards a unifying characterization for quantifying weak coupling in dec-POMDPs

12 years 7 months ago

Download ai.eecs.umich.edu

Researchers in the field of multiagent sequential decision making have commonly used the terms “weakly-coupled” and “loosely-coupled” to qualitatively classify problems i...

Stefan J. Witwicki, Edmund H. Durfee
