value iteration | Sciweavers

88

AAAI
2012

197views Intelligent Agents» more AAAI 2012»

MOMDPs: A Solution for Modelling Adaptive Management Problems

12 years 9 months ago

Download martinecologylab.files.wordpress.com

In conservation biology and natural resource management, adaptive management is an iterative process of improving management by reducing uncertainty via monitoring. Adaptive manag...

Iadine Chades, Josie Carwardine, Tara G. Martin, S...

claim paper

Read More »

53

click to vote

ATAL
2011
Springer

220views Intelligent Agents» more ATAL 2011»

Maximum causal entropy correlated equilibria for Markov games

13 years 7 months ago

Download www.cs.cmu.edu

Motivated by a machine learning perspective—that gametheoretic equilibria constraints should serve as guidelines for predicting agents’ strategies, we introduce maximum causal...

Brian D. Ziebart, J. Andrew Bagnell, Anind K. Dey

claim paper

Read More »

69

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

63

click to vote

TIT
2008

110views more TIT 2008»

Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel

14 years 7 months ago

Download ece.iisc.ernet.in

We consider the problem of several users transmitting packets to a base station, and study an optimal scheduling formulation involving three communication layers, namely, the mediu...

Munish Goyal, Anurag Kumar, Vinod Sharma

claim paper

Read More »

88

click to vote

UAI
2000

136views Artificial Intelligence» more UAI 2000»

Fast Planning in Stochastic Games

14 years 8 months ago

Download www.cis.upenn.edu

Stochastic games generalize Markov decision processes MDPs to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...

claim paper

Read More »

64

click to vote

UAI
2004

108views Artificial Intelligence» more UAI 2004»

Heuristic Search Value Iteration for POMDPs

14 years 8 months ago

Download www.cs.cmu.edu

We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret w...

Trey Smith, Reid G. Simmons

claim paper

Read More »

57

click to vote

IJCAI
2003

122views Artificial Intelligence» more IJCAI 2003»

Point-based value iteration: An anytime algorithm for POMDPs

14 years 8 months ago

Download www.cs.mcgill.ca

This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of represen...

Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun

claim paper

Read More »

67

click to vote

AIPS
2008

148views Artificial Intelligence» more AIPS 2008»

Bounded-Parameter Partially Observable Markov Decision Processes

14 years 9 months ago

Download www.aaai.org

The POMDP is considered as a powerful model for planning under uncertainty. However, it is usually impractical to employ a POMDP with exact parameters to model precisely the real-...

Yaodong Ni, Zhi-Qiang Liu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers