Search Sciweavers | Sciweavers

267 search results - page 43 / 54

» Qualitative Analysis of Partially-Observable Markov Decision...

151

click to vote

ICML
2005
IEEE

157views Machine Learning» more ICML 2005»

A causal approach to hierarchical decomposition of factored MDPs

16 years 6 months ago

Download www-anw.cs.umass.edu

We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...

Anders Jonsson, Andrew G. Barto

claim paper

Read More »

147

click to vote

CISS
2008
IEEE

100views Information Technology» more CISS 2008»

Rate adaptation via link-layer feedback for goodput maximization over a time-varying channel

16 years 8 days ago

Download www.ece.osu.edu

Abstract—We consider adapting the transmission rate to maximize the goodput, i.e., the amount of data transmitted without error, over a continuous Markov ﬂat-fading wireless ch...

Rohit Aggarwal, Phil Schniter, Can Emre Koksal

claim paper

Read More »

162

click to vote

ATAL
2005
Springer

146views Intelligent Agents» more ATAL 2005»

Exploiting belief bounds: practical POMDPs for personal assistant agents

15 years 11 months ago

Download teamcore.usc.edu

Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users t...

Pradeep Varakantham, Rajiv T. Maheswaran, Milind T...

claim paper

Read More »

148

click to vote

SODA
2004
ACM

94views Algorithms» more SODA 2004»

Quantitative stochastic parity games

15 years 7 months ago

Download www.dcs.warwick.ac.uk

We study perfect-information stochastic parity games. These are two-player nonterminating games which are played on a graph with turn-based probabilistic transitions. A play resul...

Krishnendu Chatterjee, Marcin Jurdzinski, Thomas A...

claim paper

Read More »

156

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

16 years 11 days ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

« Prev « First page 43 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers