Search Sciweavers | Sciweavers

332 search results - page 42 / 67

» Ranking policies in discrete Markov decision processes

ICML
2001
IEEE

185 views, Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

14 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

CODES
2009
IEEE

178 views, Software Engineering

An MDP-based application oriented optimal policy for wireless sensor networks

13 years 11 months ago

Download www.ann.ece.ufl.edu

Technological advancements due to Moore’s law have led to the proliferation of complex wireless sensor network (WSN) domains. One commonality across all WSN domains is the need ...

Arslan Munir, Ann Gordon-Ross

INFOCOM
2012
IEEE

189 views, Communications

Approximately optimal adaptive learning in opportunistic spectrum access

11 years 10 months ago

Download web.eecs.umich.edu

In this paper we develop an adaptive learning algorithm which is approximately optimal for an opportunistic spectrum access (OSA) problem with polynomial complexity. In this OSA...

Cem Tekin, Mingyan Liu

ATAL
2005
Springer

146 views, Intelligent Agents

Exploiting belief bounds: practical POMDPs for personal assistant agents

14 years 1 months ago

Download teamcore.usc.edu

Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users t...

Pradeep Varakantham, Rajiv T. Maheswaran, Milind T...

AAAI
2006

134 views, Intelligent Agents

Point-based Dynamic Programming for DEC-POMDPs

13 years 9 months ago

Download hal.archives-ouvertes.fr

We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...

Daniel Szer, François Charpillet
