Search Sciweavers | Sciweavers

91 search results - page 13 / 19

» Magnifying-Lens Abstraction for Markov Decision Processes

click to vote

INFOCOM
2011
IEEE

323views Communications» more INFOCOM 2011»

A high-throughput routing metric for reliable multicast in multi-rate wireless mesh networks

12 years 11 months ago

Download www.cse.unsw.edu.au

Abstract—We propose a routing metric for enabling highthroughput reliable multicast in multi-rate wireless mesh networks. This new multicast routing metric, called expected multi...

Xin Zhao, Jun Guo, Chun Tung Chou, Archan Misra, S...

claim paper

Read More »

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

14 years 1 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

click to vote

ICRA
2010
IEEE

163views Robotics» more ICRA 2010»

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

13 years 6 months ago

Download robotics.ai.uiuc.edu

Abstract— We propose a planning algorithm that allows usersupplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...

claim paper

Read More »

click to vote

CDC
2008
IEEE

118views Control Systems» more CDC 2008»

A density projection approach to dimension reduction for continuous-state POMDPs

14 years 2 months ago

Download netfiles.uiuc.edu

Abstract— Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms ...

Enlu Zhou, Michael C. Fu, Steven I. Marcus

claim paper

Read More »

click to vote

SARA
2005
Springer

102views Artificial Intelligence» more SARA 2005»

Feature-Discovering Approximate Value Iteration Methods

14 years 1 months ago

Download cobweb.ecn.purdue.edu

Sets of features in Markov decision processes can play a critical role ximately representing value and in abstracting the state space. Selection of features is crucial to the succe...

Jia-Hong Wu, Robert Givan

claim paper

Read More »

« Prev « First page 13 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers