Search Sciweavers | Sciweavers

85 search results - page 14 / 17

» Approximate Policy Iteration with a Policy Language Bias

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

13 years 8 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

click to vote

SC
1995
ACM

99views Applied Computing» more SC 1995»

Parallel Matrix-Vector Product Using Approximate Hierarchical Methods

13 years 11 months ago

Download www.chg.ru

Matrix-vector products (mat-vecs) form the core of iterative methods used for solving dense linear systems. Often, these systems arise in the solution of integral equations used i...

Ananth Grama, Vipin Kumar, Ahmed H. Sameh

claim paper

Read More »

click to vote

ICRA
2008
IEEE

167views Robotics» more ICRA 2008»

An approximate algorithm for solving oracular POMDPs

14 years 1 months ago

Download www.cs.cmu.edu

Abstract— We propose a new approximate algorithm, LAJIV (Lookahead J-MDP Information Value), to solve Oracular Partially Observable Markov Decision Problems (OPOMDPs), a special ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

14 years 22 days ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

click to vote

ATAL
2006
Springer

157views Intelligent Agents» more ATAL 2006»

Decentralized planning under uncertainty for teams of communicating agents

13 years 11 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

claim paper

Read More »

« Prev « First page 14 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers