Search Sciweavers | Sciweavers

698 search results - page 90 / 140

» A Deterministic Algorithm for Solving Imprecise Decision Pro...

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

14 years 24 days ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

FLAIRS
1998

132views Artificial Intelligence» more FLAIRS 1998»

Analytical Design of Reinforcement Learning Tasks

13 years 10 months ago

Download www.aaai.org

Reinforcement learning (RL) problems constitute an important class of learning and control problems faced by artificial intelligence systems. In these problems, one is faced with ...

Robert E. Smith

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

14 years 10 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

click to vote

CDC
2008
IEEE

118views Control Systems» more CDC 2008»

A density projection approach to dimension reduction for continuous-state POMDPs

14 years 3 months ago

Download netfiles.uiuc.edu

Abstract— Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on discrete-state models, and these algorithms ...

Enlu Zhou, Michael C. Fu, Steven I. Marcus

claim paper

Read More »

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 10 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

« Prev « First page 90 / 140 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers