Search Sciweavers | Sciweavers

169 search results - page 28 / 34

» Planning with Continuous Actions in Partially Observable Env...


ICML
2010
IEEE

188 views, Machine Learning

Constructing States for Reinforcement Learning

15 years 2 months ago

Download www.icml2010.org

POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...

M. M. Hassan Mahmud


UAI
2001

129 views, Artificial Intelligence

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 5 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao


JAIR
2002

120 views

Learning Geometrically-Constrained Hidden Markov Models for Robot Navigation: Bridging the Topological-Geometrical Gap

15 years 4 months ago

Download www.jair.org

Hidden Markov models (HMMs) and partially observable Markov decision processes (POMDPs) provide useful tools for modeling dynamical systems. They are particularly useful for represent...

Hagit Shatkay, Leslie Pack Kaelbling


DATE
2008
IEEE

199 views, Hardware

Safe Automatic Flight Back and Landing of Aircraft Flight Reconfiguration Function (FRF)

15 years 11 months ago

Download www.date-conference.com

The SOFIA (Safe Automatic Flight Back and Landing of Aircraft) project is a response to the challenge of developing concepts and techniques enabling the safe and automatic return to g...

Juan Alberto Herreria Garcia


AMAI
1999
Springer

138 views, Artificial Intelligence

From Logic Programming Towards Multi-Agent Systems

15 years 4 months ago

Download www.doc.ic.ac.uk

In this paper we present an extension of logic programming (LP) that is suitable not only for the "rational" component of a single agent but also for the "reactive"...

Robert A. Kowalski, Fariba Sadri
