Search Sciweavers | Sciweavers

113 search results - page 17 / 23

» Model Approximation for HEXQ Hierarchical Reinforcement Lear...

182

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

16 years 29 days ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

224

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 2 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

211

click to vote

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

16 years 1 months ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

198

click to vote

SMC
2007
IEEE

118views Control Systems» more SMC 2007»

One-class learning with multi-objective genetic programming

16 years 1 months ago

Download users.cs.dal.ca

One-class classiﬁcation naturally only provides one class of exemplars on which to construct the classiﬁcation model. In this work, multiobjective genetic programming (GP) all...

Robert Curry, Malcolm I. Heywood

claim paper

Read More »

217

click to vote

ATAL
2005
Springer

148views Intelligent Agents» more ATAL 2005»

An integrated framework for adaptive reasoning about conversation patterns

16 years 1 months ago

Download homepages.inf.ed.ac.uk

We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...

Michael Rovatsos, Felix A. Fischer, Gerhard Wei&sz...

claim paper

Read More »

« Prev « First page 17 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers