Search Sciweavers | Sciweavers

473 search results - page 64 / 95

» Optimal policy switching algorithms for reinforcement learni...

157

click to vote

CPAIOR
2010
Springer

141views Operations Research» more CPAIOR 2010»

Strong Combination of Ant Colony Optimization with Constraint Programming Optimization

15 years 10 months ago

Download liris.cnrs.fr

We introduce an approach which combines ACO (Ant Colony Optimization) and IBM ILOG CP Optimizer for solving COPs (Combinatorial Optimization Problems). The problem is modeled using...

Madjid Khichane, Patrick Albert, Christine Solnon

claim paper

Read More »

142

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

172

click to vote

GECCO
2010
Springer

153views Optimization» more GECCO 2010»

Multi-task evolutionary shaping without pre-specified representations

15 years 9 months ago

Download www.science.uva.nl

Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...

Matthijs Snel, Shimon Whiteson

claim paper

Read More »

172

click to vote

ACL
1998

129views Computational Linguistics» more ACL 1998»

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

15 years 7 months ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...

claim paper

Read More »

169

click to vote

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

16 years 2 days ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

« Prev « First page 64 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers