Search Sciweavers | Sciweavers

1235 search results - page 149 / 247

» Reinforcement learning in a nutshell

CORR
2012
Springer

216 views · Education

Fractional Moments on Bandit Problems

13 years 11 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

ROBOCUP
2004
Springer

114 views · Robotics

Modular Learning System and Scheduling for Behavior Acquisition in Multi-agent Environment

15 years 8 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since othe...

Yasutake Takahashi, Kazuhiro Edazawa, Minoru Asada

IJCAI
2007

275 views · Artificial Intelligence

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

15 years 4 months ago

Download www.ijcai.org

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefficient re-use of control knowledge acquired over the...

Mehran Asadi, Manfred Huber

ECML
2006
Springer

141 views · Machine Learning

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 7 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

FLAIRS
2000

86 views · Artificial Intelligence

Resolving Conflicts Among Actions in Concurrent Behaviors

15 years 4 months ago

Download www.aaai.org

A robotic agent must coordinate its coupled concurrent behaviors to produce a coherent response to stimuli. Reinforcement learning has been used extensively in coordinating sensin...

Henry Hexmoor
