Search Sciweavers | Sciweavers

1234 search results - page 158 / 247

» Multi-criteria Reinforcement Learning

JIRS
2000

An Integrated Approach of Learning, Planning, and Execution

15 years 3 months ago

Download laboratorios.fi.uba.ar

Agents (hardware or software) that act autonomously in an environment have to be able to integrate three basic behaviors: planning, execution, and learning. This integration is man...

Ramón García-Martínez, Daniel...

ROBOCUP
2004
Springer

Modular Learning System and Scheduling for Behavior Acquisition in Multi-agent Environment

15 years 9 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since othe...

Yasutake Takahashi, Kazuhiro Edazawa, Minoru Asada

IJCAI
2007

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

15 years 5 months ago

Download www.ijcai.org

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefficient re-use of control knowledge acquired over the...

Mehran Asadi, Manfred Huber

ML
2002
ACM

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 3 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

COLT
2010
Springer

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 2 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

