Search Sciweavers | Sciweavers

495 search results - page 20 / 99

» Constructing States for Reinforcement Learning

131

click to vote

NIPS
2000

150views Information Technology» more NIPS 2000»

Programmable Reinforcement Learning Agents

15 years 5 months ago

Download reference.kfupm.edu.sa

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...

David Andre, Stuart J. Russell

claim paper

Read More »

151

click to vote

ROBOCUP
2005
Springer

134views Robotics» more ROBOCUP 2005»

Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System

15 years 9 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup...

Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...

claim paper

Read More »

143

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 8 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

119

click to vote

PRICAI
2000
Springer

127views Artificial Intelligence» more PRICAI 2000»

Constructing an Autonomous Agent with an Interdependent Heuristics

15 years 8 months ago

Download www.ai.sanken.osaka-u.ac.jp

When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...

Koichi Moriyama, Masayuki Numao

claim paper

Read More »

170

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 2 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 20 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers