Search Sciweavers | Sciweavers

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

187

click to vote

ATAL
2004
Springer

116views Intelligent Agents» more ATAL 2004»

Time-Extended Policies in Multi-Agent Reinforcement Learning

16 years 8 days ago

Download web.engr.oregonstate.edu

Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement l...

Kagan Tumer, Adrian K. Agogino

claim paper

Read More »

122

click to vote

NIPS
2000

112views Information Technology» more NIPS 2000»

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 8 months ago

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

claim paper

Read More »

187

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 10 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

« Prev « First page 22 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers