Search Sciweavers | Sciweavers

170 search results - page 15 / 34

» Heuristic Selection of Actions in Multiagent Reinforcement L...

142

click to vote

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 8 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

157

click to vote

ATAL
2004
Springer

97views Intelligent Agents» more ATAL 2004»

Unifying Temporal and Structural Credit Assignment Problems

15 years 11 months ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

187

click to vote

TFS
2011

239views Education» more TFS 2011»

Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning

15 years 28 days ago

Download www.triteq.com

—Reinforcement learning (RL) is a valuable learning method when the systems require a selection of control actions whose consequences emerge over long periods for which input– ...

William M. Hinojosa, Samia Nefti, Uzay Kaymak

claim paper

Read More »

169

click to vote

DAGM
2006
Springer

121views Image Processing» more DAGM 2006»

Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition

15 years 9 months ago

Download www5.informatik.uni-erlangen.de

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...

Christian Derichs, Heinrich Niemann

claim paper

Read More »

167

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 6 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 15 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers