Search Sciweavers | Sciweavers

1235 search results - page 38 / 247

» Reinforcement learning in a nutshell


CIG
2005
IEEE

Applied Computing

Adapting Reinforcement Learning for Computer Games: Using Group Utility Functions

14 years 2 months ago

Download cswww.essex.ac.uk

Abstract: Group utility functions are an extension of the common team utility function for providing multiple agents with a common reinforcement learning signal for learning cooperat...

Jay Bradley, Gillian Hayes


TSMC
2008


Improved Adaptive-Reinforcement Learning Control for Morphing Unmanned Air Vehicles

13 years 8 months ago

Download jungfrau.tamu.edu

This paper presents an improved Adaptive

John Valasek, James Doebbler, Monish D. Tandale, A...


IWLCS
2005
Springer

Machine Learning

Counter Example for Q-Bucket-Brigade Under Prediction Problem

14 years 2 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara


ICML
2006
IEEE

Machine Learning

PAC model-free reinforcement learning

14 years 9 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...


AGENTS
2001
Springer

Security Privacy

Using background knowledge to speed reinforcement learning in physical agents

14 years 1 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter
