Search Sciweavers | Sciweavers

192 search results - page 13 / 39

» Multi-agent Relational Reinforcement Learning

159

Voted

KES
1998
Springer

229views Information Technology» more KES 1998»

An acquisition of the relation between vision and action using self-organizing map and reinforcement learning

15 years 6 months ago

Download www-kasm.nii.ac.jp

An agent must acquire internal representation appropriate for its task, environment, sensors. As a learning algorithm, reinforcement learning is often utilized to acquire the rela...

Kazunori Terada, Hideaki Takeda, Toyoaki Nishida

claim paper

Read More »

119

Voted

NN
2006
Springer

127views Neural Networks» more NN 2006»

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

15 years 2 months ago

Download www.ece.uvic.ca

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai

claim paper

Read More »

134

click to vote

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

16 years 3 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

102

click to vote

AIPS
2007

81views Artificial Intelligence» more AIPS 2007»

Gradient-Based Relational Reinforcement Learning of Temporally Extended Policies

15 years 5 months ago

Download www.cs.umd.edu

Charles Gretton

claim paper

Read More »

130

Voted

ICMLA
2010

205views Machine Learning» more ICMLA 2010»

Incremental Learning of Relational Action Rules

14 years 12 months ago

Download www-lipn.univ-paris13.fr

Abstract--In the Relational Reinforcement learning framework, we propose an algorithm that learns an action model allowing to predict the resulting state of each action in any give...

Christophe Rodrigues, Pierre Gérard, C&eacu...

claim paper

Read More »

« Prev « First page 13 / 39 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers