Search Sciweavers | Sciweavers

473 search results - page 43 / 95

» Optimal policy switching algorithms for reinforcement learni...

ECAI
2006
Springer

194 views, Artificial Intelligence

Strategic Foresighted Learning in Competitive Multi-Agent Games

15 years 9 months ago

Download homepages.cwi.nl

We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive...

Pieter Jan 't Hoen, Sander M. Bohte, Han La Poutré...

ICML
2005
IEEE

93 views, Machine Learning

Relating reinforcement learning performance to classification performance

16 years 6 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

IJCAI
2001

119 views, Artificial Intelligence

Rational and Convergent Learning in Stochastic Games

15 years 7 months ago

Download reference.kfupm.edu.sa

This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...

Michael H. Bowling, Manuela M. Veloso

AR
2002

157 views

Acquiring state from control dynamics to learn grasping policies for robot hands

15 years 5 months ago

Download www.mit.edu

A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...

Roderic A. Grupen, Jefferson A. Coelho Jr.

KDD
2010
ACM

282 views, Data Mining

Optimizing debt collections using constrained reinforcement learning

15 years 9 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at financial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....
