Search Sciweavers | Sciweavers

1512 search results - page 82 / 303

» Qualitative reinforcement learning

ML
2002
ACM

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 5 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

ICONIP
2009

Tracking in Reinforcement Learning

15 years 3 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

CORR
2012
Springer

PAC-Bayesian Policy Evaluation for Reinforcement Learning

14 years 1 months ago

Download www.cs.mcgill.ca

Bayesian priors offer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, lar...

Mahdi Milani Fard, Joelle Pineau, Csaba Szepesv...

AGENTS
2001
Springer

Hierarchical multi-agent reinforcement learning

15 years 10 months ago

Download www-anw.cs.umass.edu

In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multi-agent tasks. We introduce a hierarchical multi-a...

Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamz...

AGENTS
1999
Springer

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 10 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso
