Search Sciweavers | Sciweavers

1799 search results - page 69 / 360

» Filtered Reinforcement Learning

205

click to vote

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Reducing the complexity of multiagent reinforcement learning

16 years 1 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

219

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

238

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 5 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

167

click to vote

CSE
2009
IEEE

85views Theoretical Computer Science» more CSE 2009»

Reinforcement Learning of Listener Response for Mood Classification of Audio

16 years 1 months ago

Download www.oddible.com

This paper describes a method of applying a reinforcement learning artificial intelligence to categorize audio files by mood based on listener response during a performance. The s...

Jack Stockholm, Philippe Pasquier

claim paper

Read More »

160

click to vote

AI
2006
Springer

103views Artificial Intelligence» more AI 2006»

Trace Equivalence Characterization Through Reinforcement Learning

15 years 11 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...

claim paper

Read More »

« Prev « First page 69 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers