Search Sciweavers | Sciweavers

494 search results - page 5 / 99

» Evaluating a Reinforcement Learning Algorithm with a General...

126

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 4 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

129

Voted

AGI
2011

286views Artificial Intelligence» more AGI 2011»

Comparing Humans and AI Agents

14 years 6 months ago

Download users.dsic.upv.es

Comparing humans and machines is one important source of information about both machine and human strengths and limitations. Most of these comparisons and competitions are performe...

Javier Insa-Cabrera, David L. Dowe, Sergio Espa&nt...

claim paper

Read More »

114

Voted

ATAL
2004
Springer

105views Intelligent Agents» more ATAL 2004»

Best-Response Multiagent Learning in Non-Stationary Environments

15 years 8 months ago

Download www.odu.edu

This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...

Michael Weinberg, Jeffrey S. Rosenschein

claim paper

Read More »

126

Voted

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 6 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

177

Voted

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

« Prev « First page 5 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers