Search Sciweavers | Sciweavers

779 search results - page 56 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

121

Voted

AAAI
1994

185views Intelligent Agents» more AAAI 1994»

Learning to Coordinate without Sharing Information

15 years 3 months ago

Download www.agent.ai

Researchers in the eld of Distributed Arti cial Intelligence (DAI) have been developing e cient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

claim paper

Read More »

135

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 3 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

179

Voted

AGENTS
2001
Springer

247views Security Privacy» more AGENTS 2001»

Hierarchical multi-agent reinforcement learning

15 years 6 months ago

Download www-anw.cs.umass.edu

In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multi-agent tasks. We introduce a hierarchical multi-a...

Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamz...

claim paper

Read More »

130

click to vote

MICAI
2010
Springer

214views Artificial Intelligence» more MICAI 2010»

Supervised Machine Learning for Predicting the Meaning of Verb-Noun Combinations in Spanish

15 years 24 days ago

Download www.gelbukh.com

The meaning of such verb-noun combinations as take care, undertake work, pay attention can be generalized as DO what is designated by the noun. Likewise, the meaning of make a deci...

Olga Kolesnikova, Alexander F. Gelbukh

claim paper

Read More »

138

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 3 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

« Prev « First page 56 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers