Search Sciweavers | Sciweavers

513 search results - page 33 / 103

» Metric learning for reinforcement learning agents

AAAI
2007

104 views, Intelligent Agents

Active Imitation Learning

15 years 8 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

AAAI
2007

68 views, Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 8 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

ICCBR
2009
Springer

134 views, Automated Reasoning

Improving Reinforcement Learning by Using Case Based Heuristics

16 years 18 days ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and ...

Reinaldo A. C. Bianchi, Raquel Ros, Ramon Ló...

PRICAI
2000
Springer

127 views, Artificial Intelligence

Constructing an Autonomous Agent with an Interdependent Heuristics

15 years 9 months ago

Download www.ai.sanken.osaka-u.ac.jp

When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...

Koichi Moriyama, Masayuki Numao

AAAI
1994

185 views, Intelligent Agents

Learning to Coordinate without Sharing Information

15 years 7 months ago

Download www.agent.ai

Researchers in the field of Distributed Artificial Intelligence (DAI) have been developing efficient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale
