Search Sciweavers | Sciweavers

188

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 10 months ago

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

159

click to vote

COLT
2007
Springer

104views Machine Learning» more COLT 2007»

Observational Learning in Random Networks

16 years 25 days ago

Download www.as.inf.ethz.ch

In the standard model of observational learning, n agents sequentially decide between two alternatives a or b, one of which is objectively superior. Their choice is based on a stoc...

Julian Lorenz, Martin Marciniszyn, Angelika Steger

claim paper

Read More »

150

click to vote

ICCBR
2010
Springer

261views Automated Reasoning» more ICCBR 2010»

Imitating Inscrutable Enemies: Learning from Stochastic Policy Observation, Retrieval and Reuse

15 years 10 months ago

Download www.cse.lehigh.edu

In this paper we study the topic of CBR systems learning from observations in which those observations can be represented as stochastic policies. We describe a general framework wh...

Kellen Gillespie, Justin Karneeb, Stephen Lee-Urba...

claim paper

Read More »

171

click to vote

IDEAL
2000
Springer

105views Intelligent Agents» more IDEAL 2000»

Observational Learning with Modular Networks

15 years 10 months ago

Download dmlab.snu.ac.kr

Observational learning algorithm is an ensemble algorithm where each network is initially trained with a bootstrapped data set and virtual data are generated from the ensemble for ...

Hyunjung Shin, Hyoungjoo Lee, Sungzoon Cho

claim paper

Read More »

174

click to vote

ICML
1999
IEEE

112views Machine Learning» more ICML 1999»

Learning Hierarchical Performance Knowledge by Observation

15 years 11 months ago

Download ai.eecs.umich.edu

Developing automated agents that intelligently perform complex real world tasks is time consuming and expensive. The most expensive part of developing these intelligent task perfo...

Michael van Lent, John E. Laird

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers