Search Sciweavers | Sciweavers

651 search results - page 89 / 131

» Algorithms for Inverse Reinforcement Learning


FLAIRS
2008

132 views, Artificial Intelligence

Learning Continuous Action Models in a Real-Time Strategy Environment

15 years 8 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore


ICML
2009
IEEE

186 views, Machine Learning

Regularization and feature selection in least-squares temporal difference learning

16 years 7 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng


ICML
2001
IEEE

132 views, Machine Learning

Expectation Maximization for Weakly Labeled Data

16 years 7 months ago

Download characters.media.mit.edu

We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...

Yuri A. Ivanov, Bruce Blumberg, Alex Pentland


IDEAL
2000
Springer

105 views, Intelligent Agents

Observational Learning with Modular Networks

15 years 9 months ago

Download dmlab.snu.ac.kr

The observational learning algorithm is an ensemble algorithm in which each network is initially trained on a bootstrapped data set and virtual data are generated from the ensemble for ...

Hyunjung Shin, Hyoungjoo Lee, Sungzoon Cho


NIPS
2003

108 views, Information Technology

Policy Search by Dynamic Programming

15 years 7 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
