Search Sciweavers | Sciweavers

286 search results - page 51 / 58

» Using inaccurate models in reinforcement learning


ICML
2009
IEEE

155 views | Machine Learning

Near-Bayesian exploration in polynomial time

16 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng



AAAI
2010

154 views | Intelligent Agents

Towards Multiagent Meta-level Control

15 years 9 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser



NIPS
2001

106 views | Information Technology

Improvisation and Learning

15 years 8 months ago

Download books.nips.cc

This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...

Judy A. Franklin



ICML
1999
IEEE

168 views | Machine Learning

Least-Squares Temporal Difference Learning

16 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan



ATAL
2006
Springer

133 views | Intelligent Agents

Scalable and reliable data delivery in mobile ad hoc sensor networks

15 years 11 months ago

Download www.cs.cmu.edu

This paper studies scalable data delivery algorithms in mobile ad hoc sensor networks with node and link failures. Many algorithms have been developed for data delivery and fusion...

Bin Yu, Paul Scerri, Katia P. Sycara, Yang Xu, Mic...
