Search Sciweavers | Sciweavers

4544 search results - page 224 / 909

» Reinforcement Learning with Time

NIPS
2001

206 views, Information Technology

Model-Free Least-Squares Policy Iteration

15 years 4 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

ML
2000
ACM

150 views, Machine Learning

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

15 years 2 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

ICALT
2006
IEEE

119 views, Machine Learning

Learner as a Designer of Digital Learning Tools

15 years 9 months ago

Download csdl2.computer.org

This paper concentrates on how a teacher can support learners in creating their own learning tools. The authors discuss how the journey in creating differ...

Yasmin Bhattacharya, Madhumita Bhattacharya

JMLR
2006

118 views

Learning Factor Graphs in Polynomial Time and Sample Complexity

15 years 2 months ago

Download jmlr.csail.mit.edu

We study the computational and sample complexity of parameter and structure learning in graphical models. Our main result shows that the class of factor graphs with bounded degree...

Pieter Abbeel, Daphne Koller, Andrew Y. Ng

AAAI
2012

272 views, Intelligent Agents

Model Learning and Real-Time Tracking Using Multi-Resolution Surfel Maps

13 years 5 months ago

Download www.ais.uni-bonn.de

For interaction with its environment, a robot is required to learn models of objects and to perceive these models in the live streams from its sensors. In this paper, we propose a ...

Jörg Stückler, Sven Behnke
