Search Sciweavers | Sciweavers

1235 search results - page 171 / 247

» Reinforcement learning in a nutshell

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

14 years 11 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

click to vote

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

14 years 4 months ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

click to vote

ROMAN
2007
IEEE

150views Robotics» more ROMAN 2007»

Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent

14 years 4 months ago

Download robotic.media.mit.edu

— The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

click to vote

IROS
2006
IEEE

107views Robotics» more IROS 2006»

Heterogeneous and Hierarchical Cooperative Learning via Combining Decision Trees

14 years 4 months ago

Download birg2.epfl.ch

Abstract— Decision trees, being human readable and hierarchically structured, provide a suitable mean to derive state-space abstraction and simplify the inclusion of the availabl...

Masoud Asadpour, Majid Nili Ahmadabadi, Roland Sie...

claim paper

Read More »

click to vote

AIPS
2008

95views Artificial Intelligence» more AIPS 2008»

Learning Heuristic Functions through Approximate Linear Programming

14 years 14 days ago

Download anytime.cs.umass.edu

Planning problems are often formulated as heuristic search. The choice of the heuristic function plays a significant role in the performance of planning systems, but a good heuris...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

« Prev « First page 171 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers