Sciweavers

1235 search results - page 171 / 247
» Reinforcement learning in a nutshell
Sort
View
ICML
2002
IEEE
14 years 11 months ago
Learning from Scarce Experience
Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...
Leonid Peshkin, Christian R. Shelton
FBIT
2007
IEEE
14 years 4 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
ROMAN
2007
IEEE
150views Robotics» more  ROMAN 2007»
14 years 4 months ago
Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent
— The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...
Andrea Lockerd Thomaz, Cynthia Breazeal
IROS
2006
IEEE
107views Robotics» more  IROS 2006»
14 years 4 months ago
Heterogeneous and Hierarchical Cooperative Learning via Combining Decision Trees
Abstract— Decision trees, being human readable and hierarchically structured, provide a suitable mean to derive state-space abstraction and simplify the inclusion of the availabl...
Masoud Asadpour, Majid Nili Ahmadabadi, Roland Sie...
AIPS
2008
14 years 14 days ago
Learning Heuristic Functions through Approximate Linear Programming
Planning problems are often formulated as heuristic search. The choice of the heuristic function plays a significant role in the performance of planning systems, but a good heuris...
Marek Petrik, Shlomo Zilberstein