Sciweavers

779 search results - page 89 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
ACMIDC
2010
13 years 12 months ago
A collaborative approach to the design and evaluation of an interactive learning tool for children with special educational needs
We have developed an educational software tool (Aprendiendo) to reinforce the learning process of children with special educational needs. This tool makes use of a variety of inte...
Beatriz López-Mencía, David Dí...
ICML
2009
IEEE
14 years 11 months ago
Near-Bayesian exploration in polynomial time
We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...
J. Zico Kolter, Andrew Y. Ng
AAAI
2010
13 years 11 months ago
Towards Multiagent Meta-level Control
Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...
Shanjun Cheng, Anita Raja, Victor R. Lesser
COLT
2006
Springer
14 years 1 month ago
Ranking with a P-Norm Push
We are interested in supervised ranking with the following twist: our goal is to design algorithms that perform especially well near the top of the ranked list, and are only requir...
Cynthia Rudin
IROS
2006
IEEE
14 years 4 months ago
A System for Robotic Heart Surgery that Learns to Tie Knots Using Recurrent Neural Networks
Tying suture knots is a time-consuming task performed frequently during Minimally Invasive Surgery (MIS). Automating this task could greatly reduce total surgery time f...
Hermann Georg Mayer, Faustino J. Gomez, Daan Wiers...