Sciweavers

1674 search results - page 134 / 335
» Learning Actions From the Web
CORR
1998
Springer
15 years 3 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
IROS
2009
IEEE
15 years 10 months ago
Interactive learning of visually symmetric objects
This paper describes a robotic system that learns visual models of symmetric objects autonomously. Our robot learns by physically interacting with an object using its end effec...
Wai Ho Li, Lindsay Kleeman
ICML
2010
IEEE
15 years 5 months ago
Music Plus One and Machine Learning
A system for musical accompaniment is presented in which a computer-driven orchestra follows and learns from a soloist in a concerto-like setting. The system is decomposed into th...
Christopher Raphael
ICML
2007
IEEE
16 years 4 months ago
Percentile optimization in uncertain Markov decision processes with application to efficient exploration
Markov decision processes are an effective tool in modeling decision-making in uncertain dynamic environments. Since the parameters of these models are typically estimated from da...
Erick Delage, Shie Mannor
KCAP
2005
ACM
15 years 9 months ago
Collecting paraphrase corpora from volunteer contributors
Extensive and deep paraphrase corpora are important for a variety of natural language processing and user interaction tasks. In this paper, we present an approach which i) collect...
Timothy Chklovski