Sciweavers

710 search results - page 64 / 142
» Online Learning with Prior Knowledge
Sort
View
145
Voted
IJCAI
2001
15 years 5 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
130
Voted
ICVS
2009
Springer
15 years 10 months ago
Learning Objects and Grasp Affordances through Autonomous Exploration
Abstract. We describe a system for autonomous learning of visual object representations and their grasp affordances on a robot-vision system. It segments objects by grasping and mo...
Dirk Kraft, Renaud Detry, Nicolas Pugeault, Emre B...
ICML
2009
IEEE
16 years 4 months ago
Piecewise-stationary bandit problems with side observations
We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may c...
Jia Yuan Yu, Shie Mannor
116
Voted
MLMI
2007
Springer
15 years 9 months ago
Microphone Array Beamforming Approach to Blind Speech Separation
In this paper, we present a microphone array beamforming approach to blind speech separation. Unlike previous beamforming approaches, our system does not require a-priori knowledge...
Ivan Himawan, Iain McCowan, Mike Lincoln
131
Voted
VISAPP
2007
15 years 4 months ago
Extraction of multi-modal object representations in a robot vision system
We introduce one module in a cognitive system that learns the shape of objects by active exploration. More specifically, we propose a feature tracking scheme that makes use of the...
Nicolas Pugeault, Emre Baseski, Dirk Kraft, Floren...