Sciweavers

Learning equivalent action choices from demonstration
GECCO
2009
Springer
14 years 3 months ago
Novelty of behaviour as a basis for the neuro-evolution of operant reward learning
An agent that deviates from a usual or previous course of action can be said to display novel or varying behaviour. Novelty of behaviour can be seen as the result of real or appar...
Andrea Soltoggio, Ben Jones
ICML
2010
IEEE
13 years 9 months ago
Constructing States for Reinforcement Learning
POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...
M. M. Hassan Mahmud
ICRA
2009
IEEE
14 years 5 months ago
Imitation learning with generalized task descriptions
In this paper, we present an approach that allows a robot to observe, generalize, and reproduce tasks observed from multiple demonstrations. Motion capture data is recorded in ...
Clemens Eppner, Jürgen Sturm, Maren Bennewitz...
ICCV
2005
IEEE
14 years 4 months ago
Priors for People Tracking from Small Training Sets
We advocate the use of Scaled Gaussian Process Latent Variable Models (SGPLVM) to learn prior models of 3D human pose for 3D people tracking. The SGPLVM simultaneously optimizes a...
Raquel Urtasun, David J. Fleet, Aaron Hertzmann, P...
ATAL
2005
Springer
14 years 4 months ago
Theory of moves learners: towards non-myopic equilibria
In contrast to classical game theoretic analysis of simultaneous and sequential play in bimatrix games, Steven Brams has proposed an alternative framework called the Theory of Mov...
Arjita Ghosh, Sandip Sen