Sciweavers

286 search results - page 35 / 58
» Using inaccurate models in reinforcement learning
Sort
View
FLAIRS
2009
13 years 5 months ago
Beating the Defense: Using Plan Recognition to Inform Learning Agents
In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a casebased reinforcement learner in an adversarial action selectio...
Matthew Molineaux, David W. Aha, Gita Sukthankar
CEEMAS
2005
Springer
14 years 1 months ago
A Direct Reputation Model for VO Formation
We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...
Arturo Avila-Rosas, Michael Luck
ACL
2010
13 years 5 months ago
Reading between the Lines: Learning to Map High-Level Instructions to Commands
In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...
S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...
ML
2002
ACM
100views Machine Learning» more  ML 2002»
13 years 7 months ago
Structure in the Space of Value Functions
Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...
David J. Foster, Peter Dayan
CVPR
2008
IEEE
14 years 9 months ago
Kernel-based learning of cast shadows from a physical model of light sources and surfaces for low-level segmentation
In background subtraction, cast shadows induce silhouette distortions and object fusions hindering performance of high level algorithms in scene monitoring. We introduce a nonpara...
André Zaccarin, Nicolas Martel-Brisson