Sciweavers

995 search results - page 3 / 199
» Learning Useful Horn Approximations
Sort
View
CDC
2009
IEEE
172views Control Systems» more  CDC 2009»
14 years 3 days ago
Approximate dynamic programming using fluid and diffusion approximations with applications to power management
—TD learning and its refinements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only...
Wei Chen, Dayu Huang, Ankur A. Kulkarni, Jayakrish...
IBPRIA
2007
Springer
13 years 9 months ago
Automatic Learning of Conceptual Knowledge in Image Sequences for Human Behavior Interpretation
This work describes an approach for the interpretation and explanation of human behavior in image sequences, within the context of a Cognitive Vision System. The information source...
Pau Baiget, Carles Fernández Tena, F. Xavie...
ICML
2008
IEEE
14 years 8 months ago
Democratic approximation of lexicographic preference models
Previous algorithms for learning lexicographic preference models (LPMs) produce a "best guess" LPM that is consistent with the observations. Our approach is more democra...
Fusun Yaman, Thomas J. Walsh, Michael L. Littman, ...
ICCS
2007
Springer
14 years 1 months ago
Characterizing Implications of Injective Partial Orders
Abstract. Previous work of the authors has studied a notion of implication between sets of sequences based on the conceptual structure of a Galois lattice, and also a way of repres...
José L. Balcázar, Gemma C. Garriga
NIPS
2001
13 years 8 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar