Search Sciweavers | Sciweavers

995 search results - page 3 / 199

» Learning Useful Horn Approximations

183

click to vote

CDC
2009
IEEE

172views Control Systems» more CDC 2009»

Approximate dynamic programming using fluid and diffusion approximations with applications to power management

15 years 11 months ago

Download www.cs.caltech.edu

—TD learning and its reﬁnements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only...

Wei Chen, Dayu Huang, Ankur A. Kulkarni, Jayakrish...

claim paper

Read More »

253

click to vote

IBPRIA
2007
Springer

233views Pattern Recognition» more IBPRIA 2007»

Automatic Learning of Conceptual Knowledge in Image Sequences for Human Behavior Interpretation

15 years 8 months ago

Download iselab.cvc.uab.es

This work describes an approach for the interpretation and explanation of human behavior in image sequences, within the context of a Cognitive Vision System. The information source...

Pau Baiget, Carles Fernández Tena, F. Xavie...

claim paper

Read More »

135

click to vote

ICML
2008
IEEE

113views Machine Learning» more ICML 2008»

Democratic approximation of lexicographic preference models

16 years 7 months ago

Download maple.cs.umbc.edu

Previous algorithms for learning lexicographic preference models (LPMs) produce a "best guess" LPM that is consistent with the observations. Our approach is more democra...

Fusun Yaman, Thomas J. Walsh, Michael L. Littman, ...

claim paper

Read More »

212

click to vote

ICCS
2007
Springer

185views Applied Computing» more ICCS 2007»

Characterizing Implications of Injective Partial Orders

16 years 1 months ago

Download www.lsi.upc.es

Abstract. Previous work of the authors has studied a notion of implication between sets of sequences based on the conceptual structure of a Galois lattice, and also a way of repres...

José L. Balcázar, Gemma C. Garriga

claim paper

Read More »

170

click to vote

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

« Prev « First page 3 / 199 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers