Search Sciweavers | Sciweavers

87 search results - page 4 / 18

» Hybrid Least-Squares Algorithms for Approximate Policy Evalu...

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

CVPR
2005
IEEE

230views Computer Vision» more CVPR 2005»

Rank-R Approximation of Tensors: Using Image-as-Matrix Representation

14 years 9 months ago

Download vision.ai.uiuc.edu

We present a novel multilinear algebra based approach for reduced dimensionality representation of image ensembles. We treat an image as a matrix, instead of a vector as in tradit...

Hongcheng Wang, Narendra Ahuja

claim paper

Read More »

click to vote

CSDA
2006

84views more CSDA 2006»

Three-mode partitioning

13 years 7 months ago

Download ppw.kuleuven.be

The three-mode partitioning model is a clustering model for three-way three-mode data sets that implies a simultaneous partitioning of all three modes involved in the data. In the...

Jan Schepers, Iven Van Mechelen, Eva Ceulemans

claim paper

Read More »

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

13 years 9 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

click to vote

ITIIS
2010

138views more ITIIS 2010»

Identification of Fuzzy Inference System Based on Information Granulation

13 years 2 months ago

Download www.itiis.org

In this study, we propose a space search algorithm (SSA) and then introduce a hybrid optimization of fuzzy inference systems based on SSA and information granulation (IG). In comp...

Wei Huang, Lixin Ding, Sung-Kwun Oh, Chang-Won Jeo...

claim paper

Read More »

« Prev « First page 4 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers