Search Sciweavers | Sciweavers

313 search results - page 11 / 63

» Consistent Approximations and Approximate Functions and Grad...

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 8 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

click to vote

SAC
2009
ACM

146views Applied Computing» more SAC 2009»

A gradient oriented recombination scheme for evolution strategies

14 years 3 months ago

Download www.nec-labs.com

This paper proposes a novel recombination scheme for evolutionary algorithms, which can guide the new population generation towards the maximum increase of the objective function....

Haifeng Chen, Guofei Jiang

claim paper

Read More »

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

13 years 11 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

13 years 10 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

click to vote

COCO
1994
Springer

140views Algorithms» more COCO 1994»

Random Debaters and the Hardness of Approximating Stochastic Functions

14 years 1 months ago

Download www.cs.yale.edu

A probabilistically checkable debate system (PCDS) for a language L consists of a probabilisticpolynomial-time veri er V and a debate between Player 1, who claims that the input x ...

Anne Condon, Joan Feigenbaum, Carsten Lund, Peter ...

claim paper

Read More »

« Prev « First page 11 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers