Search Sciweavers | Sciweavers

369 search results - page 3 / 74

» Global Optimization for Value Function Approximation

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

15 years 9 months ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unknown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 4 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

AAAI
2006

126views Intelligent Agents» more AAAI 2006»

Functional Value Iteration for Decision-Theoretic Planning with General Utility Functions

15 years 3 months ago

Download www.aaai.org

We study how to find plans that maximize the expected total utility for a given MDP, a planning objective that is important for decision making in high-stakes domains. The optimal...

Yaxin Liu, Sven Koenig

SIAMCO
2002

121views more SIAMCO 2002»

Consistent Approximations and Approximate Functions and Gradients in Optimal Control

15 years 2 months ago

Download www.ann.jussieu.fr

As shown in [7], optimal control problems with either ODE or PDE dynamics can be solved efficiently using a setting of consistent approximations obtained by numerical discretizati...

Olivier Pironneau, Elijah Polak

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 3 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson
