Sciweavers

2487 search results - page 12 / 498
» Automatic Model Selection by Modelling the Distribution of R...
Sort
View
AAAI
2008
13 years 11 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...