active policy iteration

201

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

15 years 1 months ago

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers