Sciweavers

ICML
2007
IEEE

Combining online and offline knowledge in UCT

16 years 7 months ago

The UCT algorithm learns a value function online using sample-based search. The TD(λ) algorithm can learn a value function offline for the on-policy distribution. We consider three...

Sylvain Gelly, David Silver
