Search Sciweavers | Sciweavers

4544 search results - page 84 / 909

» Reinforcement Learning with Time

163

click to vote

SIGCSE
1998
ACM

131views Education» more SIGCSE 1998»

Animation, visualization, and interaction in CS 1 assignments

15 years 10 months ago

Download www.cs.duke.edu

Programs that use animations or visualizations attract student interest and offer feedback that can enhance different learning styles as students work to master programming and pr...

Owen L. Astrachan, Susan H. Rodger

claim paper

Read More »

136

click to vote

CACM
2010

105views more CACM 2010»

Censored exploration and the dark pool problem

15 years 6 months ago

Download www.cis.upenn.edu

We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove t...

Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, J...

claim paper

Read More »

173

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

165

click to vote

ICML
2005
IEEE

99views Machine Learning» more ICML 2005»

Identifying useful subgoals in reinforcement learning by local graph partitioning

16 years 7 months ago

Download www-anw.cs.umass.edu

We present a new subgoal-based method for automatically creating useful skills in reinforcement learning. Our method identifies subgoals by partitioning local state transition gra...

Özgür Simsek, Alicia P. Wolfe, Andrew G....

claim paper

Read More »

177

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 6 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

« Prev « First page 84 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers