Search Sciweavers | Sciweavers

388 search results - page 60 / 78

» Learning to Optimize Plan Execution in Information Agents

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

13 years 5 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

14 years 8 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

click to vote

WSC
2000

142views Modeling And Simulation» more WSC 2000»

Interactive Web-based animations for teaching and learning

13 years 8 months ago

Download www.informs-sim.org

Web-based study resources can be viewed as a basic requirement in order to remain a competitive player on a more and more globalised educational market. For that reason it is gett...

Michael Syrjakow, Jörg Berdux, Helena Szczerb...

claim paper

Read More »

click to vote

JMLR
2010

149views more JMLR 2010»

Coherent Inference on Optimal Play in Game Trees

13 years 2 months ago

Download jmlr.csail.mit.edu

Round-based games are an instance of discrete planning problems. Some of the best contemporary game tree search algorithms use random roll-outs as data. Relying on a good policy, ...

Philipp Hennig, David H. Stern, Thore Graepel

claim paper

Read More »

click to vote

IUI
2010
ACM

224views Software Engineering» more IUI 2010»

Agent-assisted task management that reduces email overload

14 years 4 months ago

Download www.cs.cmu.edu

RADAR is a multiagent system with a mixed-initiative user interface designed to help office workers cope with email overload. RADAR agents observe experts to learn models of their...

Aaron Steinfeld, Andrew Faulring, Asim Smailagic, ...

claim paper

Read More »

« Prev « First page 60 / 78 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers