Search Sciweavers | Sciweavers

1233 search results - page 59 / 247

» Reinforcement learning

172

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

16 years 4 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

188

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 6 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

166

click to vote

AUSAI
2008
Springer

105views Artificial Intelligence» more AUSAI 2008»

Partial Order Hierarchical Reinforcement Learning

15 years 8 months ago

Download www.cse.unsw.edu.au

In this paper the notion of a partial-order plan is extended to task-hierarchies. We introduce the concept of a partial-order taskhierarchy that decomposes a problem using multi-ta...

Bernhard Hengst

claim paper

Read More »

150

click to vote

NECO
2010

103views more NECO 2010»

Posterior Weighted Reinforcement Learning with State Uncertainty

15 years 4 months ago

Download www.maths.bris.ac.uk

Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...

Tobias Larsen, David S. Leslie, Edmund J. Collins,...

claim paper

Read More »

174

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 59 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers