Search Sciweavers | Sciweavers

36 search results - page 3 / 8

» Posterior Weighted Reinforcement Learning with State Uncerta...

click to vote

AIPS
2006

141views Artificial Intelligence» more AIPS 2006»

Combining Stochastic Task Models with Reinforcement Learning for Dynamic Scheduling

13 years 9 months ago

Download www.aaai.org

We view dynamic scheduling as a sequential decision problem. Firstly, we introduce a generalized planning operator, the stochastic task model (STM), which predicts the effects of ...

Malcolm J. A. Strens

claim paper

Read More »

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

13 years 9 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

14 years 2 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

click to vote

DIS
2004
Springer

105views Theoretical Computer Science» more DIS 2004»

Maximum a Posteriori Tree Augmented Naive Bayes Classifiers

13 years 11 months ago

Download www.iiia.csic.es

Bayesian classifiers such as Naive Bayes or Tree Augmented Naive Bayes (TAN) have shown excellent performance given their simplicity and heavy underlying independence assumptions....

Jesús Cerquides, Ramon López de M&aa...

claim paper

Read More »

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

13 years 9 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers