Search Sciweavers | Sciweavers

4544 search results - page 94 / 909

» Reinforcement Learning with Time

158

Voted

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 11 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

171

click to vote

IJCAI
2003

188views Artificial Intelligence» more IJCAI 2003»

A Bayesian Approach to Imitation in Reinforcement Learning

15 years 7 months ago

Download ijcai.org

In multiagent environments, forms of social learning such as teaching and imitation have been shown to aid the transfer of knowledge from experts to learners in reinforcement lear...

Bob Price, Craig Boutilier

claim paper

Read More »

184

click to vote

AAAI
1998

150views Intelligent Agents» more AAAI 1998»

Tree Based Discretization for Continuous State Space Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

Reinforcement learning is an effective technique for learning action policies in discrete stochastic environments, but its efficiency can decay exponentially with the size of the ...

William T. B. Uther, Manuela M. Veloso

claim paper

Read More »

163

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 7 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

176

click to vote

RAS
2000

161views more RAS 2000»

Active object recognition by view integration and reinforcement learning

15 years 6 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz

claim paper

Read More »

« Prev « First page 94 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers