Search Sciweavers | Sciweavers

473 search results - page 54 / 95

» Optimal policy switching algorithms for reinforcement learni...

NIPS
1996

Multidimensional Triangulation and Interpolation for Reinforcement Learning

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

NIPS
2003

Approximate Planning in POMDPs with Macro-Actions

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

IJCAI
2007

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

Download www.ijcai.org

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefficient re-use of control knowledge acquired over the...

Mehran Asadi, Manfred Huber

IJCAI
2003

Simultaneous Adversarial Multi-Robot Learning

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

NIPS
2003

Online Learning of Non-stationary Sequences

Download books.nips.cc

We consider an online learning scenario in which the learner can make predictions on the basis of a fixed set of experts. We derive upper and lower relative loss bounds for a cla...

Claire Monteleoni, Tommi Jaakkola
