Search Sciweavers | Sciweavers

3381 search results - page 342 / 677

ATAL
2008
Springer

138 views, Intelligent Agents

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

AAAI
2010

382 views, Intelligent Agents

Unsupervised Learning of Event Classes from Video

15 years 7 months ago

Download www.comp.leeds.ac.uk

We present a method for unsupervised learning of event classes from videos in which multiple actions might occur simultaneously. It is assumed that all such activities are produce...

Muralikrishna Sridhar, Anthony G. Cohn, David C. H...

IJCAI
2007

186 views, Artificial Intelligence

Online Speed Adaptation Using Supervised Learning for High-Speed, Off-Road Autonomous Driving

15 years 7 months ago

Download hoffmann.stanford.edu

The mobile robotics community has traditionally addressed motion planning and navigation in terms of steering decisions. However, selecting the best speed is also important – be...

David Stavens, Gabriel Hoffmann, Sebastian Thrun

NIPS
2007

142 views, Information Technology

Multi-Task Learning via Conic Programming

15 years 7 months ago

Download sugiyama-www.cs.titech.ac.jp

When we have several related tasks, solving them simultaneously is shown to be more effective than solving them individually. This approach is called multi-task learning (MTL) and...

Tsuyoshi Kato, Hisashi Kashima, Masashi Sugiyama, ...

IJCAI
2001

163 views, Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 7 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar
