Search Sciweavers | Sciweavers

52 search results - page 8 / 11

» Approximate Convex Optimization by Online Game Playing

AAMAS
2010
Springer

158 views, Intelligent Agents

Coordinated learning in multiagent MDPs with infinite state-space

13 years 7 months ago

Download gaips.inesc-id.pt

In this paper, we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

ATAL
2007
Springer

112 views, Intelligent Agents

A globally optimal algorithm for TTD-MDPs

14 years 1 months ago

Download www.cc.gatech.edu

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs), a variant of MDPs in which the goal is to realize a specified distrib...

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...

AAIM
2008
Springer

94 views, Algorithms

Speed Scaling with a Solar Cell

14 years 1 months ago

Download www.cs.pitt.edu

We consider the setting of a device that obtains its energy from a battery and some regenerative source such as a solar cell. We consider the speed scaling problem of scheduling a c...

Nikhil Bansal, Ho-Leung Chan, Kirk Pruhs

SI3D
2005
ACM

146 views, Computer Graphics

User interfaces for interactive control of physics-based 3D characters

14 years 1 months ago

Download www.cs.ubc.ca

We present two user interfaces for the interactive control of dynamically-simulated characters. The first interface uses an ‘action palette’ and targets sports prototyping ap...

Peng Zhao, Michiel van de Panne

ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

14 years 1 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson
