Search Sciweavers | Sciweavers

7015 search results - page 1093 / 1403

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

15 years 11 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

ATAL
2009
Springer

155 views, Intelligent Agents

Achieving goals in decentralized POMDPs

15 years 11 months ago

Download anytime.cs.umass.edu

Coordination of multiple agents under uncertainty in the decentralized POMDP model is known to be NEXP-complete, even when the agents have a joint set of goals. Nevertheless, we s...

Christopher Amato, Shlomo Zilberstein

ATAL
2009
Springer

146 views, Intelligent Agents

Transfer via soft homomorphisms

15 years 11 months ago

Download www.eecs.umich.edu

The field of transfer learning aims to speed up learning across multiple related tasks by transferring knowledge between source and target tasks. Past work has shown that when th...

Jonathan Sorg, Satinder Singh

ESA
2009
Springer

92 views, Algorithms

Minimizing Movement: Fixed-Parameter Tractability

15 years 11 months ago

Download www-math.mit.edu

Abstract. We study an extensive class of movement minimization problems which arise from many practical scenarios but so far have little theoretical study. In general, these proble...

Erik D. Demaine, MohammadTaghi Hajiaghayi, Dá...

SAGT
2009
Springer

155 views, Game Theory

Anarchy, Stability, and Utopia: Creating Better Matchings

15 years 11 months ago

Download www.cs.rpi.edu

We consider the loss in social welfare caused by individual rationality in matching scenarios. We give both theoretical and experimental results comparing stable matchings with soc...

Elliot Anshelevich, Sanmay Das, Yonatan Naamad
