Search Sciweavers | Sciweavers

350 search results - page 53 / 70

» Approximation Algorithms for Unique Games

163

click to vote

AAMAS
2010
Springer

158views Intelligent Agents» more AAMAS 2010»

Coordinated learning in multiagent MDPs with infinite state-space

15 years 7 months ago

Download gaips.inesc-id.pt

Abstract In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

191

click to vote

FLAIRS
2008

132views Artificial Intelligence» more FLAIRS 2008»

Learning Continuous Action Models in a Real-Time Strategy Environment

15 years 9 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore

claim paper

Read More »

171

click to vote

COCOON
2008
Springer

95views Combinatorics» more COCOON 2008»

Spectrum Bidding in Wireless Networks and Related

15 years 8 months ago

Download www.cs.iit.edu

In this paper, we study the spectrum assignment problem for wireless access networks. Opportunistic spectrum usage is a promising technology. However, it could suffer from the self...

Xiang-Yang Li, Ping Xu, ShaoJie Tang, Xiaowen Chu

claim paper

Read More »

183

click to vote

ICDCS
2009
IEEE

111views Distributed And Parallel Com...» more ICDCS 2009»

Stochastic Multicast with Network Coding

16 years 1 months ago

Download pages.cpsc.ucalgary.ca

The usage of network resources by content providers is commonly governed by Service Level Agreements (SLA) between the content provider and the network service provider. Resource ...

Ajay Gopinathan, Zongpeng Li

claim paper

Read More »

196

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

15 years 4 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

« Prev « First page 53 / 70 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers