Search Sciweavers | Sciweavers

473 search results - page 89 / 95

» Optimal policy switching algorithms for reinforcement learni...

155

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Multi-step environment learning classifier systems applied to hyper-heuristics

15 years 10 months ago

Download www.cs.bham.ac.uk

Heuristic Algorithms (HA) are very widely used to tackle practical problems in operations research. They are simple, easy to understand and inspire confidence. Many of these HAs a...

Javier G. Marín-Blázquez, Sonia Schu...

claim paper

Read More »

173

click to vote

ATAL
2004
Springer

168views Intelligent Agents» more ATAL 2004»

Product Distribution Theory for Control of Multi-Agent Systems

15 years 11 months ago

Download collectives.stanford.edu

Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MAS’s). First we review one motivation of PD theory, as the information-theoretic extens...

Chiu Fan Lee, David H. Wolpert

claim paper

Read More »

171

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

187

Voted

JSAC
2006

120views more JSAC 2006»

A Tutorial on Cross-Layer Optimization in Wireless Networks

15 years 6 months ago

Download wmnlab.ee.ntu.edu.tw

This tutorial paper overviews recent developments in optimization-based approaches for resource allocation problems in wireless systems. We begin by overviewing important results i...

Xiaojun Lin, Ness B. Shroff, R. Srikant

claim paper

Read More »

217

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

15 years 1 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

claim paper

Read More »

« Prev « First page 89 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers