Search Sciweavers | Sciweavers

473 search results - page 82 / 95

» Optimal policy switching algorithms for reinforcement learni...

176

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

15 years 3 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

118

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

15 years 12 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

172

click to vote

PE
2011
Springer

215views Optimization» more PE 2011»

Energy-aware routing in the Cognitive Packet Network

15 years 20 days ago

Download san.ee.ic.ac.uk

An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is speciﬁed for the �...

Toktam Mahmoodi

claim paper

Read More »

153

click to vote

EMO
2005
Springer

107views Optimization» more EMO 2005»

Multiobjective Water Pinch Analysis of the Cuernavaca City Water Distribution Network

15 years 11 months ago

Download ccc.inaoep.mx

Water systems often allow eﬃcient water uses via water reuse and/or recirculation. Deﬁning the network layout connecting water-using processes is a complex problem which involv...

Carlos E. Mariano-Romero, Víctor Alcocer-Ya...

claim paper

Read More »

137

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

15 years 12 months ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 82 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers