Search Sciweavers | Sciweavers

175 search results - page 8 / 35

» Forgetting Reinforced Cases

234

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 9 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

197

click to vote

AAAI
1993

107views Intelligent Agents» more AAAI 1993»

Complexity Analysis of Real-Time Reinforcement Learning

15 years 8 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous realtime versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons

claim paper

Read More »

189

click to vote

ICML
2010
IEEE

282views Machine Learning» more ICML 2010»

Bayesian Multi-Task Reinforcement Learning

15 years 8 months ago

Download hal.inria.fr

We consider the problem of multi-task reinforcement learning where the learner is provided with a set of tasks, for which only a small number of samples can be generated for any g...

Alessandro Lazaric, Mohammad Ghavamzadeh

claim paper

Read More »

222

click to vote

GECCO
2005
Springer

107views Optimization» more GECCO 2005»

Minimum spanning trees made easier via multi-objective optimization

16 years 17 days ago

Download www.cs.bham.ac.uk

Many real-world problems are multi-objective optimization problems and evolutionary algorithms are quite successful on such problems. Since the task is to compute or approximate t...

Frank Neumann, Ingo Wegener

claim paper

Read More »

191

click to vote

LFCS
2009
Springer

257views Artificial Intelligence» more LFCS 2009»

ATL with Strategy Contexts and Bounded Memory

15 years 11 months ago

Download www.lsv.ens-cachan.fr

We extend the alternating-time temporal logics ATL and ATL with strategy contexts and memory constraints: the ﬁrst extension makes strategy quantiﬁers to not “forget” the s...

Thomas Brihaye, Arnaud Da Costa Lopes, Franç...

claim paper

Read More »

« Prev « First page 8 / 35 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers