Search Sciweavers | Sciweavers

473 search results - page 77 / 95

» Optimal policy switching algorithms for reinforcement learni...

158

click to vote

GECCO
2005
Springer

139views Optimization» more GECCO 2005»

Event-driven learning classifier systems for online soccer games

15 years 11 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

claim paper

Read More »

153

click to vote

ATAL
2010
Springer

134views Intelligent Agents» more ATAL 2010»

Cultivating desired behaviour: policy teaching via environment-dynamics tweaks

15 years 6 months ago

Download eprints.ecs.soton.ac.uk

In this paper we study, for the first time explicitly, the implications of endowing an interested party (i.e. a teacher) with the ability to modify the underlying dynamics of the ...

Zinovi Rabinovich, Lachlan Dufton, Kate Larson, Ni...

claim paper

Read More »

174

click to vote

COLT
2007
Springer

143views Machine Learning» more COLT 2007»

Bounded Parameter Markov Decision Processes with Average Reward Criterion

15 years 12 months ago

Download ttic.uchicago.edu

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, t...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

170

click to vote

PE
2010
Springer

114views Optimization» more PE 2010»

Analysis of scheduling policies under correlated job sizes

15 years 4 months ago

Download www.cs.cmu.edu

Correlations in traﬃc patterns are an important facet of the workloads faced by real systems, and one that has far-reaching consequences on the performance and optimization of t...

Varun Gupta, Michelle Burroughs, Mor Harchol-Balte...

claim paper

Read More »

149

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 7 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

« Prev « First page 77 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers