Search Sciweavers | Sciweavers

201

Publication

352views

Efficient methods for near-optimal sequential decision making under uncertainty

16 years 2 months ago

This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal se...

Christos Dimitrakakis

posted by olethros

Read More »

148

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

15 years 11 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

170

click to vote

AAAI
2007

122views Intelligent Agents» more AAAI 2007»

RETALIATE: Learning Winning Policies in First-Person Shooter Games

15 years 8 months ago

Download www.cse.lehigh.edu

In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial chara...

Megan Smith, Stephen Lee-Urban, Hector Muño...

claim paper

Read More »

210

Voted

COST
2009
Springer

185views Multimedia» more COST 2009»

How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?

15 years 4 months ago

Download gaussier.free.fr

Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...

Ken Prepin, Philippe Gaussier

claim paper

Read More »

180

Voted

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers