Search Sciweavers | Sciweavers

1166 search results - page 78 / 234

» Negotiating Using Rewards

213

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 7 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

188

click to vote

IAT
2009
IEEE

124views Intelligent Agents» more IAT 2009»

Topology and Memory Effect on Convention Emergence

16 years 1 months ago

Download www.iiia.csic.es

Abstract—Social conventions are useful self-sustaining protocols for groups to coordinate behavior without a centralized entity enforcing coordination. We perform an in-depth stu...

Daniel Villatoro, Sandip Sen, Jordi Sabater-Mir

claim paper

Read More »

184

click to vote

GLOBECOM
2008
IEEE

133views Communications» more GLOBECOM 2008»

Foresighted Resource Reciprocation Strategies in P2P Networks

16 years 1 months ago

Download medianetlab.ee.ucla.edu

—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

171

click to vote

ATAL
2007
Springer

95views Intelligent Agents» more ATAL 2007»

A Q-decomposition and bounded RTDP approach to resource allocation

16 years 1 months ago

Download damas.ift.ulaval.ca

This paper contributes to solve eﬀectively stochastic resource allocation problems known to be NP-Complete. To address this complex resource management problem, a Qdecomposition...

Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...

claim paper

Read More »

175

click to vote

AAAI
2006

87views Intelligent Agents» more AAAI 2006»

Decision Tree Methods for Finding Reusable MDP Homomorphisms

15 years 8 months ago

Download www.aaai.org

straction is a useful tool for agents interacting with environments. Good state abstractions are compact, reuseable, and easy to learn from sample data. This paper and extends two...

Alicia P. Wolfe, Andrew G. Barto

claim paper

Read More »

« Prev « First page 78 / 234 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers