Search Sciweavers | Sciweavers

109 search results - page 6 / 22

» Predicting Opponent Actions by Observation

click to vote

ROBOCUP
2007
Springer

167views Robotics» more ROBOCUP 2007»

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

14 years 2 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

claim paper

Read More »

click to vote

TIT
2002

116views more TIT 2002»

On delayed prediction of individual sequences

13 years 7 months ago

Download www.hpl.hp.com

Prediction of individual sequences is investigated for cases in which the decision maker observes a delayed version of the sequence, or is forced to issue his/her predictions a nu...

Marcelo J. Weinberger, Erik Ordentlich

claim paper

Read More »

click to vote

ITS
2000
Springer

140views Multimedia» more ITS 2000»

Multi-agent Negotiation to Support an Economy for Online Help and Tutoring

13 years 11 months ago

Download julita.usask.ca

We are designing a computational architecture for a "learning economy" based on personal software agents who represent users in a virtual society and assist them in find...

Chhaya Mudgal, Julita Vassileva

claim paper

Read More »

click to vote

AI
1999
Springer

264views Artificial Intelligence» more AI 1999»

Cooperative Behavior Acquisition for Mobile Robots in Dynamically Changing Real Worlds Via Vision-Based Reinforcement Learning a

13 years 7 months ago

Download www.mendeley.com

In this paper, we first discuss the meaning of physical embodiment and the complexity of the environment in the context of multi-agent learning. We then propose a vision-based rei...

Minoru Asada, Eiji Uchibe, Koh Hosoda

claim paper

Read More »

click to vote

CORR
2010
Springer

132views Education» more CORR 2010»

Calibration and Internal no-Regret with Partial Monitoring

13 years 7 months ago

Download hal.archives-ouvertes.fr

Calibrated strategies can be obtained by performing strategies that have no internal regret in some auxiliary game. Such strategies can be constructed explicitly with the use of B...

Vianney Perchet

claim paper

Read More »

« Prev « First page 6 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers