Sciweavers

Search results for "Predicting Opponent Actions by Observation"
ROBOCUP
2007
Springer
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others
Existing reinforcement learning approaches suffer from the curse of dimensionality when they are applied to multiagent dynamic environments. One of the typical...
Kentarou Noma, Yasutake Takahashi, Minoru Asada
TIT
2002
On delayed prediction of individual sequences
Prediction of individual sequences is investigated for cases in which the decision maker observes a delayed version of the sequence, or is forced to issue his/her predictions a nu...
Marcelo J. Weinberger, Erik Ordentlich
ITS
2000
Springer
Multi-agent Negotiation to Support an Economy for Online Help and Tutoring
We are designing a computational architecture for a "learning economy" based on personal software agents who represent users in a virtual society and assist them in find...
Chhaya Mudgal, Julita Vassileva
AI
1999
Springer
Cooperative Behavior Acquisition for Mobile Robots in Dynamically Changing Real Worlds Via Vision-Based Reinforcement Learning a
In this paper, we first discuss the meaning of physical embodiment and the complexity of the environment in the context of multi-agent learning. We then propose a vision-based rei...
Minoru Asada, Eiji Uchibe, Koh Hosoda
CORR
2010
Springer
Calibration and Internal no-Regret with Partial Monitoring
Calibrated strategies can be obtained by performing strategies that have no internal regret in some auxiliary game. Such strategies can be constructed explicitly with the use of B...
Vianney Perchet