Sciweavers

119 search results - page 11 / 24
» Average Reward Timed Games
Sort
View
PRIMA
2009
Springer
14 years 2 months ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
DIGITEL
2008
IEEE
14 years 2 months ago
Games as Skins for Online Tests
: Games play a dual role: they test the player’s competence, and at the same time provide learning opportunities. They offer a simple form of a reward – the pleasure of playing...
Srinivasan Ramani, Venkatagiri Sirigiri, Nila Lohi...
CORR
2010
Springer
171views Education» more  CORR 2010»
13 years 2 months ago
Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach
We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...
Cem Tekin, Mingyan Liu
IWEC
2004
13 years 9 months ago
Enhancing the Performance of Dynamic Scripting in Computer Games
Unsupervised online learning in commercial computer games allows computer-controlled opponents to adapt to the way the game is being played. As such it provides a mechanism to deal...
Pieter Spronck, Ida G. Sprinkhuizen-Kuyper, Eric O...
WINE
2010
Springer
152views Economy» more  WINE 2010»
13 years 5 months ago
Collusion in VCG Path Procurement Auctions
We consider collusion in path procurement auctions, where payments are determined using the VCG mechanism. We show that collusion can increase the utility of the agents, and in som...
Yoram Bachrach, Peter Key, Morteza Zadimoghaddam