Sciweavers

AROBOTS
1999
104views more  AROBOTS 1999»
13 years 11 months ago
Reinforcement Learning Soccer Teams with Incomplete World Models
We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may pro t signi cantly from world models (WMs) estimating state transition probabilities an...
Marco Wiering, Rafal Salustowicz, Jürgen Schm...
CORR
2010
Springer
127views Education» more  CORR 2010»
13 years 11 months ago
Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards
We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...
Cem Tekin, Mingyan Liu
NIPS
2003
14 years 17 days ago
Robustness in Markov Decision Problems with Uncertain Transition Matrices
Optimal solutions to Markov Decision Problems (MDPs) are very sensitive with respect to the state transition probabilities. In many practical problems, the estimation of those pro...
Arnab Nilim, Laurent El Ghaoui
IJCAI
2007
14 years 19 days ago
Dynamically Weighted Hidden Markov Model for Spam Deobfuscation
Spam deobfuscation is a processing to detect obfuscated words appeared in spam emails and to convert them back to the original words for correct recognition. Lexicon tree hidden M...
Seunghak Lee, Iryoung Jeong, Seungjin Choi
ICDAR
2003
IEEE
14 years 4 months ago
An HMM On-line Signature Verifier Incorporating Signature Trajectories
Authentication of individuals is rapidly becoming an important issue. On-line signature verification is one of the methods that use biometric features. This paper proposes a new H...
Daigo Muramatsu, Takashi Matsumoto
ROBOCUP
2004
Springer
114views Robotics» more  ROBOCUP 2004»
14 years 4 months ago
Modular Learning System and Scheduling for Behavior Acquisition in Multi-agent Environment
The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since othe...
Yasutake Takahashi, Kazuhiro Edazawa, Minoru Asada