Sciweavers

358 search results - page 29 / 72
» Online Testing with Reinforcement Learning
Sort
View
PRICAI
2000
Springer
14 years 1 months ago
Constructing an Autonomous Agent with an Interdependent Heuristics
When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...
Koichi Moriyama, Masayuki Numao
AIS
2006
Springer
13 years 10 months ago
Context enhancement for co-intentionality and co-reference in asynchronous CMC
The regulative and semantic `distance' of electronic conferencing may impede the topical alignment and the unambiguous interpretation of messages, hindering collaborative lear...
J. van der Pol, Wilfried Admiraal, P. Simons
KDD
2010
ACM
289views Data Mining» more  KDD 2010»
13 years 7 months ago
Exploitation and exploration in a performance based contextual advertising system
The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...
Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...
GECCO
2009
Springer
150views Optimization» more  GECCO 2009»
14 years 4 months ago
Discrete dynamical genetic programming in XCS
A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...
Richard Preen, Larry Bull
TSMC
2002
136views more  TSMC 2002»
13 years 9 months ago
Expertness based cooperative Q-learning
By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...
Majid Nili Ahmadabadi, Masoud Asadpour