Sciweavers

» Multiagent Reinforcement Learning: Theoretical Framework and...
KDD
2002
ACM
14 years 8 months ago
Sequential cost-sensitive decision making with reinforcement learning
Recently, there has been increasing interest in the issues of cost-sensitive learning and decision making in a variety of applications of data mining. A number of approaches have ...
Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny
WWW
2009
ACM
14 years 8 months ago
Learning to recognize reliable users and content in social media with coupled mutual reinforcement
Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yah...
Jiang Bian, Yandong Liu, Ding Zhou, Eugene Agichte...
AIIDE
2008
13 years 10 months ago
Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games
We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...
Maria Cutumisu, Duane Szafron, Michael H. Bowling,...
ICML
2003
IEEE
14 years 8 months ago
TD(0) Converges Provably Faster than the Residual Gradient Algorithm
In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...
Ralf Schoknecht, Artur Merke
EPIA
2007
Springer
14 years 1 month ago
Intelligent Farmer Agent for Multi-agent Ecological Simulations Optimization
This paper presents the development of a bivalve farmer agent interacting with a realistic ecological simulation system. The purpose of the farmer agent is to determine t...
Filipe Cruz, António Pereira, Pedro Valente...