Sciweavers

7853 search results - page 19 / 1571
» Learning from Each Other
Sort
View
FLAIRS
2008
13 years 11 months ago
CANDEL: An Algorithm for Same-Sentence Pronominal Resolution
This paper presents a syntactic path-based learning algorithm (CANDEL from CANDIDATE-ELIMINATION) for the coreference resolution of pronouns that have their antecedents in the sam...
Cristina Nicolae, Gabriel Nicolae
ICML
2003
IEEE
14 years 9 months ago
Q-Decomposition for Reinforcement Learning Agents
The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...
Stuart J. Russell, Andrew Zimdars
ICML
2000
IEEE
14 years 9 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
KDD
2002
ACM
179views Data Mining» more  KDD 2002»
14 years 9 months ago
From Data To Insight: The Community Of Multimedia Agents
Multimedia Data Mining requires the ability to automatically analyze and understand the content. The Community of Multimedia Agents project (COMMA) is devoted to creating an open ...
Gang Wei, Valery A. Petrushin, Anatole Gershman
ICTAI
2009
IEEE
14 years 3 months ago
Collaborative Concept Learning: Non Individualistic vs Individualistic Agents
This article addresses collaborative learning in a multiagent system: each agent revises incrementally its beliefs B (a concept representation) to keep it consistent with the whol...
Gauvain Bourgne, Dominique Bouthinon, Amal El Fall...