Sciweavers

» Reinforcement Learning with Time
HT
2009
ACM
15 years 9 months ago
Improving recommender systems with adaptive conversational strategies
Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...
Tariq Mahmood, Francesco Ricci
ATAL
2007
Springer
15 years 9 months ago
Dynamic task allocation within an open service-oriented MAS architecture
A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...
Ivan Jureta, Stéphane Faulkner, Youssef Ach...
GECCO
2010
Springer
15 years 6 months ago
Multi-task evolutionary shaping without pre-specified representations
Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...
Matthijs Snel, Shimon Whiteson
ATAL
2008
Springer
15 years 5 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
HIS
2008
15 years 4 months ago
New Crossover Operator for Evolutionary Rule Discovery in XCS
XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve rule sets on-line by means of the interaction with an envi...
Sergio Morales-Ortigosa, Albert Orriols-Puig, Este...