Sciweavers

4544 search results - page 148 / 909
» Reinforcement Learning with Time
Sort
View
ACMICEC
2008
ACM
272views ECommerce» more  ACMICEC 2008»
13 years 12 months ago
Adapting the interaction state model in conversational recommender systems
Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...
Tariq Mahmood, Francesco Ricci
ATAL
2006
Springer
14 years 1 months ago
Learning to cooperate in multi-agent social dilemmas
In many Multi-Agent Systems (MAS), agents (even if selfinterested) need to cooperate in order to maximize their own utilities. Most of the multi-agent learning algorithms focus on...
Jose Enrique Munoz de Cote, Alessandro Lazaric, Ma...
ML
1998
ACM
148views Machine Learning» more  ML 1998»
13 years 9 months ago
Colearning in Differential Games
Game playing has been a popular problem area for research in artificial intelligence and machine learning for many years. In almost every study of game playing and machine learnin...
John W. Sheppard
ALT
2006
Springer
14 years 7 months ago
Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...
Daniil Ryabko, Marcus Hutter
CEC
2010
IEEE
13 years 11 months ago
Generating a novel sort algorithm using Reinforcement Programming
Abstract-- Reinforcement Programming (RP) is a new approach to automatically generating algorithms, that uses reinforcement learning techniques. This paper describes the RP approac...
Spencer K. White, Tony R. Martinez, George L. Rudo...