Sciweavers

176 search results - page 7 / 36
» Optimal Sample Selection for Batch-mode Reinforcement Learni...
Sort
View
IJCAI
2007
13 years 9 months ago
Concept Sampling: Towards Systematic Selection in Large-Scale Mixed Concepts in Machine Learning
This paper addresses the problem of concept sampling. In many real-world applications, a large collection of mixed concepts is available for decision making. However, the collecti...
Yi Zhang 0010, Xiaoming Jin
ICML
2006
IEEE
14 years 8 months ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
ACMICEC
2008
ACM
272views ECommerce» more  ACMICEC 2008»
13 years 9 months ago
Adapting the interaction state model in conversational recommender systems
Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...
Tariq Mahmood, Francesco Ricci
ICML
2000
IEEE
14 years 1 days ago
A Bayesian Framework for Reinforcement Learning
The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...
Malcolm J. A. Strens
ICML
2002
IEEE
14 years 8 months ago
Coordinated Reinforcement Learning
We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...
Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...