Search Sciweavers | Sciweavers

176 search results - page 7 / 36

» Optimal Sample Selection for Batch-mode Reinforcement Learni...

175

Voted

IJCAI
2007

196views Artificial Intelligence» more IJCAI 2007»

Concept Sampling: Towards Systematic Selection in Large-Scale Mixed Concepts in Machine Learning

15 years 8 months ago

Download www.cs.cmu.edu

This paper addresses the problem of concept sampling. In many real-world applications, a large collection of mixed concepts is available for decision making. However, the collecti...

Yi Zhang 0010, Xiaoming Jin

claim paper

Read More »

171

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 7 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

206

Voted

ACMICEC
2008
ACM

272views ECommerce» more ACMICEC 2008»

Adapting the interaction state model in conversational recommender systems

15 years 8 months ago

Download www.inf.unibz.it

Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

187

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

15 years 11 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

169

click to vote

ICML
2002
IEEE

133views Machine Learning» more ICML 2002»

Coordinated Reinforcement Learning

16 years 7 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

claim paper

Read More »

« Prev « First page 7 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers