Search Sciweavers | Sciweavers

355 search results - page 2 / 71

» Online Learning and Exploiting Relational Models in Reinforc...

AAAI
1998

181 views | Intelligent Agents

Applying Online Search Techniques to Continuous-State Reinforcement Learning

15 years 8 months ago

Download www.autonlab.org

In this paper, we describe methods for efficiently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...

Scott Davies, Andrew Y. Ng, Andrew W. Moore

IJCAI
2001

163 views | Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

ATAL
2009
Springer

137 views | Intelligent Agents

Generalized model learning for reinforcement learning in factored domains

16 years 1 months ago

Download userweb.cs.utexas.edu

Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

ICML
1997
IEEE

135 views | Machine Learning

Expected Mistake Bound Model for On-Line Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Claude-Nicolas Fiechter

ACMICEC
2008
ACM

272 views | ECommerce

Adapting the interaction state model in conversational recommender systems

15 years 8 months ago

Download www.inf.unibz.it

Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...

Tariq Mahmood, Francesco Ricci
