Sciweavers

355 search results - page 2 / 71
» Online Learning and Exploiting Relational Models in Reinforc...
AAAI
1998
15 years 4 months ago
Applying Online Search Techniques to Continuous-State Reinforcement Learning
In this paper, we describe methods for efficiently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...
Scott Davies, Andrew Y. Ng, Andrew W. Moore
IJCAI
2001
15 years 4 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
ATAL
2009
Springer
15 years 10 months ago
Generalized model learning for reinforcement learning in factored domains
Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...
Todd Hester, Peter Stone
ICML
1997
IEEE
16 years 4 months ago
Expected Mistake Bound Model for On-Line Reinforcement Learning
Claude-Nicolas Fiechter
ACMICEC
2008
ACM
15 years 5 months ago
Adapting the interaction state model in conversational recommender systems
Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...
Tariq Mahmood, Francesco Ricci