Sciweavers

1233 search results - page 177 / 247
» Feudal Reinforcement Learning
ML
2008
ACM
13 years 8 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, Ré...
AIMSA
2006
Springer
13 years 12 months ago
Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying
Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...
Olivier Pietquin
ECAI
2008
Springer
13 years 10 months ago
Exploiting locality of interactions using a policy-gradient approach in multiagent learning
In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...
Francisco S. Melo
DAGM
2007
Springer
14 years 5 days ago
Efficient Learning of Neural Networks with Evolutionary Algorithms
Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...
Nils T. Siebel, Jochen Krause, Gerald Sommer
SIGCSE
2009
ACM
14 years 8 months ago
Implications of integrating test-driven development into CS1/CS2 curricula
Many academic and industry professionals have called for more testing in computer science curricula. Test-driven development (TDD) has been proposed as a solution to improve testi...
Chetan Desai, David S. Janzen, John Clements