Sciweavers

1512 search results - page 150 / 303
» Qualitative reinforcement learning
Sort
View
ML
2008
ACM
152views Machine Learning» more  ML 2008»
13 years 9 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, R&ea...
AIMSA
2006
Springer
14 years 1 months ago
Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying
Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...
Olivier Pietquin
ECAI
2008
Springer
13 years 11 months ago
Exploiting locality of interactions using a policy-gradient approach in multiagent learning
In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...
Francisco S. Melo
ICCBR
2005
Springer
14 years 2 months ago
Opportunities for CBR in Learning by Doing
In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...
Pedro Pablo Gómez-Martín, Marco Anto...
DAGM
2007
Springer
14 years 1 months ago
Efficient Learning of Neural Networks with Evolutionary Algorithms
Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...
Nils T. Siebel, Jochen Krause, Gerald Sommer