Search Sciweavers | Sciweavers

1512 search results - page 150 / 303

» Qualitative reinforcement learning

184

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 5 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

177

click to vote

AIMSA
2006
Springer

159views Artificial Intelligence» more AIMSA 2006»

Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying

15 years 9 months ago

Download tcts.fpms.ac.be

Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...

Olivier Pietquin

claim paper

Read More »

140

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 7 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

118

click to vote

ICCBR
2005
Springer

91views Automated Reasoning» more ICCBR 2005»

Opportunities for CBR in Learning by Doing

15 years 11 months ago

Download gaia.fdi.ucm.es

In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...

Pedro Pablo Gómez-Martín, Marco Anto...

claim paper

Read More »

173

click to vote

DAGM
2007
Springer

148views Image Processing» more DAGM 2007»

Efficient Learning of Neural Networks with Evolutionary Algorithms

15 years 9 months ago

Download www.ks.informatik.uni-kiel.de

Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...

Nils T. Siebel, Jochen Krause, Gerald Sommer

claim paper

Read More »

« Prev « First page 150 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers