Search Sciweavers | Sciweavers

1235 search results - page 143 / 247

» Reinforcement learning in a nutshell

160

Voted

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 3 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

153

Voted

AIMSA
2006
Springer

159views Artificial Intelligence» more AIMSA 2006»

Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying

15 years 7 months ago

Download tcts.fpms.ac.be

Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...

Olivier Pietquin

claim paper

Read More »

117

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 5 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

click to vote

ICCBR
2005
Springer

91views Automated Reasoning» more ICCBR 2005»

Opportunities for CBR in Learning by Doing

15 years 9 months ago

Download gaia.fdi.ucm.es

In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...

Pedro Pablo Gómez-Martín, Marco Anto...

claim paper

Read More »

141

click to vote

DAGM
2007
Springer

148views Image Processing» more DAGM 2007»

Efficient Learning of Neural Networks with Evolutionary Algorithms

15 years 7 months ago

Download www.ks.informatik.uni-kiel.de

Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...

Nils T. Siebel, Jochen Krause, Gerald Sommer

claim paper

Read More »

« Prev « First page 143 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers