Search Sciweavers | Sciweavers

» Reinforcement learning in a nutshell

ACL
2010

Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems

We present a data-driven approach to learn user-adaptive referring expression generation (REG) policies for spoken dialogue systems. Referring expressions can be difficult to unde...

Srinivasan Janarthanam, Oliver Lemon

ACG
2003
Springer

Evaluation in Go by a Neural Network using Soft Segmentation

In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position ev...

Markus Enzenberger

ROBOCUP
2001
Springer

Essex Wizards 2001 Team Description

This article presents an overview of the Essex Wizards 2001 team, which participated in the RoboCup 2001 simulator league. Four major issues have been addressed, namely a generalized appr...

Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Nik...

NIPS
2004

Multi-agent Cooperation in Diverse Population Games

We consider multi-agent systems whose agents compete for resources by striving to be in the minority group. The agents adapt to the environment by reinforcement learning of the pr...

K. Y. Michael Wong, S. W. Lim, Zhuo Gao

NIPS
2003

Policy Search by Dynamic Programming

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
