Search Sciweavers | Sciweavers

374 search results - page 46 / 75

» Multiagent Reinforcement Learning: Theoretical Framework and...

IR
2010

159 views, Natural Language Processing

A general approximation framework for direct optimization of information retrieval measures

13 years 6 months ago

Download research.microsoft.com

Recently, direct optimization of information retrieval (IR) measures has become a new trend in learning to rank. Several methods have been proposed and the effectiveness of them has ...

Tao Qin, Tie-Yan Liu, Hang Li


ICML
2010
IEEE

178 views, Machine Learning

Implicit Online Learning

13 years 5 months ago

Download www.icml2010.org

Online learning algorithms have recently risen to prominence due to their strong theoretical guarantees and an increasing number of practical applications for large-scale data ana...

Brian Kulis, Peter L. Bartlett


ACL
1998

129 views, Computational Linguistics

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

13 years 9 months ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...


CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

13 years 7 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná


ICML
2008
IEEE

105 views, Machine Learning

No-regret learning in convex games

14 years 8 months ago

Download www.cs.cmu.edu

Quite a bit is known about minimizing different kinds of regret in experts problems, and how these regret types relate to types of equilibria in the multiagent setting of repeated...

Geoffrey J. Gordon, Amy R. Greenwald, Casey Marks

