Search Sciweavers | Sciweavers

3837 search results - page 119 / 768

» Learning Approximate Consistencies

IR
2010

159 views | Natural Language Processing

A general approximation framework for direct optimization of information retrieval measures

15 years 2 months ago

Download research.microsoft.com

Direct optimization of information retrieval (IR) measures has recently become a new trend in learning to rank. Several methods have been proposed and their effectiveness has ...

Tao Qin, Tie-Yan Liu, Hang Li

PKDD
2010
Springer

164 views | Data Mining

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 2 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

CDC
2010
IEEE

136 views | Control Systems

Pathologies of temporal difference methods in approximate dynamic programming

14 years 11 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

JAIR
2011

187 views

A Monte-Carlo AIXI Approximation

14 years 11 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

ICML
2009
IEEE

120 views | Machine Learning

Learning linear dynamical systems without sequence information

15 years 11 months ago

Download www.cs.mcgill.ca

Virtually all methods of learning dynamic systems from data start from the same basic assumption: that the learning algorithm will be provided with a sequence, or trajectory, of d...

Tzu-Kuo Huang, Jeff Schneider
