Search Sciweavers | Sciweavers

536 search results - page 63 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

click to vote

CVPR
2010
IEEE

496views Computer Vision» more CVPR 2010»

SPEC Hashing: Similarity Preserving algorithm for Entropy-based Coding

14 years 3 months ago

Download www.cs.utoronto.ca

Searching approximate nearest neighbors in large scale high dimensional data set has been a challenging problem. This paper presents a novel and fast algorithm for learning binary...

Ruei-Sung Lin, David Ross, Jay Yagnik

claim paper

Read More »

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

13 years 9 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Internal Rewards Mitigate Agent Boundedness

13 years 8 months ago

Download www-personal.umich.edu

Abstract--Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals-however they came to be defined--while ignoring the rela...

Jonathan Sorg, Satinder P. Singh, Richard Lewis

claim paper

Read More »

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

13 years 2 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

click to vote

ML
2002
ACM

146views Machine Learning» more ML 2002»

Kernel Matching Pursuit

13 years 7 months ago

Download www.iro.umontreal.ca

Matching Pursuit algorithms learn a function that is a weighted sum of basis functions, by sequentially appending functions to an initially empty basis, to approximate a target fu...

Pascal Vincent, Yoshua Bengio

claim paper

Read More »

« Prev « First page 63 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers