Search Sciweavers | Sciweavers

259 search results - page 11 / 52

» Reinforcement Learning with the Use of Costly Features

157

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 7 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

147

click to vote

P2P
2006
IEEE

101views Communications» more P2P 2006»

Reinforcement Learning for Query-Oriented Routing Indices in Unstructured Peer-to-Peer Networks

15 years 11 months ago

Download www.cc.gatech.edu

The idea of building query-oriented routing indices has changed the way of improving routing efﬁciency from the basis as it can learn the content distribution during the query r...

Cong Shi, Shicong Meng, Yuanjie Liu, Dingyi Han, Y...

claim paper

Read More »

175

click to vote

ICDAR
2009
IEEE

117views Document Analysis» more ICDAR 2009»

Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment

15 years 3 months ago

Download www.cvc.uab.es

We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...

Ahmad Abdulkader, Mathew R. Casey

claim paper

Read More »

161

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

15 years 12 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

191

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 11 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

« Prev « First page 11 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers