Sciweavers

259 search results - page 11 / 52
» Reinforcement Learning with the Use of Costly Features
Sort
View
NIPS
1993
13 years 10 months ago
Robust Reinforcement Learning in Motion Planning
While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...
Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...
P2P
2006
IEEE
101views Communications» more  P2P 2006»
14 years 3 months ago
Reinforcement Learning for Query-Oriented Routing Indices in Unstructured Peer-to-Peer Networks
The idea of building query-oriented routing indices has changed the way of improving routing efficiency from the basis as it can learn the content distribution during the query r...
Cong Shi, Shicong Meng, Yuanjie Liu, Dingyi Han, Y...
ICDAR
2009
IEEE
13 years 6 months ago
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...
Ahmad Abdulkader, Mathew R. Casey
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
14 years 3 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
ICML
2006
IEEE
14 years 3 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup