Sciweavers

1234 search results - page 105 / 247
» Multi-criteria Reinforcement Learning
Sort
View
ICML
2010
IEEE
15 years 5 months ago
Nonparametric Return Distribution Approximation for Reinforcement Learning
Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...
Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...
MLDM
2005
Springer
15 years 10 months ago
Diagnosis of Lung Nodule Using Reinforcement Learning and Geometric Measures
This paper uses a set of 3D geometric measures with the purpose of characterizing lung nodules as malignant or benign. Based on a sample of 36 nodules, 29 benign and 7 malignant, t...
Aristófanes Corrêa Silva, Valdeci Rib...
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
15 years 10 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
CG
2000
Springer
15 years 8 months ago
Chess Neighborhoods, Function Combination, and Reinforcement Learning
Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...
Robert Levinson, Ryan Weber
ICAC
2008
IEEE
15 years 11 months ago
Utility-Based Reinforcement Learning for Reactive Grids
—Large scale production grids are an important case for autonomic computing. They follow a mutualization paradigm: decision-making (human or automatic) is distributed and largely...
Julien Perez, Cécile Germain-Renaud, Bal&aa...