Search Sciweavers | Sciweavers

1234 search results - page 105 / 247

» Multi-criteria Reinforcement Learning

155

click to vote

ICML
2010
IEEE

189views Machine Learning» more ICML 2010»

Nonparametric Return Distribution Approximation for Reinforcement Learning

15 years 7 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

claim paper

Read More »

168

click to vote

MLDM
2005
Springer

112views Machine Learning» more MLDM 2005»

Diagnosis of Lung Nodule Using Reinforcement Learning and Geometric Measures

15 years 11 months ago

Download www.dee.ufma.br

This paper uses a set of 3D geometric measures with the purpose of characterizing lung nodules as malignant or benign. Based on a sample of 36 nodules, 29 benign and 7 malignant, t...

Aristófanes Corrêa Silva, Valdeci Rib...

claim paper

Read More »

165

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 13 days ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

173

click to vote

CG
2000
Springer

150views Computer Graphics» more CG 2000»

Chess Neighborhoods, Function Combination, and Reinforcement Learning

15 years 10 months ago

Download users.soe.ucsc.edu

Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...

Robert Levinson, Ryan Weber

claim paper

Read More »

178

Voted

ICAC
2008
IEEE

99views Applied Computing» more ICAC 2008»

Utility-Based Reinforcement Learning for Reactive Grids

16 years 21 days ago

Download hal.inria.fr

—Large scale production grids are an important case for autonomic computing. They follow a mutualization paradigm: decision-making (human or automatic) is distributed and largely...

Julien Perez, Cécile Germain-Renaud, Bal&aa...

claim paper

Read More »

« Prev « First page 105 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers