Search Sciweavers | Sciweavers

3628 search results - page 228 / 726

» The Decision Diffie-Hellman Problem

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

15 years 11 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

IAT
2005
IEEE

132 views, Intelligent Agents

Decomposing Large-Scale POMDP Via Belief State Analysis

15 years 10 months ago

Download www.comp.hkbu.edu.hk

A partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states to support optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

TARK
2005
Springer

118 views, Automated Reasoning

Individual error, group error, and the value of information

15 years 10 months ago

Download www.tark.org

This paper studies the interaction of error and information both in a single-person setting and in an interactive setting. In contrast to Blackwell’s Theorem, which says...

Itai Sher

ISCC
2000
IEEE

104 views, Communications

Dynamic Routing and Wavelength Assignment Using First Policy Iteration

15 years 9 months ago

Download www.netlab.tkk.fi

With standard assumptions, the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, defies an exact solution bec...

Esa Hyytiä, Jorma T. Virtamo

NIPS
2008

129 views, Information Technology

Support Vector Machines with a Reject Option

15 years 6 months ago

Download eprints.pascal-network.org

We consider the problem of binary classification where the classifier may abstain instead of classifying each observation. The Bayes decision rule for this setup, known as Chow’...

Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshe...
