Sciweavers

3628 search results - page 228 / 726
» The Decision Diffie-Hellman Problem
ATAL
2009
Springer
15 years 11 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
IAT
2005
IEEE
15 years 10 months ago
Decomposing Large-Scale POMDP Via Belief State Analysis
Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...
Xin Li, William K. Cheung, Jiming Liu
TARK
2005
Springer
15 years 10 months ago
Individual error, group error, and the value of information
This paper studies the interaction of error and information both in a single-person setting and in an interactive setting. In contrast to Blackwell’s Theorem, which says...
Itai Sher
ISCC
2000
IEEE
15 years 9 months ago
Dynamic Routing and Wavelength Assignment Using First Policy Iteration
With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, defies an exact solution bec...
Esa Hyytiä, Jorma T. Virtamo
NIPS
2008
15 years 6 months ago
Support Vector Machines with a Reject Option
We consider the problem of binary classification where the classifier may abstain instead of classifying each observation. The Bayes decision rule for this setup, known as Chow’...
Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshe...