Sciweavers

2566 search results - page 30 / 514
» Relating reinforcement learning performance to classificatio...
Sort
View
119
Voted
PRL
2006
78views more  PRL 2006»
15 years 3 months ago
The interaction between classification and reject performance for distance-based reject-option classifiers
Consider the class of problems in which a target class is well-defined, and an outlier class is ill-defined. In these cases new outlier classes can appear, or the class-conditiona...
Thomas Landgrebe, David M. J. Tax, Pavel Pacl&iacu...
124
Voted
ICML
2006
IEEE
16 years 4 months ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
KDD
2008
ACM
150views Data Mining» more  KDD 2008»
16 years 4 months ago
Hypergraph spectral learning for multi-label classification
A hypergraph is a generalization of the traditional graph in which the edges are arbitrary non-empty subsets of the vertex set. It has been applied successfully to capture highord...
Liang Sun, Shuiwang Ji, Jieping Ye
ICML
2006
IEEE
16 years 4 months ago
Using inaccurate models in reinforcement learning
In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...
Pieter Abbeel, Morgan Quigley, Andrew Y. Ng
127
Voted
ECML
2003
Springer
15 years 8 months ago
Optimising Performance of Competing Search Engines in Heterogeneous Web Environments
Abstract. Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-specific search engines provide search services, and metasearchers...
Rinat Khoussainov, Nicholas Kushmerick