Sciweavers

ICML
2003
IEEE
15 years 15 days ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford
ICML
2003
IEEE
15 years 15 days ago
Evolving Strategies for Focused Web Crawling
Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...
ICML
2003
IEEE
15 years 15 days ago
Transductive Learning via Spectral Graph Partitioning
We present a new method for transductive learning, which can be seen as a transductive version of the k nearest-neighbor classifier. Unlike for many other transductive learning me...
Thorsten Joachims
ICML
2003
IEEE
15 years 15 days ago
A Faster Iterative Scaling Algorithm for Conditional Exponential Model
Rong Jin, Rong Yan, Jian Zhang, Alexander G. Haupt...
ICML
2003
IEEE
15 years 15 days ago
Avoiding Bias when Aggregating Relational Data with Degree Disparity
David Jensen, Jennifer Neville, Michael Hay
ICML
2003
IEEE
15 years 15 days ago
Probabilistic Classifiers and the Concepts They Recognize
We investigate algebraic, logical, and geometric properties of concepts recognized by various classes of probabilistic classifiers. For this we introduce a natural hierarchy of pr...
Manfred Jaeger
ICML
2003
IEEE
15 years 15 days ago
Goal-directed Learning to Fly
Learning to fly an aircraft is a complex task that requires the development of control skills and goal achievement strategies. This paper presents a behavioural cloning system tha...
Andrew Isaac, Claude Sammut
ICML
2003
IEEE
15 years 15 days ago
Online Ranking/Collaborative Filtering Using the Perceptron Algorithm
In this paper we present a simple to implement truly online large margin version of the Perceptron ranking (PRank) algorithm, called the OAP-BPM (Online Aggregate Prank-Bayes Poin...
Edward F. Harrington
ICML
2003
IEEE
15 years 15 days ago
Correlated Q-Learning
Amy R. Greenwald, Keith Hall
ICML
2003
IEEE
15 years 15 days ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan