ICML
15 years 5 months ago
1994 IEEE
Conservation of information (COI) popularized by the no free lunch theorem is a great leveler of search algorithms, showing that on average no search outperforms any other. Yet in ...
ICML
15 years 5 months ago
1994 IEEE
We explore algorithms for learning classification procedures that attempt to minimize the cost of misclassifying examples. First, we consider inductive learning of classification ...
ICML
15 years 5 months ago
1994 IEEE
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
ICML
15 years 5 months ago
1994 IEEE
Model selection is important in many areas of supervised learning. Given a dataset and a set of models for predicting with that dataset, we must choose the model which is expected...
ICML
15 years 5 months ago
1994 IEEE
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....