Machine Learning | Sciweavers

46

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

15 years 14 days ago

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

40

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

15 years 14 days ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

35

click to vote

ICML
1995
IEEE

127views Machine Learning» more ICML 1995»

Removing the Genetics from the Standard Genetic Algorithm

15 years 14 days ago

Download www.cse.unr.edu

Shumeet Baluja, Rich Caruana

claim paper

Read More »

50

click to vote

ICML
1995
IEEE

123views Machine Learning» more ICML 1995»

Visualizing High-Dimensional Structure with the Incremental Grid Growing Neural Network

15 years 14 days ago

Download www.cs.utexas.edu

Understanding high-dimensional real world data usually requires learning the structure of the data space. The structure maycontain high-dimensional clusters that are related in co...

Justine Blackmore, Risto Miikkulainen

claim paper

Read More »

46

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

15 years 14 days ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

41

click to vote

ICML
1996
IEEE

199views Machine Learning» more ICML 1996»

Applying the Multiple Cause Mixture Model to Text Categorization

15 years 14 days ago

Download robotics.stanford.edu

Mehran Sahami, Marti A. Hearst, Eric Saund

claim paper

Read More »

36

click to vote

ICML
1996
IEEE

118views Machine Learning» more ICML 1996»

Representing and Learning Quality-Improving Search Control Knowledge

15 years 14 days ago

Download reference.kfupm.edu.sa

Generating good, production-quality plans is an essential element in transforming planners from research tools into real-world applications, but one that has been frequently overl...

M. Alicia Pérez

claim paper

Read More »

39

click to vote

ICML
1996
IEEE

101views Machine Learning» more ICML 1996»

Unsupervised Learning Using MML

15 years 14 days ago

Download reference.kfupm.edu.sa

This paper discusses the unsupervised learning problem. An important part of the unsupervised learning problem is determining the numberofconstituent groups (componentsor classes)...

Jonathan J. Oliver, Rohan A. Baxter, Chris S. Wall...

claim paper

Read More »

31

click to vote

ICML
1996
IEEE

119views Machine Learning» more ICML 1996»

Searching for Structure in Multiple Streams of Data

15 years 14 days ago

Download www.cs.arizona.edu

Finding structure in multiple streams of data is an important problem. Consider the streams of data owing from a robot's sensors, the monitors in an intensive care unit, or p...

Tim Oates, Paul R. Cohen

claim paper

Read More »

40

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

15 years 14 days ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers