Machine Learning | Sciweavers

157

ICML
2004
IEEE

123views Machine Learning» more ICML 2004»

Learning low dimensional predictive representations

16 years 7 months ago

Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...

Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...

claim paper

Read More »

121

click to vote

ICML
2004
IEEE

146views Machine Learning» more ICML 2004»

Learning to cluster using local neighborhood structure

16 years 7 months ago

Download www.psi.toronto.edu

Brendan J. Frey, Kannan Achan, Rómer Rosale...

claim paper

Read More »

174

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

143

click to vote

ICML
2004
IEEE

159views Machine Learning» more ICML 2004»

A maximum entropy approach to species distribution modeling

16 years 7 months ago

Download www.cs.princeton.edu

We study the problem of modeling species geographic distributions, a critical problem in conservation biology. We propose the use of maximum-entropy techniques for this problem, s...

Miroslav Dudík, Robert E. Schapire, Steven ...

claim paper

Read More »

150

click to vote

ICML
2004
IEEE

155views Machine Learning» more ICML 2004»

Decentralized detection and classification using kernel methods

16 years 7 months ago

Download oz.berkeley.edu

We consider the problem of decentralized detection under constraints on the number of bits that can be transmitted by each sensor. In contrast to most previous work, in which the ...

XuanLong Nguyen, Martin J. Wainwright, Michael I. ...

claim paper

Read More »

146

click to vote

ICML
2004
IEEE

110views Machine Learning» more ICML 2004»

Learning first-order rules from data with multiple parts: applications on mining chemical compound data

16 years 7 months ago

Download www.ai.sanken.osaka-u.ac.jp

Inductive learning of first-order theory based on examples has serious bottleneck in the enormous hypothesis search space needed, making existing learning approaches perform poorl...

Cholwich Nattee, Sukree Sinthupinyo, Masayuki Numa...

claim paper

Read More »

156

click to vote

ICML
2004
IEEE

187views Machine Learning» more ICML 2004»

Automated hierarchical mixtures of probabilistic principal component analyzers

16 years 7 months ago

Download www.ece.neu.edu

Many clustering algorithms fail when dealing with high dimensional data. Principal component analysis (PCA) is a popular dimensionality reduction algorithm. However, it assumes a ...

Ting Su, Jennifer G. Dy

claim paper

Read More »

176

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 7 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

159

click to vote

ICML
2004
IEEE

205views Machine Learning» more ICML 2004»

Learning with non-positive kernels

16 years 7 months ago

Download eprints.pascal-network.org

In this paper we show that many kernel methods can be adapted to deal with indefinite kernels, that is, kernels which are not positive semidefinite. They do not satisfy Mercer...

Alexander J. Smola, Cheng Soon Ong, Stéphan...

claim paper

Read More »

165

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 7 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers