Search Sciweavers | Sciweavers

82 search results - page 10 / 17

» Learning Selective Averaged One-Dependence Estimators for Pr...

NIPS
2007

146 views, Information Technology

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 8 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

ICPR
2000
IEEE

156 views, Computer Vision

On Gaussian Radial Basis Function Approximations: Interpretation, Extensions, and Learning Strategies

16 years 8 months ago

Download www.lx.it.pt

In this paper we focus on an interpretation of Gaussian radial basis functions (GRBF) which motivates extensions and learning strategies. Specifically, we show that GRBF regressio...

Mário A. T. Figueiredo

AROBOTS
1999

104 views

Reinforcement Learning Soccer Teams with Incomplete World Models

15 years 7 months ago

Download igitur-archive.library.uu.nl

We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may profit significantly from world models (WMs) estimating state transition probabilities an...

Marco Wiering, Rafal Salustowicz, Jürgen Schm...

NIPS
2004

149 views, Information Technology

Co-Validation: Using Model Disagreement on Unlabeled Data to Validate Classification Algorithms

15 years 8 months ago

Download books.nips.cc

In the context of binary classification, we define disagreement as a measure of how often two independently-trained models differ in their classification of unlabeled data. We exp...

Omid Madani, David M. Pennock, Gary William Flake

ICTAI
2010
IEEE

265 views, Artificial Intelligence

Unsupervised Greedy Learning of Finite Mixture Models

15 years 4 months ago

Download nicolagreggio.altervista.org

This work deals with a new technique for estimating the parameters and the number of components in a finite mixture model. The learning procedure is performed by means of an expe...

Nicola Greggio, Alexandre Bernardino, Cecilia Lasc...
