Search Sciweavers | Sciweavers

3381 search results - page 331 / 677

» LEO - DB2's LEarning Optimizer

159

click to vote

NIPS
2004

112views Information Technology» more NIPS 2004»

Learning first-order Markov models for control

15 years 7 months ago

Download books.nips.cc

First-order Markov models have been successfully applied to many problems, for example in modeling sequential data using Markov chains, and modeling control problems using the Mar...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

143

click to vote

NIPS
2004

138views Information Technology» more NIPS 2004»

New Criteria and a New Algorithm for Learning in Multi-Agent Systems

15 years 7 months ago

Download books.nips.cc

We propose a new set of criteria for learning algorithms in multi-agent systems, one that is more stringent and (we argue) better justified than previous proposed criteria. Our cr...

Rob Powers, Yoav Shoham

claim paper

Read More »

145

click to vote

UAI
2001

117views Artificial Intelligence» more UAI 2001»

Aggregating Learned Probabilistic Beliefs

15 years 7 months ago

Download ai.stanford.edu

We consider the task of aggregating beliefs of several experts. We assume that these beliefs are represented as probability distributions. We argue that the evaluation of any aggr...

Pedrito Maynard-Reid II, Urszula Chajewska

claim paper

Read More »

187

click to vote

IJCAI
2003

193views Artificial Intelligence» more IJCAI 2003»

When Discriminative Learning of Bayesian Network Parameters Is Easy

15 years 7 months ago

Download dli.iiit.ac.in

Bayesian network models are widely used for discriminative prediction tasks such as classification. Usually their parameters are determined using 'unsupervised' methods ...

Hannes Wettig, Peter Grünwald, Teemu Roos, Pe...

claim paper

Read More »

154

click to vote

ICML
2010
IEEE

189views Machine Learning» more ICML 2010»

Nonparametric Return Distribution Approximation for Reinforcement Learning

15 years 7 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

claim paper

Read More »

« Prev « First page 331 / 677 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers