Search Sciweavers | Sciweavers

45 search results - page 7 / 9

» Expected Convergence Properties of BGP

177

WSC
2001

120views Modeling And Simulation» more WSC 2001»

On improving the performance of simulation-based algorithms for average reward processes with application to network pricing

15 years 7 months ago

Download home.gwu.edu

We address performance issues associated with simulationbased algorithms for optimizing Markov reward processes. Specifically, we are concerned with algorithms that exploit the re...

Enrique Campos-Náñez, Stephen D. Pat...

claim paper

Read More »

150

click to vote

AAAI
1998

175views Intelligent Agents» more AAAI 1998»

Bayesian Q-Learning

15 years 7 months ago

Download www.aaai.org

A central problem in learning in complex environmentsis balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of explora...

Richard Dearden, Nir Friedman, Stuart J. Russell

claim paper

Read More »

179

click to vote

CORR
2010
Springer

136views Education» more CORR 2010»

Comparing Prediction Market Structures, With an Application to Market Making

15 years 2 months ago

Download www.cs.rpi.edu

Ensuring sufficient liquidity is one of the key challenges for designers of prediction markets. Various market making algorithms have been proposed in the literature and deployed ...

Aseem Brahma, Sanmay Das, Malik Magdon-Ismail

claim paper

Read More »

152

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 12 days ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

157

click to vote

ICML
2004
IEEE

118views Machine Learning» more ICML 2004»

Leveraging the margin more carefully

16 years 6 months ago

Download reference.kfupm.edu.sa

Boosting is a popular approach for building accurate classifiers. Despite the initial popular belief, boosting algorithms do exhibit overfitting and are sensitive to label noise. ...

Nir Krause, Yoram Singer

claim paper

Read More »

« Prev « First page 7 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers