Search Sciweavers | Sciweavers

548 search results - page 88 / 110

» Optimization of Convex Risk Functions

ICML
2009
IEEE

186 views, Machine Learning

Regularization and feature selection in least-squares temporal difference learning

16 years 4 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

ICALP
2009
Springer

105 views, Programming Languages

Proportional Response Dynamics in the Fisher Market

16 years 4 months ago

Download research.microsoft.com

In this paper, we show that the proportional response dynamics, a utility-based distributed dynamics, converges to the market equilibrium in the Fisher market with consta...

Li Zhang

GLOBECOM
2008
IEEE

94 views, Communications

Outage-Based Rate Maximization in CDMA Wireless Networks

15 years 10 months ago

Download www.ee.kth.se

The problem of maximizing the sum of the transmit rates while limiting the outage probability below an appropriate threshold is investigated for networks where the nodes have li...

M. D'Angelo, Carlo Fischione, Matteo Butussi, Ales...

IOR
2008

109 views

Polynomial-Time Algorithms for Stochastic Uncapacitated Lot-Sizing Problems

15 years 4 months ago

Download www.ise.ufl.edu

In 1958, Wagner and Whitin published a seminal paper on the deterministic uncapacitated lot-sizing problem, a fundamental model that is embedded in many practical production plann...

Yongpei Guan, Andrew J. Miller

CORR
2010
Springer

163 views, Education

Faster Rates for training Max-Margin Markov Networks

15 years 2 months ago

Download people.cs.uchicago.edu

Structured output prediction is an important machine learning problem both in theory and practice, and the max-margin Markov network (M³N) is an effective approach. All state-of-...

Xinhua Zhang, Ankan Saha, S. V. N. Vishwanathan
