Search Sciweavers | Sciweavers

248 search results - page 10 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

ATAL
2008
Springer

92views Intelligent Agents» more ATAL 2008»

Stochastic search methods for Nash equilibrium approximation in simulation-based games

15 years 9 months ago

Download www.seas.upenn.edu

We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a...

Yevgeniy Vorobeychik, Michael P. Wellman

NIPS
1993

103views Information Technology» more NIPS 1993»

Optimal Stochastic Search and Adaptive Momentum

15 years 8 months ago

Download www.bme.ogi.edu

Stochastic optimization algorithms typically use learning rate schedules that behave asymptotically as $\mu(t) = \mu_0/t$. The ensemble dynamics (Leen and Moody, 1993) for such algorithms ...

Todd K. Leen, Genevieve B. Orr

DCOSS
2006
Springer

179views Distributed And Parallel Com...» more DCOSS 2006»

Approximation Algorithms for Power-Aware Scheduling of Wireless Sensor Networks with Rate and Duty-Cycle Constraints

15 years 10 months ago

Download www.ece.lsu.edu

We develop algorithms for finding the minimum energy transmission schedule for duty-cycle and rate constrained wireless sensor nodes transmitting over an interference channel. Sinc...

Rajgopal Kannan, Shuangqing Wei

ANOR
2002

100views more ANOR 2002»

A Limited-Memory Multipoint Symmetric Secant Method for Bound Constrained Optimization

15 years 7 months ago

Download www.famaf.unc.edu.ar

A new algorithm for solving smooth large-scale minimization problems with bound constraints is introduced. The way of dealing with active constraints is similar to the one used in...

Oleg P. Burdakov, José Mario Martíne...

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 7 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke
