Search Sciweavers | Sciweavers

3049 search results - page 20 / 610

» On the Convergence of Bound Optimization Algorithms

150

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence, Targeted Optimality, and Safety in Multiagent Learning

15 years 7 months ago

Download www.cs.utexas.edu

This paper introduces a novel multiagent learning algorithm, Convergence with Model Learning and Safety (or CMLeS in short), which achieves convergence, targeted optimality agains...

Doran Chakraborty, Peter Stone

claim paper

Read More »

190

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 27 days ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

170

click to vote

COLT
1997
Springer

113views Machine Learning» more COLT 1997»

General Convergence Results for Linear Discriminant Updates

15 years 10 months ago

Download www.cs.iastate.edu

The problem of learning linear discriminant concepts can be solved by various mistake-driven update procedures, including the Winnow family of algorithms and the well-known Percep...

Adam J. Grove, Nick Littlestone, Dale Schuurmans

claim paper

Read More »

199

click to vote

ATAL
2008
Springer

147views Intelligent Agents» more ATAL 2008»

On k-optimal distributed constraint optimization algorithms: new bounds and algorithms

15 years 8 months ago

Download teamcore.usc.edu

Distributed constraint optimization (DCOP) is a promising approach to coordination, scheduling and task allocation in multi agent networks. In large-scale or low-bandwidth network...

Emma Bowring, Jonathan P. Pearce, Christopher Port...

claim paper

Read More »

174

Voted

INFOCOM
1995
IEEE

122views Communications» more INFOCOM 1995»

Complexity of Gradient Projection Method for Optimal Routing in Data Networks

15 years 9 months ago

Download www.cs.ou.edu

—In this paper, we derive a time-complexity bound for the gradient projection method for optimal routing in data networks. This result shows that the gradient projection algorith...

Wei Kang Tsai, John K. Antonio, Garng M. Huang

claim paper

Read More »

« Prev « First page 20 / 610 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers