Search Sciweavers | Sciweavers

» Negotiating Using Rewards

SIGMETRICS
2002
ACM

Characterizing the d-TLB behavior of SPEC CPU2000 benchmarks

Despite the numerous optimization and evaluation studies that have been conducted with TLBs over the years, there is still a deficiency in an in-depth understanding of TLB characte...

Gokul B. Kandiraju, Anand Sivasubramaniam

ML
2007
ACM

A general criterion and an algorithmic framework for learning in multi-agent systems

We offer a new formal criterion for agent-centric learning in multi-agent systems, that is, learning that maximizes one's rewards in the presence of other agents who might also...

Rob Powers, Yoav Shoham, Thuc Vu

SAB
2010
Springer

Distributed Online Learning of Central Pattern Generators in Modular Robots

Abstract. In this paper we study distributed online learning of locomotion gaits for modular robots. The learning is based on a stochastic approximation method, SPSA, which optimiz...

David Johan Christensen, Alexander Spröwitz, ...

COLT
2010
Springer

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

GLOBECOM
2010
IEEE

Need-Based Communication for Smart Grid: When to Inquire Power Price?

In a smart grid, a home appliance can adjust its power consumption level according to the real-time power price obtained from communication channels. Most studies on smart grid do not...

Husheng Li, Robert C. Qiu
