Search Sciweavers | Sciweavers

55 search results - page 5 / 11

» Policy Tree: Adaptive Representation for Policy Gradient

286

click to vote

TON
2010

151views more TON 2010»

Throughput Optimal Distributed Power Control of Stochastic Wireless Networks

15 years 27 days ago

Download pantheon.yale.edu

The Maximum Differential Backlog (MDB) control policy of Tassiulas and Ephremides has been shown to adaptively maximize the stable throughput of multihop wireless networks with ran...

Yufang Xi, Edmund M. Yeh

claim paper

Read More »

186

click to vote

NIPS
2003

207views Information Technology» more NIPS 2003»

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 7 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

claim paper

Read More »

146

click to vote

IWMMDBMS
1998

123views more IWMMDBMS 1998»

An Adaptive Block Management Scheme Using On-Line Detection of Block Reference Patterns

15 years 7 months ago

Download embedded.dankook.ac.kr

Recent research has shown that near optimal performance can be achieved by adaptive block replacement policies that use user-level hints regarding the block reference pattern. How...

Jongmoo Choi, Sam H. Noh, Sang Lyul Min, Yookun Ch...

claim paper

Read More »

280

click to vote

ICDE
2004
IEEE

139views Database» more ICDE 2004»

Engineering a Fast Online Persistent Suffix Tree Construction

16 years 7 months ago

Download dsl.serc.iisc.ernet.in

Online persistent suffix tree construction has been considered impractical due to its excessive I/O costs. However, these prior studies have not taken into account the effects of ...

Srikanta J. Bedathur, Jayant R. Haritsa

claim paper

Read More »

143

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

16 years 11 days ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 5 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers