Search Sciweavers | Sciweavers

55 search results - page 9 / 11

» Policy Tree: Adaptive Representation for Policy Gradient

162

AAAI
2010

185 views, Intelligent Agents

Symbolic Dynamic Programming for First-order POMDPs

15 years 7 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting

159

FAST
2010

139 views, Operating System

quFiles: The Right File at the Right Time

15 years 8 months ago

Download www.usenix.org

A quFile is a unifying abstraction that simplifies data management by encapsulating different physical representations of the same logical data. Similar to a quBit (quantum bit), the parti...

Kaushik Veeraraghavan, Jason Flinn, Edmund B. Nigh...

154

AI
2007
Springer

183 views, Artificial Intelligence

Competition and Coordination in Stochastic Games

16 years 11 days ago

Download www.damas.ift.ulaval.ca

Agent competition and coordination are two classical and highly important tasks in multiagent systems. In recent years, a number of learning algorithms have been proposed to resolve ...

Andriy Burkov, Abdeslam Boularias, Brahim Chaib-dr...

161

IPTPS
2003
Springer

130 views, Computer Networks

Adaptive Peer Selection

15 years 11 months ago

Download iptps03.cs.berkeley.edu

In a peer-to-peer file-sharing system, a client desiring a particular file must choose a source from which to download. The problem of selecting a good data source is difficult...

Daniel S. Bernstein, Zhengzhu Feng, Brian Neil Lev...

169

ATAL
2010
Springer

129 views, Intelligent Agents

Learning multi-agent state space representations

15 years 7 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...
