Search Sciweavers | Sciweavers

AAAI
2010

Intelligent Agents

Multi-Agent Learning with Policy Prediction

13 years 9 months ago

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

FLAIRS
2004

Artificial Intelligence

Recurrent Neural Networks and Pitch Representations for Music Tasks

13 years 9 months ago

Download maven.smith.edu

We present results from experiments in using several pitch representations for jazz-oriented musical tasks performed by a recurrent neural network. We have run experiments with se...

Judy A. Franklin

NN
1998
Springer

Neural Networks

How embedded memory in recurrent neural network architectures helps learning long-term temporal dependencies

13 years 7 months ago

Download clgiles.ist.psu.edu

Learning long-term temporal dependencies with recurrent neural networks can be a difficult problem. It has recently been shown that a class of recurrent neural networks called NA...

Tsungnan Lin, Bill G. Horne, C. Lee Giles

ICML
2009
IEEE

Machine Learning

Monte-Carlo simulation balancing

14 years 8 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

ORL
2008

On polynomial cases of the unichain classification problem for Markov Decision Processes

13 years 7 months ago

Download www.ams.sunysb.edu

The unichain classification problem detects whether a finite state and action MDP is unichain under all deterministic policies. This problem is NP-hard [11]. This paper provides p...

Eugene A. Feinberg, Fenghsu Yang
