Search Sciweavers | Sciweavers

29

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

13 years 11 months ago

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

22

click to vote

ICONIP
2008

74views Information Technology» more ICONIP 2008»

On Similarity Measures for Spike Trains

13 years 9 months ago

Download www.dauwels.com

A variety of (dis)similarity measures for one-dimensional point processes (e.g., spike trains) are investigated, including the Victor-Purpura distance metric, the van Rossum distan...

Justin Dauwels, François B. Vialatte, Theop...

claim paper

Read More »

30

click to vote

STAIRS
2008

169views Education» more STAIRS 2008»

Probabilistic Association Rules for Item-Based Recommender Systems

13 years 9 months ago

Download hal.inria.fr

Since the beginning of the 1990's, the Internet has constantly grown, proposing more and more services and sources of information. The challenge is no longer to provide users ...

Sylvain Castagnos, Armelle Brun, Anne Boyer

claim paper

Read More »

16

click to vote

IJCAI
2003

118views Artificial Intelligence» more IJCAI 2003»

Simultaneous Adversarial Multi-Robot Learning

13 years 9 months ago

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

26

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

13 years 9 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers