Search Sciweavers | Sciweavers

115 search results - page 10 / 23

» Learning hierarchical task networks by observation


ROBOCUP
2007
Springer

167 views, Robotics

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

15 years 12 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada


ICANN
2007
Springer

95 views, Neural Networks

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 12 months ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...


AR
2008

118 views

Efficient Behavior Learning Based on State Value Estimation of Self and Others

15 years 5 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning methods have been seriously suffering from the curse of dimension problem, especially when they are applied to multiagent dynamic environments. ...

Yasutake Takahashi, Kentarou Noma, Minoru Asada


ICDM
2003
IEEE

119 views, Data Mining

A Dynamic Adaptive Self-Organising Hybrid Model for Text Clustering

15 years 11 months ago

Download www.informatik.uni-hamburg.de

Clustering by document concepts is a powerful way of retrieving information from a large number of documents. This task in general does not make any assumption on the data distrib...

Chihli Hung, Stefan Wermter


AVSS
2009
IEEE

248 views, Signal Processing

Bayesian Bio-inspired Model for Learning Interactive Trajectories

15 years 9 months ago

Download www.isip40.it

Automatic understanding of human behavior is an important and challenging objective in several surveillance applications. One of the main problems of this task consists in accur...

Alessio Dore, Carlo S. Regazzoni
