Search Sciweavers | Sciweavers

1234 search results - page 164 / 247

» Multi-criteria Reinforcement Learning

137

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

158

click to vote

ICAC
2009
IEEE

226views Applied Computing» more ICAC 2009»

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 1 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

129

click to vote

SIGIR
2003
ACM

116views Information Technology» more SIGIR 2003»

ReCoM: reinforcement clustering of multi-type interrelated data objects

15 years 9 months ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...

claim paper

Read More »

142

click to vote

GECCO
2009
Springer

150views Optimization» more GECCO 2009»

Discrete dynamical genetic programming in XCS

15 years 10 months ago

Download www.cems.uwe.ac.uk

A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...

Richard Preen, Larry Bull

claim paper

Read More »

134

click to vote

IROS
2006
IEEE

187views Robotics» more IROS 2006»

Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

15 years 10 months ago

Download hawaii.aist-nara.ac.jp

— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...

Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...

claim paper

Read More »

« Prev « First page 164 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers