Search Sciweavers | Sciweavers

1236 search results - page 167 / 248

» Opposition-Based Reinforcement Learning

159

click to vote

PRICAI
2000
Springer

127views Artificial Intelligence» more PRICAI 2000»

Constructing an Autonomous Agent with an Interdependent Heuristics

15 years 10 months ago

Download www.ai.sanken.osaka-u.ac.jp

When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...

Koichi Moriyama, Masayuki Numao

claim paper

Read More »

179

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

178

Voted

SIGIR
2003
ACM

116views Information Technology» more SIGIR 2003»

ReCoM: reinforcement clustering of multi-type interrelated data objects

16 years 15 hour ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...

claim paper

Read More »

177

Voted

GECCO
2009
Springer

150views Optimization» more GECCO 2009»

Discrete dynamical genetic programming in XCS

16 years 1 months ago

Download www.cems.uwe.ac.uk

A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...

Richard Preen, Larry Bull

claim paper

Read More »

179

Voted

IROS
2006
IEEE

187views Robotics» more IROS 2006»

Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

16 years 24 days ago

Download hawaii.aist-nara.ac.jp

— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...

Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...

claim paper

Read More »

« Prev « First page 167 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers