Search Sciweavers | Sciweavers

171 search results - page 32 / 35

» Principled Methods for Advising Reinforcement Learning Agent...

AAAI
2008

141 views | Intelligent Agents | AAAI 2008

Economic Hierarchical Q-Learning

15 years 9 months ago

Download www.aaai.org

Hierarchical state decompositions address the curse-of-dimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, w...

Erik G. Schultink, Ruggiero Cavallo, David C. Park...

KI
2002
Springer

108 views | Artificial Intelligence | KI 2002

Qualitative Velocity and Ball Interception

15 years 7 months ago

Download fstolzenburg.hs-harz.de

In many approaches for qualitative spatial reasoning, navigation of an agent in a more or less static environment is considered (e.g. in the double-cross calculus [12]). However, i...

Frieder Stolzenburg, Oliver Obst, Jan Murray

IROS
2007
IEEE

164 views | Robotics | IROS 2007

Emulation and behavior understanding through shared values

16 years 1 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Neurophysiology has revealed the existence of mirror neurons in the brain of macaque monkeys, and they show similar activity during execution and observation of goal-directed move...

Yasutake Takahashi, Teruyasu Kawamata, Minoru Asad...

AAAI
2012

221 views | Intelligent Agents | AAAI 2012

Double-Bit Quantization for Hashing

13 years 9 months ago

Download www.cs.sjtu.edu.cn

Hashing, which tries to learn similarity-preserving binary codes for data representation, has been widely used for efficient nearest neighbor search in massive databases due to i...

Weihao Kong, Wu-Jun Li

ATAL
2008
Springer

146 views | Intelligent Agents | ATAL 2008

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 9 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving large-scale instanc...

Cheng Wu, Waleed Meleis
