Search Sciweavers | Sciweavers

1235 search results - page 215 / 247

» Reinforcement learning in a nutshell

148

Voted

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 13 days ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

133

Voted

WWW
2010
ACM

275views Internet Technology» more WWW 2010»

iRIN: image retrieval in image-rich information networks

15 years 10 months ago

Download www.cs.uiuc.edu

In this demo, we present a system called iRIN designed for performing image retrieval in image-rich information networks. We ﬁrst introduce MoK-SimRank to signiﬁcantly improve...

Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Josh...

claim paper

Read More »

138

Voted

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

15 years 9 months ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

125

click to vote

ATAL
2005
Springer

117views Intelligent Agents» more ATAL 2005»

Modeling task allocation using a decision theoretic model

15 years 9 months ago

Download dis.cs.umass.edu

Mediation is the process of decomposing a task into subtasks, ﬁnding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

115

Voted

ISCC
2003
IEEE

110views Communications» more ISCC 2003»

Intelligent Agents Serving Based On The Society Information

15 years 8 months ago

Download www3.itu.edu.tr

In this paper, we propose a serving system consisting intelligent agents processing society information in a multi-user domain. The agents use the similarity information on the us...

Sanem Sariel, B. Tevfik Akgün

claim paper

Read More »

« Prev « First page 215 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers