Search Sciweavers | Sciweavers

2108 search results - page 151 / 422

» Tracking in Reinforcement Learning

111

Voted

CVPR
2010
IEEE

292views Computer Vision» more CVPR 2010»

An Online Approach: Learning-Semantic-Scene-by-Tracking and Tracking-by-Learning-Semantic-Scene

15 years 7 months ago

Download www.cis.pku.edu.cn

Learning the knowledge of scene structure and tracking a large number of targets are both active topics of computer vision in recent years, which plays a crucial role in surveilla...

Xuan Song, Xiaowei Shao, Huijing Zhao, Jinshi Cui,...

claim paper

Read More »

136

click to vote

HT
2009
ACM

146views Internet Technology» more HT 2009»

Improving recommender systems with adaptive conversational strategies

15 years 9 months ago

Download www.inf.unibz.it

Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

Voted

ATAL
2007
Springer

108views Intelligent Agents» more ATAL 2007»

Dynamic task allocation within an open service-oriented MAS architecture

15 years 8 months ago

Download www.isys.ucl.ac.be

A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...

Ivan Jureta, Stéphane Faulkner, Youssef Ach...

claim paper

Read More »

131

Voted

GECCO
2010
Springer

153views Optimization» more GECCO 2010»

Multi-task evolutionary shaping without pre-specified representations

15 years 6 months ago

Download www.science.uva.nl

Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...

Matthijs Snel, Shimon Whiteson

claim paper

Read More »

126

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 4 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

« Prev « First page 151 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers