Sciweavers

355 search results - page 31 / 71
» Online Learning and Exploiting Relational Models in Reinforc...
Sort
View
124
Voted
ATAL
2005
Springer
15 years 9 months ago
Rapid on-line temporal sequence prediction by an adaptive agent
Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning ...
Steven Jensen, Daniel Boley, Maria L. Gini, Paul R...
147
Voted
ROMAN
2007
IEEE
179views Robotics» more  ROMAN 2007»
15 years 10 months ago
Online Affect Detection and Adaptation in Robot Assisted Rehabilitation for Children with Autism
–This paper presents a novel affect-sensitive human-robot interaction framework for rehabilitation of children with autism spectrum disorder (ASD) where the robot can detect the ...
Changchun Liu, Karla Conn, Nilanjan Sarkar, Wendy ...
143
Voted
SIGIR
2010
ACM
15 years 7 months ago
How good is a span of terms?: exploiting proximity to improve web retrieval
Ranking search results is a fundamental problem in information retrieval. In this paper we explore whether the use of proximity and phrase information can improve web retrieval ac...
Krysta Marie Svore, Pallika H. Kanani, Nazan Khan
153
Voted
NIPS
2001
15 years 5 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
120
Voted
GAMEON
2007
15 years 5 months ago
Agent Based Virtual Tutorship and E-Learning Techniques Applied to a Business Game Built on System Dynamics
An advanced Business Game is presented in the paper, built on the methodology of System Dynamics. It can be used for cognitive learning and knowledge transmission in schools and U...
Marco Remondino