Sciweavers

1235 search results - page 140 / 247
» Reinforcement learning in a nutshell
COST
2009
Springer
13 years 8 months ago
How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?
Psychology holds that synchrony is a crucial parameter of any social interaction: to give a human a feeling of natural interaction, a feeling of agency [17], an agent must be a...
Ken Prepin, Philippe Gaussier
AUTOMATICA
2008
13 years 8 months ago
Asynchronous cellular learning automata
Cellular learning automata is a combination of cellular automata and learning automata. The synchronous version of cellular learning automata in which all learning automata in dif...
Hamid Beigy, Mohammad Reza Meybodi
ICML
2009
IEEE
14 years 11 months ago
Binary action search for learning continuous-action control policies
Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...
Jason Pazis, Michail G. Lagoudakis
CIKM
2000
Springer
14 years 2 months ago
Relevance and Reinforcement in Interactive Browsing
We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...
Anton Leuski
ICML
2003
IEEE
14 years 11 months ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan