Search Sciweavers | Sciweavers

1235 search results - page 140 / 247

» Reinforcement learning in a nutshell

167

click to vote

COST
2009
Springer

185views Multimedia» more COST 2009»

How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?

15 years 1 months ago

Download gaussier.free.fr

Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...

Ken Prepin, Philippe Gaussier

claim paper

Read More »

172

click to vote

AUTOMATICA
2008

198views more AUTOMATICA 2008»

Asynchronous cellular learning automata

15 years 1 months ago

Download ceit.aut.ac.ir

Cellular learning automata is a combination of cellular automata and learning automata. The synchronous version of cellular learning automata in which all learning automata in dif...

Hamid Beigy, Mohammad Reza Meybodi

claim paper

Read More »

128

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

16 years 4 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

105

click to vote

CIKM
2000
Springer

104views Information Technology» more CIKM 2000»

Relevance and Reinforcement in Interactive Browsing

15 years 7 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

claim paper

Read More »

117

click to vote

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

Hierarchical Policy Gradient Algorithms

16 years 4 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 140 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers