Search Sciweavers | Sciweavers

688 search results - page 11 / 138

» Using reinforcement learning to adapt an imitation task


NIPS 2000


Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 6 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton


HT 2009, ACM


Improving recommender systems with adaptive conversational strategies

15 years 12 months ago

Download www.inf.unibz.it

Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...

Tariq Mahmood, Francesco Ricci


ROMAN 2007, IEEE


Learning Reward Modalities for Human-Robot-Interaction in a Cooperative Training Task

15 years 11 months ago

Download www.robotopia.de

This paper proposes a novel method of learning a user's preferred reward modalities for human-robot interaction through solving a cooperative training task. A learning algorithm ...

Anja Austermann, Seiji Yamada


TSMC 2008


A Comprehensive Survey of Multiagent Reinforcement Learning

15 years 5 months ago

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter


GECCO 2009, Springer


Reinforcement learning for games: failures and successes

15 years 10 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein
