Search Sciweavers | Sciweavers

192 search results - page 32 / 39

» Multi-agent Relational Reinforcement Learning

NIPS
2008

129 views, Information Technology

Structure Learning in Human Sequential Decision-Making

15 years 4 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

ML
1998
ACM

136 views, Machine Learning

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 2 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

NIPS
2008

130 views, Information Technology

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 4 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

ECAI
2006
Springer

127 views, Artificial Intelligence

Using Emotions for Behaviour-Selection Learning

15 years 6 months ago

Download roboticslab.uc3m.es

Emotions play a very important role in human behaviour and social interaction. In this paper we present a control architecture which uses emotions in the behaviour selection proces...

Maria Malfaz, Miguel Angel Salichs

ECAI
2000
Springer

102 views, Artificial Intelligence

Learning to Use Operational Advice

15 years 6 months ago

Download home.in.tum.de

We address the problem of advice-taking in a given domain, in particular for building a game-playing program. Our approach to solving it strives for the application of machine lea...

Johannes Fürnkranz, Bernhard Pfahringer, Herm...
