Search Sciweavers | Sciweavers

1234 search results - page 30 / 247

» Multi-criteria Reinforcement Learning

158

click to vote

IJCAI
2007

179views Artificial Intelligence» more IJCAI 2007»

Deictic Option Schemas

15 years 7 months ago

Download www.ijcai.org

Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this...

Balaraman Ravindran, Andrew G. Barto, Vimal Mathew

claim paper

Read More »

155

click to vote

BC
1998

109views more BC 1998»

Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity

15 years 5 months ago

Download lis.epfl.ch

Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are dicult to achieve because the agents incur ...

Javier Zamora, José del R. Millán, A...

claim paper

Read More »

208

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 8 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

185

click to vote

ICCBR
2009
Springer

134views Automated Reasoning» more ICCBR 2009»

Improving Reinforcement Learning by Using Case Based Heuristics

16 years 14 days ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and ...

Reinaldo A. C. Bianchi, Raquel Ros, Ramon Ló...

claim paper

Read More »

173

click to vote

CORR
2002
Springer

100views Education» more CORR 2002»

A neural model for multi-expert architectures

15 years 5 months ago

Download user.cs.tu-berlin.de

We present a generalization of conventional artificial neural networks that allows for a functional equivalence to multi-expert systems. The new model provides an architectural fr...

Marc Toussaint

claim paper

Read More »

« Prev « First page 30 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers