Search Sciweavers | Sciweavers

651 search results - page 66 / 131

» Algorithms for Inverse Reinforcement Learning

NECO
2007

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 5 months ago

Download www.coneural.org

The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (...

Razvan V. Florian

ATAL
2009
Springer

Solving multiagent assignment Markov decision processes

16 years 25 days ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called “Assignmen...

Scott Proper, Prasad Tadepalli

ACMICEC
2008
ACM

Adapting the interaction state model in conversational recommender systems

15 years 8 months ago

Download www.inf.unibz.it

Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...

Tariq Mahmood, Francesco Ricci

EUROCAST
2007
Springer

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 13 days ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

KDD
2002
ACM

Sequential cost-sensitive decision making with reinforcement learning

16 years 6 months ago

Download www.research.ibm.com

Recently, there has been increasing interest in the issues of cost-sensitive learning and decision making in a variety of applications of data mining. A number of approaches have ...

Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny
