Sciweavers

2108 search results - page 131 / 422

» Tracking in Reinforcement Learning

BC
2008

An implementation of reinforcement learning based on spike timing dependent plasticity

13 years 10 months ago

Download www.proberts.net

Patrick D. Roberts, Roberto A. Santiago, Gerardo L...

COLING
2008

Computational Linguistics

Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets

13 years 10 months ago

Download www.aclweb.org

James Henderson, Oliver Lemon, Kallirroi Georgila

FGCS
2008

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

13 years 10 months ago

Download labs.oracle.com

David Vengerov

IJAIT
2008

A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion

13 years 10 months ago

Download lpis.csd.auth.gr

Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...

CORR
2007
Springer

Education

Universal Reinforcement Learning

13 years 10 months ago

Download www.stanford.edu

We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...
