Search Sciweavers | Sciweavers

94 search results - page 10 / 19

» Sequential cost-sensitive decision making with reinforcement...

172

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

16 years 6 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

179

click to vote

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 7 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

claim paper

Read More »

131

click to vote

KESAMSTA
2007
Springer

129views Intelligent Agents» more KESAMSTA 2007»

Reinforcement Learning on a Futures Market Simulator

16 years 4 days ago

Download www.jucs.org

: In recent years, market forecasting by machine learning methods has been ﬂourishing. Most existing works use a past market data set, because they assume that each trader’s in...

Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...

claim paper

Read More »

143

click to vote

HPDC
2009
IEEE

108views Distributed And Parallel Com...» more HPDC 2009»

Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning

15 years 10 months ago

Download www.cs.vu.nl

In this paper we describe Maestro, a dataflow computation framework for Ibis, our Java-based grid middleware. The novelty of Maestro is that it is a self-organizing peer-to-peer s...

C. van Reeuwijk

claim paper

Read More »

163

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

15 years 8 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

« Prev « First page 10 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers