Search Sciweavers | Sciweavers

54 search results - page 8 / 11

» Convergence Results for Single-Step On-Policy Reinforcement-...

184

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 9 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

220

Voted

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

216

Voted

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 5 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

184

Voted

ACMICEC
2007
ACM

102views ECommerce» more ACMICEC 2007»

Learning to trade with insider information

15 years 11 months ago

Download www.cs.rpi.edu

This paper introduces algorithms for learning how to trade using insider (superior) information in Kyle's model of financial markets. Prior results in finance theory relied o...

Sanmay Das

claim paper

Read More »

164

click to vote

CACM
2010

105views more CACM 2010»

Censored exploration and the dark pool problem

15 years 7 months ago

Download www.cis.upenn.edu

We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove t...

Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, J...

claim paper

Read More »

« Prev « First page 8 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers