Search Sciweavers | Sciweavers

682 search results - page 62 / 137

» One-Counter Markov Decision Processes

138

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 5 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

131

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 3 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

124

click to vote

AIPS
2003

90views Artificial Intelligence» more AIPS 2003»

Recommendation as a Stochastic Sequential Decision Problem

15 years 5 months ago

Download www.aaai.org

Recommender systems — systems that suggest to users in e-commerce sites items that might interest them — adopt a static view of the recommendation process and treat it as a pr...

Ronen I. Brafman, David Heckerman, Guy Shani

claim paper

Read More »

126

click to vote

NIPS
2004

224views Information Technology» more NIPS 2004»

Approximately Efficient Online Mechanism Design

15 years 5 months ago

Download www.cs.cmu.edu

Online mechanism design (OMD) addresses the problem of sequential decision making in a stochastic environment with multiple self-interested agents. The goal in OMD is to make valu...

David C. Parkes, Satinder P. Singh, Dimah Yanovsky

claim paper

Read More »

164

click to vote

STACS
2012
Springer

260views Theoretical Computer Science» more STACS 2012»

Stabilization of Branching Queueing Networks

13 years 11 months ago

Download www.model.in.tum.de

Queueing networks are gaining attraction for the performance analysis of parallel computer systems. A Jackson network is a set of interconnected servers, where the completion of a...

Tomás Brázdil, Stefan Kiefer

claim paper

Read More »

« Prev « First page 62 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers