Search Sciweavers | Sciweavers

802 search results - page 69 / 161

» Experts in a Markov Decision Process

click to vote

FLAIRS
2006

101views Artificial Intelligence» more FLAIRS 2006»

Stochastic Deliberation Scheduling using GSMDPs

13 years 9 months ago

Download www.aaai.org

We propose a new decision-theoretic approach for solving execution-time deliberation scheduling problems using recent advances in Generalized Semi-Markov Decision Processes (GSMDP...

Kurt D. Krebsbach

claim paper

Read More »

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 9 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 7 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

ICMLA
2009

185views Machine Learning» more ICMLA 2009»

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

13 years 5 months ago

Download staff.science.uva.nl

Abstract--Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks a...

Mark Kroon, Shimon Whiteson

claim paper

Read More »

click to vote

AIPS
2003

90views Artificial Intelligence» more AIPS 2003»

Recommendation as a Stochastic Sequential Decision Problem

13 years 9 months ago

Download www.aaai.org

Recommender systems — systems that suggest to users in e-commerce sites items that might interest them — adopt a static view of the recommendation process and treat it as a pr...

Ronen I. Brafman, David Heckerman, Guy Shani

claim paper

Read More »

« Prev « First page 69 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers