Search Sciweavers | Sciweavers


» Experts in a Markov Decision Process


ICML
2006
IEEE

Machine Learning

Using inaccurate models in reinforcement learning

14 years 8 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng


ICML
2006
IEEE

Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

14 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...


ICML
2003
IEEE

Machine Learning

Planning in the Presence of Cost Functions Controlled by an Adversary

14 years 8 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum


ICML
2002
IEEE

Machine Learning

Pruning Improves Heuristic Search for Cost-Sensitive Learning

14 years 8 months ago

Download web.engr.oregonstate.edu

This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to ...

Valentina Bayer Zubek, Thomas G. Dietterich


MOBIHOC
2008
ACM

Computer Networks

Routing in a cyclic mobispace

14 years 7 months ago

Download www.cse.fau.edu

A key challenge of routing in delay tolerant networks (DTNs) is to find routes that have high delivery rates and low end-to-end delays. When oracles are not available for future co...

Cong Liu, Jie Wu
