Search Sciweavers | Sciweavers

» Feature Markov Decision Processes

Monte Carlo Value Iteration for Continuous-State POMDPs

14 years 11 months ago

Download www.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) have been successfully applied to various robot motion planning tasks under uncertainty. However, most existing POMDP algo...

Haoyu Bai, David Hsu, Wee Sun Lee, and Vien A. Ngo

JMLR
2010

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

NN
2010
Springer

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

AAAI
2011

Linear Dynamic Programs for Resource Management

14 years 3 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

CORR
2002
Springer

Data Engineering for the Analysis of Semiconductor Manufacturing Data

15 years 3 months ago

Download cogprints.org

We have analyzed manufacturing data from several different semiconductor manufacturing plants, using decision tree induction software called Q-YIELD. The software generates rules ...

Peter D. Turney
