Search Sciweavers | Sciweavers

523 search results - page 60 / 105

» Structured Solution Methods for Non-Markovian Decision Proce...

UAI
2008

230 views, Artificial Intelligence

Partitioned Linear Programming Approximations for MDPs

15 years 5 months ago

Download uai2008.cs.helsinki.fi

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal...

Branislav Kveton, Milos Hauskrecht

ISF
2007

119 views

Managing the false alarms: A framework for assurance and verification of surveillance monitoring

15 years 3 months ago

Download scissec.scis.ecu.edu.au

This article discusses methods to support assurance of surveillance monitoring; and compliance verification knowledge management (CV-KM). The discussion includes aspects of primar...

Peter Goldschmidt

ICRA
2007
IEEE

155 views, Robotics

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

ATAL
1997
Springer

118 views, Intelligent Agents

Toward the Specification and Design of Industrial Synthetic Ecosystems

15 years 8 months ago

Download www.newvectors.net

Many agent-based systems rely for their effectiveness on the intelligence of individual agents, and interaction among agents is required simply to coordinate these individually com...

H. Van Dyke Parunak, John A. Sauter, Steve Clark

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
