Search Sciweavers | Sciweavers

238 search results - page 44 / 48

» Value-Function Approximations for Partially Observable Marko...

194

click to vote

AIPS
2008

155views Artificial Intelligence» more AIPS 2008»

HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot

15 years 9 months ago

Download www.cs.bham.ac.uk

Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is ...

Mohan Sridharan, Jeremy L. Wyatt, Richard Dearden

claim paper

Read More »

177

click to vote

TR
2010

126views Hardware» more TR 2010»

Optimal Maintenance Strategies for Wind Turbine Systems Under Stochastic Weather Conditions

15 years 1 months ago

Download ise.tamu.edu

Abstract--We examine optimal repair strategies for wind turbines operated under stochastic weather conditions. In-situ sensors installed at wind turbines produce useful information...

Eunshin Byon, Lewis Ntaimo, Yu Ding

claim paper

Read More »

204

click to vote

ATAL
2003
Springer

185views Intelligent Agents» more ATAL 2003»

Optimizing information exchange in cooperative multi-agent systems

16 years 6 days ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein

claim paper

Read More »

232

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 4 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

196

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 44 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers