EWRL 2008 | Sciweavers

193

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 8 months ago

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

177

click to vote

EWRL
2008

121views Machine Learning» more EWRL 2008»

Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem

15 years 8 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

Two variable metric reinforcement learning methods, the natural actor-critic algorithm and the covariance matrix adaptation evolution strategy, are compared on a conceptual level a...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

151

click to vote

EWRL
2008

143views Machine Learning» more EWRL 2008»

New Error Bounds for Approximations from Projected Linear Equations

15 years 8 months ago

Download www.mit.edu

We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, whi...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

127

click to vote

EWRL
2008

121views Machine Learning» more EWRL 2008»

Probabilistic Inference for Fast Learning in Control

15 years 8 months ago

Download mlg.eng.cam.ac.uk

Carl Edward Rasmussen, Marc Peter Deisenroth

claim paper

Read More »

156

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

15 years 8 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

142

click to vote

EWRL
2008

158views Machine Learning» more EWRL 2008»

Basis Expansion in Natural Actor Critic Methods

15 years 8 months ago

Download www.ceng.metu.edu.tr

Sertan Girgin, Philippe Preux

claim paper

Read More »

171

click to vote

EWRL
2008

104views Machine Learning» more EWRL 2008»

Optimistic Planning of Deterministic Systems

15 years 8 months ago

Download eprints.pascal-network.org

If one possesses a model of a controlled deterministic system, then from any state, one may consider the set of all possible reachable states starting from that state and using any...

Jean-François Hren, Rémi Munos

claim paper

Read More »

161

click to vote

EWRL
2008

144views Machine Learning» more EWRL 2008»

Regularized Fitted Q-Iteration: Application to Planning

15 years 8 months ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

144

click to vote

EWRL
2008

133views Machine Learning» more EWRL 2008»

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

15 years 8 months ago

Download ewrl08.futurs.inria.fr

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

192

click to vote

EWRL
2008

148views Machine Learning» more EWRL 2008»

Policy Learning - A Unified Perspective with Applications in Robotics

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Policy Learning approaches are among the best suited methods for high-dimensional, continuous control systems such as anthropomorphic robot arms and humanoid robots. In this paper,...

Jan Peters, Jens Kober, Duy Nguyen-Tuong

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers