Search Sciweavers | Sciweavers

378 search results - page 17 / 76

» Reinforcement Learning for Online Control of Evolutionary Al...

125

click to vote

GECCO
2005
Springer

119views Optimization» more GECCO 2005»

Learning, anticipation and time-deception in evolutionary online dynamic optimization

15 years 8 months ago

Download www.cs.bham.ac.uk

In this paper we focus on an important source of problem– diﬃculty in (online) dynamic optimization problems that has so far received signiﬁcantly less attention than the tr...

Peter A. N. Bosman

claim paper

Read More »

129

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 4 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

120

click to vote

ICRA
2005
IEEE

140views Robotics» more ICRA 2005»

Fast Reinforcement Learning for Vision-guided Mobile Robots

15 years 8 months ago

Download aass.oru.se

— This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learni...

Tomás Martínez-Marín, Tom Duc...

claim paper

Read More »

117

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

15 years 5 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

145

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 5 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

« Prev « First page 17 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers