Search Sciweavers | Sciweavers

473 search results - page 12 / 95

» Optimal policy switching algorithms for reinforcement learni...

165

Voted

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 9 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

162

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

15 years 1 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

172

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

195

click to vote

SAB
2010
Springer

189views Optimization» more SAB 2010»

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs

15 years 3 months ago

Download www.isir.upmc.fr

Reinforcement learning is one of the main adaptive mechanisms that is both well documented in animal behaviour and giving rise to computational studies in animats and robots. In th...

Olga Kozlova, Olivier Sigaud, Christophe Meyer

claim paper

Read More »

163

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

« Prev « First page 12 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers