Search Sciweavers | Sciweavers

109 search results - page 9 / 22

» Policy teaching through reward function learning

150

click to vote

ECML
2007
Springer

133views Machine Learning» more ECML 2007»

Transfer Learning in Reinforcement Learning Problems Through Partial Policy Recycling

16 years 26 days ago

Download dtai.cs.kuleuven.be

In this paper we investigate the relation between transfer learning in reinforcement learning with function approximation and supervised learning with concept drift. We present a n...

Jan Ramon, Kurt Driessens, Tom Croonenborghs

claim paper

Read More »

180

Voted

ML
1998
ACM

117views Machine Learning» more ML 1998»

Learning Team Strategies: Soccer Case Studies

15 years 6 months ago

Download igitur-archive.library.uu.nl

We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy, but may behave di erently due to position-dependent inputs. All...

Rafal Salustowicz, Marco Wiering, Jürgen Schm...

claim paper

Read More »

139

Voted

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 8 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

157

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

188

click to vote

AIIA
2007
Springer

147views Artificial Intelligence» more AIIA 2007»

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

16 years 27 days ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

« Prev « First page 9 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers