Search Sciweavers | Sciweavers

1236 search results - page 56 / 248

» Opposition-Based Reinforcement Learning

171

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 10 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

157

click to vote

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 5 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

185

click to vote

MAGS
2010

81views more MAGS 2010»

Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation

15 years 1 months ago

Download damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa, Patric...

claim paper

Read More »

139

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

16 years 24 days ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

153

click to vote

ACL
2010

135views Computational Linguistics» more ACL 2010»

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 4 months ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...

claim paper

Read More »

« Prev « First page 56 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers