Sciweavers

1236 search results - page 195 / 248
» Opposition-Based Reinforcement Learning
Sort
View
NIPS
2003
13 years 11 months ago
Policy Search by Dynamic Programming
We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...
J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
ANOR
2005
80views more  ANOR 2005»
13 years 9 months ago
Entropic Penalties in Finite Games
The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...
Sjur Didrik Flåm, E. Cavazzuti
ALIFE
2002
13 years 9 months ago
Ant Colony Optimization and Stochastic Gradient Descent
In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...
Nicolas Meuleau, Marco Dorigo
ICRA
2009
IEEE
125views Robotics» more  ICRA 2009»
14 years 4 months ago
Learning motor primitives for robotics
— The acquisition and self-improvement of novel motor skills is among the most important problems in robotics. Motor primitives offer one of the most promising frameworks for the...
Jens Kober, Jan Peters
IJCAI
2003
13 years 11 months ago
An Integrated Multilevel Learning Approach to Multiagent Coalition Formation
In this paper we describe an integrated multilevel learning approach to multiagent coalition formation in a real-time environment. In our domain, agents negotiate to form teams to...
Leen-Kiat Soh, Xin Li