Search Sciweavers | Sciweavers

1236 search results - page 195 / 248

» Opposition-Based Reinforcement Learning

148

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 7 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

173

click to vote

ANOR
2005

80views more ANOR 2005»

Entropic Penalties in Finite Games

15 years 6 months ago

Download www.science.unitn.it

The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...

Sjur Didrik Flåm, E. Cavazzuti

claim paper

Read More »

162

click to vote

ALIFE
2002

176views Modeling And Simulation» more ALIFE 2002»

Ant Colony Optimization and Stochastic Gradient Descent

15 years 6 months ago

Download ti.arc.nasa.gov

In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...

Nicolas Meuleau, Marco Dorigo

claim paper

Read More »

148

click to vote

ICRA
2009
IEEE

125views Robotics» more ICRA 2009»

Learning motor primitives for robotics

16 years 26 days ago

Download www.kyb.tuebingen.mpg.de

— The acquisition and self-improvement of novel motor skills is among the most important problems in robotics. Motor primitives offer one of the most promising frameworks for the...

Jens Kober, Jan Peters

claim paper

Read More »

168

click to vote

IJCAI
2003

119views Artificial Intelligence» more IJCAI 2003»

An Integrated Multilevel Learning Approach to Multiagent Coalition Formation

15 years 7 months ago

Download ijcai.org

In this paper we describe an integrated multilevel learning approach to multiagent coalition formation in a real-time environment. In our domain, agents negotiate to form teams to...

Leen-Kiat Soh, Xin Li

claim paper

Read More »

« Prev « First page 195 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers