Search Sciweavers | Sciweavers

74 search results - page 3 / 15

» Stochastic search using the natural gradient

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

13 years 11 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

13 years 11 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

click to vote

ATAL
2008
Springer

92views Intelligent Agents» more ATAL 2008»

Stochastic search methods for nash equilibrium approximation in simulation-based games

13 years 11 months ago

Download www.seas.upenn.edu

We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a...

Yevgeniy Vorobeychik, Michael P. Wellman

claim paper

Read More »

click to vote

AUTOMATICA
2007

82views more AUTOMATICA 2007»

Simulation-based optimal sensor scheduling with application to observer trajectory planning

13 years 10 months ago

Download www.cs.ubc.ca

The sensor scheduling problem can be formulated as a controlled hidden Markov model and this paper solves the problem when the state, observation and action spaces are continuous....

Sumeetpal S. Singh, Nikolaos Kantas, Ba-Ngu Vo, Ar...

claim paper

Read More »

click to vote

NIPS
1993

103views Information Technology» more NIPS 1993»

Optimal Stochastic Search and Adaptive Momentum

13 years 11 months ago

Download www.bme.ogi.edu

Stochastic optimization algorithms typically use learning rate schedules that behave asymptotically as (t) = 0=t. The ensemble dynamics (Leen and Moody, 1993) for such algorithms ...

Todd K. Leen, Genevieve B. Orr

claim paper

Read More »

« Prev « First page 3 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers