Search Sciweavers | Sciweavers

377 search results - page 17 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

206

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 10 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

186

click to vote

CDC
2010
IEEE

138views Control Systems» more CDC 2010»

Sensor-based robot deployment algorithms

15 years 1 months ago

Download www.seas.upenn.edu

Abstract-- In robot deployment problems, the fundamental issue is to optimize a steady state performance measure that depends on the spatial configuration of a group of robots. For...

Jerome Le Ny, George J. Pappas

claim paper

Read More »

211

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Aggregation-based model reduction of a Hidden Markov Model

15 years 1 months ago

Download mechse.illinois.edu

This paper is concerned with developing an information-theoretic framework to aggregate the state space of a Hidden Markov Model (HMM) on discrete state and observation spaces. The...

Kun Deng, Prashant G. Mehta, Sean P. Meyn

claim paper

Read More »

178

click to vote

EVOW
2005
Springer

139views Artificial Intelligence» more EVOW 2005»

Convergence Synthesis of Dynamic Frequency Modulation Tones Using an Evolution Strategy

16 years 6 days ago

Download www.teamaxe.co.uk

This paper reports on steps that have been taken to enhance previously presented evolutionary sound matching work. In doing so, the convergence characteristics are shown to provide...

Thomas J. Mitchell, Anthony G. Pipe

claim paper

Read More »

156

click to vote

JSAC
2006

79views more JSAC 2006»

Layered Multicast Rate Control Based on Lagrangian Relaxation and Dynamic Programming

15 years 6 months ago

Download www.ecse.rpi.edu

In this paper, we address the rate control problem for layered multicast traffic, with the objective of solving a generalized throughput/fairness objective. Our approach is based o...

Koushik Kar, Leandros Tassiulas

claim paper

Read More »

« Prev « First page 17 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers