Search Sciweavers | Sciweavers

576 search results - page 80 / 116

» Approximate controllability of a reaction-diffusion system

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

NETWORKING
2000

88 views, Computer Networks

Fairness and Aggregation: A Primal Decomposition Study

15 years 5 months ago

Download www.ece.uwaterloo.ca

Abstract. We examine the fair allocation of capacity to a large population of best-effort connections in a typical multiple access communication system supporting some bandwidth on...

André Girard, Catherine Rosenberg, Mohammed...

VC
2008

131 views

Motion synthesis with decoupled parameterization

15 years 4 months ago

Download media.korea.ac.kr

In real-time animation systems, motion interpolation techniques are widely used for their controllability and efficiency. The techniques sample the parameter space using example mo...

Dongwook Ha, JungHyun Han

NIPS
1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 5 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

ISCAS
2006
IEEE

121 views, Hardware

A frequency domain based TEQ design for DSL systems

15 years 10 months ago

Download www.cn.nctu.edu.tw

In this paper, we propose a frequency domain based de- ... that the equivalent channel is approximately an impulse. In [7], Martin et al. propose a globally convergent blind adaptive ...

Yuan-Pei Lin, Yu-Pin Lin, See-May Phoong
