Search Sciweavers | Sciweavers

377 search results - page 21 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

261

click to vote

ICASSP
2009
IEEE

163views Signal Processing» more ICASSP 2009»

MIMO decoding based on stochastic reconstruction from multiple projections

16 years 1 months ago

Download ens.ewi.tudelft.nl

Least squares (LS) ﬁtting is one of the most fundamental techniques in science and engineering. It is used to estimate parameters from multiple noisy observations. In many probl...

Amir Leshem, Jacob Goldberger

claim paper

Read More »

152

Voted

ICC
2008
IEEE

144views Communications» more ICC 2008»

Delay-Minimal Transmission for Energy Constrained Wireless Communications

16 years 1 months ago

Download www.ece.umd.edu

—We investigate the problem of minimizing the overall transmission delay of data packets in a single-user wireless communication system, where the transmitter has a ﬁxed amount...

Jing Yang, Sennur Ulukus

claim paper

Read More »

180

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

289

click to vote

COLT
2010
Springer

238views Machine Learning» more COLT 2010»

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

15 years 4 months ago

Download www.colt2010.org

We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradie...

John Duchi, Elad Hazan, Yoram Singer

claim paper

Read More »

176

click to vote

AAAI
2010

180views Intelligent Agents» more AAAI 2010»

Relational Partially Observable MDPs

15 years 8 months ago

Download www.cs.tufts.edu

Relational Markov Decision Processes (MDP) are a useraction for stochastic planning problems since one can develop abstract solutions for them that are independent of domain size ...

Chenggang Wang, Roni Khardon

claim paper

Read More »

« Prev « First page 21 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers