Search Sciweavers | Sciweavers

38 search results - page 6 / 8

» Parametric Learning and Monte Carlo Optimization

201

click to vote

ISQED
2005
IEEE

125views Hardware» more ISQED 2005»

A New Method for Design of Robust Digital Circuits

16 years 19 days ago

Download www-vlsi.stanford.edu

As technology continues to scale beyond 100nm, there is a signiﬁcant increase in performance uncertainty of CMOS logic due to process and environmental variations. Traditional c...

Dinesh Patil, Sunghee Yun, Seung-Jean Kim, Alvin C...

claim paper

Read More »

228

click to vote

JIRS
2010

153views more JIRS 2010»

Active Visual Perception for Mobile Robot Localization

15 years 5 months ago

Download www2.ing.puc.cl

Abstract Localization is a key issue for a mobile robot, in particular in environments where a globally accurate positioning system, such as GPS, is not available. In these environ...

Javier Correa, Alvaro Soto

claim paper

Read More »

181

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

16 years 18 days ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

171

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

185

click to vote

ICML
2002
IEEE

133views Machine Learning» more ICML 2002»

Coordinated Reinforcement Learning

16 years 7 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

claim paper

Read More »

« Prev « First page 6 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers