Search Sciweavers | Sciweavers

164 search results - page 29 / 33

» Self-Optimizing Memory Controllers: A Reinforcement Learning...

167

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 10 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

198

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

188

click to vote

GECCO
2008
Springer

138views Optimization» more GECCO 2008»

Modular neuroevolution for multilegged locomotion

15 years 7 months ago

Download nn.cs.utexas.edu

Legged robots are useful in tasks such as search and rescue because they can effectively navigate on rugged terrain. However, it is difﬁcult to design controllers for them that ...

Vinod K. Valsalam, Risto Miikkulainen

claim paper

Read More »

179

click to vote

JSAC
2007

189views more JSAC 2007»

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

15 years 6 months ago

Download www.cs.ust.hk

— One of the distinctive features in a wireless ad hoc network is lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

claim paper

Read More »

223

click to vote

EMNLP
2011

164views Natural Language Processing» more EMNLP 2011»

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

14 years 6 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

claim paper

Read More »

« Prev « First page 29 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers