Sciweavers

437 search results - page 29 / 88
» Policy Gradient Critics
IPPS
2006
IEEE
14 years 2 months ago
Comparative study of price-based resource allocation algorithms for ad hoc networks
As mobile ad hoc networks provide a wide range of possibly critical services, providing quality of service guarantees becomes an essential element. Yet there is a limited understa...
Marcel Lüthi, Simin Nadjm-Tehrani, Calin Cure...
EMNLP
2009
13 years 6 months ago
Consensus Training for Consensus Decoding in Machine Translation
We propose a novel objective function for discriminatively tuning log-linear machine translation models. Our objective explicitly optimizes the BLEU score of expected n-gram count...
Adam Pauls, John DeNero, Dan Klein
ICRA
2008
IEEE
14 years 3 months ago
Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?
We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the observation that humans outperform robots at contact motion, we foll...
Seung-kook Yun
JMLR
2010
13 years 7 months ago
PyBrain
PyBrain is a versatile machine learning library for Python. Its goal is to provide flexible, easy-to-use yet still powerful algorithms for machine learning tasks, including a vari...
Tom Schaul, Justin Bayer, Daan Wierstra, Yi Sun, M...
POLICY
2007
Springer
14 years 2 months ago
An Automated Framework for Validating Firewall Policy Enforcement
The implementations of network security devices such as firewalls and IDSs are constantly being improved to accommodate higher security and performance standards. Using reliable a...
Adel El-Atawy, Taghrid Samak, Zein Wali, Ehab Al-S...