Sciweavers

27 search results - page 4 / 6
» Improved Rates for the Stochastic Continuum-Armed Bandit Pro...
Sort
View
GECCO
2004
Springer
140views Optimization» more  GECCO 2004»
14 years 14 days ago
A Sensitivity Analysis of a Cooperative Coevolutionary Algorithm Biased for Optimization
Abstract. Recent theoretical work helped explain certain optimizationrelated pathologies in cooperative coevolutionary algorithms (CCEAs). Such explanations have led to adopting sp...
Liviu Panait, R. Paul Wiegand, Sean Luke
CISS
2010
IEEE
12 years 10 months ago
The maximum stable broadcast throughput for wireless line networks with network coding and topology control
—We consider broadcasting from a single source to multiple destinations in a linear wireless erasure network with feedback. The problem is to find the maximum stable throughput ...
Ka-Hung Hui, Yalin Evren Sagduyu, Dongning Guo, Ra...
ICRA
2010
IEEE
145views Robotics» more  ICRA 2010»
13 years 5 months ago
Reinforcement learning of motor skills in high dimensions: A path integral approach
— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
JMLR
2010
148views more  JMLR 2010»
13 years 1 months ago
A Generalized Path Integral Control Approach to Reinforcement Learning
With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
NLE
2007
180views more  NLE 2007»
13 years 6 months ago
Segmentation and alignment of parallel text for statistical machine translation
We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...
Yonggang Deng, Shankar Kumar, William Byrne