Sciweavers

437 search results - page 51 / 88
» Policy Gradient Critics
ATAL
2005
Springer
14 years 2 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
VEE
2012
ACM
12 years 4 months ago
REEact: a customizable virtual execution manager for multicore platforms
With the shift to many-core chip multiprocessors (CMPs), a critical issue is how to effectively coordinate and manage the execution of applications and hardware resources to overc...
Wei Wang, Tanima Dey, Ryan W. Moore, Mahmut Aktaso...
CHI
2007
ACM
14 years 9 months ago
Understanding and developing models for detecting and differentiating breakpoints during interactive tasks
The ability to detect and differentiate breakpoints during task execution is critical for enabling defer-to-breakpoint policies within interruption management. In this work, we ex...
Shamsi T. Iqbal, Brian P. Bailey
HICSS
2009
IEEE
14 years 3 months ago
Local-Global: Reconciling Mismatched Ontologies in Development Information Systems
This paper extends pre-existing digital divide conceptualizations to further investigate the important issue of mismatches between the ontologies of state-created information syst...
Jessica Seddon Wallack, Ramesh Srinivasan
SC
2009
ACM
14 years 3 months ago
Enabling software management for multicore caches with a lightweight hardware support
The management of shared caches in multicore processors is a critical and challenging task. Many hardware and OS-based methods have been proposed. However, they may be hardly adop...
Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, X...