Sciweavers

15614 search results - page 2971 / 3123
» The State of State
Sort
View
ICML
2010
IEEE
15 years 5 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
145
Voted
ICML
2010
IEEE
15 years 5 months ago
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov
HASKELL
2008
ACM
15 years 5 months ago
Lightweight monadic regions
We present Haskell libraries that statically ensure the safe use of resources such as file handles. We statically prevent accessing an already closed handle or forgetting to clos...
Oleg Kiselyov, Chung-chieh Shan
ISLPED
2010
ACM
204views Hardware» more  ISLPED 2010»
15 years 4 months ago
Maximum power transfer tracking for a photovoltaic-supercapacitor energy system
It is important to maintain high efficiency when charging electrical energy storage elements so as to achieve holistic optimization from an energy generation source (e.g., a solar...
Younghyun Kim, Naehyuck Chang, Yanzhi Wang, Massou...
169
Voted
MM
2010
ACM
251views Multimedia» more  MM 2010»
15 years 4 months ago
A cognitive approach for effective coding and transmission of 3D video
Reliable delivery of 3D video contents to a wide set of users is expected to be the next big revolution in multimedia applications provided that it is possible to grant a certain ...
Simone Milani, Giancarlo Calvagno
« Prev « First page 2971 / 3123 Last » Next »