Sciweavers

1211 search results - page 125 / 243
» Adaptive critics for dynamic optimization
AGI
2011
14 years 8 months ago
Reinforcement Learning and the Bayesian Control Rule
We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...
Pedro Alejandro Ortega, Daniel Alexander Braun, Si...
SIAMCO
2002
15 years 4 months ago
Consistent Approximations and Approximate Functions and Gradients in Optimal Control
As shown in [7], optimal control problems with either ODE or PDE dynamics can be solved efficiently using a setting of consistent approximations obtained by numerical discretizati...
Olivier Pironneau, Elijah Polak
TPDS
2008
15 years 4 months ago
rStream: Resilient and Optimal Peer-to-Peer Streaming with Rateless Codes
Due to the lack of stability and reliability in peer-to-peer networks, multimedia streaming over peer-to-peer networks represents several fundamental engineering challenges. First...
Chuan Wu, Baochun Li
ICC
2008
IEEE
15 years 11 months ago
Cross-Layer Design for the MIMO System with Zero-Forcing Receiver in the Presence of Channel Estimation Error
Multiple input multiple output (MIMO) system has been recognized as a promising candidate for future wireless communication. The adaptive modulation which adjusts the transmitte...
Feng Jiang, Ying Wang, Xi Fang, Kai Sun, Guona Hu,...
CGO
2005
IEEE
15 years 10 months ago
Maintaining Consistency and Bounding Capacity of Software Code Caches
Software code caches are becoming ubiquitous, in dynamic optimizers, runtime tool platforms, dynamic translators, fast simulators and emulators, and dynamic compilers. Caching fre...
Derek Bruening, Saman P. Amarasinghe