Sciweavers

1236 search results - page 53 / 248
» Efficient Interpretation Policies
NIPS
2001
13 years 10 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
OSDI
1994
ACM
13 years 10 months ago
HiPEC: High Performance External Virtual Memory Caching
Traditional operating systems use a fixed LRU-like page replacement policy and centralized frame pool that cannot properly serve all types of memory access patterns of various appli...
Chao-Hsien Lee, Meng Chang Chen, Ruei-Chuan Chang
INFOCOM
2011
IEEE
13 years 15 days ago
Resource management for fading wireless channels with energy harvesting nodes
Wireless systems comprised of rechargeable nodes have a significantly prolonged lifetime and are sustainable. A distinct characteristic of these systems is the fact that the no...
Omur Ozel, Kaya Tutuncuoglu, Jing Yang, Sennur Ulu...
NIPS
2008
13 years 10 months ago
Fitted Q-iteration by Advantage Weighted Regression
Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...
Gerhard Neumann, Jan Peters
LOGCOM
2008
13 years 8 months ago
Linking Semantic and Knowledge Representations in a Multi-Domain Dialogue System
We describe a two-layer architecture for supporting semantic interpretation and domain reasoning in dialogue systems. Building systems that support both semantic interpretation an...
Myroslava Dzikovska, James F. Allen, Mary D. Swift