Sciweavers

1176 search results - page 42 / 236
» Sparse reward processes
Sort
View
143
Voted
ICC
2008
IEEE
169views Communications» more  ICC 2008»
15 years 10 months ago
Optimality of Myopic Sensing in Multi-Channel Opportunistic Access
—We consider opportunistic communications over multiple channels where the state (“good” or “bad”) of each channel evolves as independent and identically distributed Mark...
Tara Javidi, Bhaskar Krishnamachari, Qing Zhao, Mi...
135
Voted
IWANN
1999
Springer
15 years 8 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
138
Voted
NECO
2007
150views more  NECO 2007»
15 years 3 months ago
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
142
Voted
IGARSS
2009
15 years 1 months ago
General Framework on Change Detection in a Sparse Domain
The paper presents a general framework for change detection in radar images, for an operational purpose and in the context of environmental monitoring. This framework is based on ...
Abdourrahmane M. Atto, Grégoire Mercier, Do...
CVPR
1999
IEEE
16 years 5 months ago
Integrating Shape from Shading and Range Data Using Neural Networks
This paper presents a framework for integrating multiple sensory data, sparse range data and dense depth maps from shape from shading in order to improve the 3D reconstruction of ...
Mostafa G.-H. Mostafa, Sameh M. Yamany, Aly A. Far...