Sciweavers

813 search results - page 154 / 163
» Ensemble Algorithms in Reinforcement Learning
DSP
2006
13 years 7 months ago
Adaptive multi-modality sensor scheduling for detection and tracking of smart targets
This paper considers the problem of sensor scheduling for the purposes of detection and tracking of "smart" targets. Smart targets are targets that can detect when they ...
Christopher M. Kreucher, Doron Blatt, Alfred O. He...
AR
1998
13 years 7 months ago
A cognitive robot architecture based on tactile and visual information
In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...
Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...
NN
2010
Springer
13 years 5 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
ATAL
2006
Springer
13 years 11 months ago
Efficient agents for cliff-edge environments with a large set of decision options
This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...
Ron Katz, Sarit Kraus
KDD
2010
ACM
13 years 5 months ago
Exploitation and exploration in a performance based contextual advertising system
The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize on better-performing ads. The streaming nature of on...
Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...