Sciweavers

1166 search results - page 174 / 234
» Negotiating Using Rewards
Sort
View
PKDD
2010
Springer
164views Data Mining» more  PKDD 2010»
13 years 8 months ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
WDAG
2010
Springer
164views Algorithms» more  WDAG 2010»
13 years 8 months ago
It's on Me! The Benefit of Altruism in BAR Environments
Abstract. Cooperation, a necessity for any peer-to-peer (P2P) cooperative service, is often achieved by rewarding good behavior now with the promise of future benefits. However, in...
Edmund L. Wong, Joshua B. Leners, Lorenzo Alvisi
GECCO
2009
Springer
113views Optimization» more  GECCO 2009»
13 years 7 months ago
Single step evolution of robot controllers for sequential tasks
The generation of robot controllers for a task requiring a sequence of elementary behaviors is still a challenge. If these behaviors are known, intermediate steps can be given to ...
Stéphane Doncieux, Jean-Baptiste Mouret
CORR
2011
Springer
175views Education» more  CORR 2011»
13 years 5 months ago
Adaptive Channel Recommendation for Dynamic Spectrum Access
—We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average rewa...
Xu Chen, Jianwei Huang, Husheng Li
JSAC
2010
130views more  JSAC 2010»
13 years 4 months ago
Adaptive Spatial Intercell Interference Cancellation in Multicell Wireless Networks
Downlink spatial intercell interference cancellation (ICIC) is considered for mitigating other-cell interference using multiple transmit antennas. A principle question we explore ...
Jun Zhang, Jeffrey G. Andrews