Sciweavers

540 search results - page 70 / 108
» The Sinkhorn-Knopp Algorithm: Convergence and Applications
Sort
View
AAAI
2004
13 years 9 months ago
Performance Bounded Reinforcement Learning in Strategic Interactions
Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeably lacking. T...
Bikramjit Banerjee, Jing Peng
AAMAS
2007
Springer
13 years 8 months ago
Generalized multiagent learning with performance bound
Abstract – Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeab...
Bikramjit Banerjee, Jing Peng
IJCAI
2007
13 years 9 months ago
Forward Search Value Iteration for POMDPs
Recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods which quickly converge to an approximate solution for medium-sized problems...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
NIPS
2004
13 years 9 months ago
A Generalized Bradley-Terry Model: From Group Competition to Individual Skill
The Bradley-Terry model for paired comparison has been popular in many areas. We propose a generalized version in which paired individual comparisons are extended to paired team c...
Tzu-Kuo Huang, Chih-Jen Lin, Ruby C. Weng
TWC
2010
13 years 2 months ago
Distributed power allocation in multi-user multi-channel cellular relay networks
In this paper, we consider the amplify-and-forward relaying transmission in the downlink of a multi-channel cellular network with one base station and multiple relay-destination pa...
Shaolei Ren, Mihaela van der Schaar