Sciweavers

214 search results - page 38 / 43
» Finding Optimal Strategies for Imperfect Information Games
ATAL
2003
Springer
14 years 28 days ago
Coordination in multiagent reinforcement learning: a Bayesian approach
Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...
Georgios Chalkiadakis, Craig Boutilier
AAAI
2010
13 years 9 months ago
Reinforcement Learning Via Practice and Critique Advice
We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...
Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...
SIAMDM
2002
13 years 7 months ago
How to Be an Efficient Snoop, or the Probe Complexity of Quorum Systems
A quorum system is a collection of sets (quorums) every two of which intersect. Quorum systems have been used for many applications in the area of distributed systems, including mu...
David Peleg, Avishai Wool
ICC
2009
IEEE
14 years 2 months ago
Design and Evaluation of a Multilevel Decoder for Satellite Communications
In this paper, we propose a multilevel coding (MLC) scheme suitable for satellite communications, where different QoS levels are required. We introduce a novel characterization ...
Aharon Vargas, Marco Breiling, Wolfgang H. Gerstac...
AINA
2008
IEEE
14 years 2 months ago
Introducing Variable Gap Penalties into Three-Sequence Alignment for Protein Sequences
The commonly used gap penalty strategies, constant penalty and affine gap penalty, have been adopted in the traditional three-sequence alignment algorithm which considers the inserti...
Che-Lun Hung, Chun-Yuan Lin, Yeh-Ching Chung, Chua...