Search Sciweavers | Sciweavers

15

ICML
2003
IEEE

137views Machine Learning» more ICML 2003»

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

14 years 7 months ago

We present BL-WoLF, a framework for learnability in repeated zero-sum games where the cost of learning is measured by the losses the learning agent accrues (rather than the number...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

22

click to vote

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

No-regret learning in convex games

14 years 7 months ago

Download www.cs.cmu.edu

Quite a bit is known about minimizing different kinds of regret in experts problems, and how these regret types relate to types of equilibria in the multiagent setting of repeated...

Geoffrey J. Gordon, Amy R. Greenwald, Casey Marks

claim paper

Read More »

31

click to vote

PKAW
2010

148views Knowledge Management» more PKAW 2010»

MMG: A Learning Game Platform for Understanding and Predicting Human Recall Memory

13 years 5 months ago

Download bi.snu.ac.kr

How humans infer probable information from the limited observed data? How they are able to build on little knowledge about the context in hand? Is the human memory repeatedly const...

Umer Fareed, Byoung-Tak Zhang

claim paper

Read More »

19

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

21

click to vote

NIPS
2003

152views Information Technology» more NIPS 2003»

Learning Near-Pareto-Optimal Conventions in Polynomial Time

13 years 8 months ago

Download www-2.cs.cmu.edu

We study how to learn to play a Pareto-optimal strict Nash equilibrium when there exist multiple equilibria and agents may have different preferences among the equilibria. We focu...

Xiao Feng Wang, Tuomas Sandholm

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers