Search Sciweavers | Sciweavers

97 search results - page 4 / 20

» Learning against multiple opponents

COLT
2006
Springer

Machine Learning

Online Learning with Variable Stage Duration

15 years 10 months ago

Download www.ece.mcgill.ca

We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well known results esta...

Shie Mannor, Nahum Shimkin

NIPS
2004

Information Technology

New Criteria and a New Algorithm for Learning in Multi-Agent Systems

15 years 8 months ago

Download books.nips.cc

We propose a new set of criteria for learning algorithms in multi-agent systems, one that is more stringent and (we argue) better justified than previous proposed criteria. Our cr...

Rob Powers, Yoav Shoham

ICCBR
2010
Springer

Automated Reasoning

Imitating Inscrutable Enemies: Learning from Stochastic Policy Observation, Retrieval and Reuse

15 years 10 months ago

Download www.cse.lehigh.edu

In this paper we study the topic of CBR systems learning from observations in which those observations can be represented as stochastic policies. We describe a general framework wh...

Kellen Gillespie, Justin Karneeb, Stephen Lee-Urba...

COLT
2003
Springer

Machine Learning

On-Line Learning with Imperfect Monitoring

15 years 12 months ago

Download www.ece.mcgill.ca

We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We define the Part...

Shie Mannor, Nahum Shimkin

AAAI
2004

Intelligent Agents

Performance Bounded Reinforcement Learning in Strategic Interactions

15 years 8 months ago

Download www.aaai.org

Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeably lacking. T...

Bikramjit Banerjee, Jing Peng
