Search Sciweavers | Sciweavers

ACG
2009
Springer

348 views, Computer Graphics

A Lock-Free Multithreaded Monte-Carlo Tree Search Algorithm

14 years 2 months ago

With the recent success of Monte-Carlo tree search algorithms in Go and other games, and the increasing number of cores in standard CPUs, the efficient parallelization of the sear...

Markus Enzenberger, Martin Müller

ANOR
2002

89 views

Imperfect Inspection Games Over Time

13 years 7 months ago

Download www.ratio.huji.ac.il

We consider an inspection game played on a continuous finite time interval. The inspector wishes to detect a violation as soon as possible after it has been made by the operator. T...

Daniel Rothenstein, Shmuel Zamir

IUI
2003
ACM

124 views, Software Engineering

Inferring user goals from personality and behavior in a causal model of user affect

14 years 22 days ago

Download people.cs.ubc.ca

We present a probabilistic model, based on Dynamic Decision Networks, to assess user affect from possible causes of emotional arousal. The model relies on the OCC cognitive theory...

Xiaoming Zhou, Cristina Conati

ATAL
2004
Springer

189 views, Intelligent Agents

Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs

14 years 27 days ago

Download www.cs.cmu.edu

Partially observable decentralized decision making in robot teams is fundamentally different from decision making in fully observable problems. Team members cannot simply apply si...

Rosemary Emery-Montemerlo, Geoffrey J. Gordon, Jef...

IJCAI
2001

151 views, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 8 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz
