Sciweavers

» Learning Behaviors Models for Robot Execution Control
IJCAI
2001
13 years 10 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
SP
2010
IEEE
14 years 16 days ago
Identifying Dormant Functionality in Malware Programs
To handle the growing flood of malware, security vendors and analysts rely on tools that automatically identify and analyze malicious code. Current systems for automated malwar...
Paolo Milani Comparetti, Guido Salvaneschi, Engin ...
IVA
2010
Springer
13 years 7 months ago
Using Artificial Team Members for Team Training in Virtual Environments
In a good team, members do not only perform their individual task, they also coordinate their actions with other members of the team. Developing such team skills usually involves e...
Jurriaan van Diggelen, Tijmen Muller, Karel van de...
PLDI
2012
ACM
11 years 11 months ago
Dynamic synthesis for relaxed memory models
Modern architectures implement relaxed memory models which may reorder memory operations or execute them non-atomically. Special instructions called memory fences are provided, al...
Feng Liu, Nayden Nedev, Nedyalko Prisadnikov, Mart...
ANSS
2000
IEEE
14 years 1 month ago
Flow Control and Dynamic Load Balancing in Time Warp
We present, in this paper, an algorithm which integrates flow control and dynamic load balancing in Time Warp. The algorithm is intended for use in a distributed memory environme...
Myongsu Choe, Carl Tropper