Sciweavers

1236 search results - page 239 / 248
» Opposition-Based Reinforcement Learning
Sort
View
UAI
2008
13 years 11 months ago
Knowledge Combination in Graphical Multiagent Models
A graphical multiagent model (GMM) represents a joint distribution over the behavior of a set of agents. One source of knowledge aboutagents'behaviormaycomefromgametheoretic ...
Quang Duong, Michael P. Wellman, Satinder P. Singh
AAAI
2006
13 years 11 months ago
Hard Constrained Semi-Markov Decision Processes
In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...
Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong
FLAIRS
2004
13 years 11 months ago
Intelligent Control of Closed-Loop Sedation in Simulated ICU Patients
The intensive care unit is a challenging environment to both patient and caregiver. Continued shortages in staffing, principally in nursing, increase risk to patient and healthcar...
Brett L. Moore, Eric D. Sinzinger, Todd M. Quasny,...
ICINCO
2004
165views Robotics» more  ICINCO 2004»
13 years 11 months ago
Active Sensing Strategies for Robotic Platforms, with an Application in Vision-Based Gripping
: We present a vision-based robotic system which uses a combination of several active sensing strategies to grip a free-standing small target object with an initially unknown posit...
Benjamin Deutsch, Frank Deinzer, Matthias Zobel, J...
NIPS
2001
13 years 11 months ago
The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay
Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...
Michael Kositsky, Andrew G. Barto