Sciweavers

2011 search results - page 315 / 403
» Universal Reinforcement Learning
Sort
View
IJCAI
2007
13 years 9 months ago
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
Pablo Samuel Castro, Doina Precup
UAI
2008
13 years 9 months ago
Knowledge Combination in Graphical Multiagent Models
A graphical multiagent model (GMM) represents a joint distribution over the behavior of a set of agents. One source of knowledge aboutagents'behaviormaycomefromgametheoretic ...
Quang Duong, Michael P. Wellman, Satinder P. Singh
AAAI
2006
13 years 9 months ago
Hard Constrained Semi-Markov Decision Processes
In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...
Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong
FLAIRS
2004
13 years 9 months ago
Intelligent Control of Closed-Loop Sedation in Simulated ICU Patients
The intensive care unit is a challenging environment to both patient and caregiver. Continued shortages in staffing, principally in nursing, increase risk to patient and healthcar...
Brett L. Moore, Eric D. Sinzinger, Todd M. Quasny,...
ICINCO
2004
165views Robotics» more  ICINCO 2004»
13 years 9 months ago
Active Sensing Strategies for Robotic Platforms, with an Application in Vision-Based Gripping
: We present a vision-based robotic system which uses a combination of several active sensing strategies to grip a free-standing small target object with an initially unknown posit...
Benjamin Deutsch, Frank Deinzer, Matthias Zobel, J...