Sciweavers

7504 search results - page 221 / 1501
» Computing with Action Potentials
CDC
2009
IEEE
Control Systems
14 years 1 month ago
Q-learning and Pontryagin's Minimum Principle
Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...
Prashant G. Mehta, Sean P. Meyn
ESEC
1997
Springer
14 years 13 days ago
Verification of Liveness Properties Using Compositional Reachability Analysis
The software architecture of a distributed program can be represented by a hierarchical composition of subsystems, with interacting processes at the leaves of the hierarchy. Compo...
Shing-Chi Cheung, Dimitra Giannakopoulou, Jeff Kra...
ICRA
1995
IEEE
Robotics
13 years 11 months ago
Vision-Based Reinforcement Learning for Purposive Behavior Acquisition
This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement...
Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, ...
ERCIMDL
2009
Springer
Education
13 years 11 months ago
The Planets Interoperability Framework
We report on the implementation of a software infrastructure for preservation actions, carried out in the context of the European Integrated Project Planets – the Planets Interop...
Ross King, Rainer Schmidt, Andrew N. Jackson, Carl...
AIPS
2003
13 years 9 months ago
Reasoning about Autonomous Processes in an Estimated-Regression Planner
We examine the issues that arise in extending an estimated-regression (ER) planner to reason about autonomous processes that run and have continuous and discrete effects without th...
Drew V. McDermott