Sciweavers

166 search results - page 27 / 34
» Safe exploration for reinforcement learning
Sort
View
AI
2006
Springer
14 years 11 days ago
Adaptive Fraud Detection Using Benford's Law
Abstract. Adaptive Benford's Law [1] is a digital analysis technique that specifies the probabilistic distribution of digits for many commonly occurring phenomena, even for in...
Fletcher Lu, J. Efrim Boritz, H. Dominic Covvey
IROS
2006
IEEE
147views Robotics» more  IROS 2006»
14 years 2 months ago
A Hybrid Control Architecture for Autonomous Robotic Fish
— This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
AROBOTS
1998
111views more  AROBOTS 1998»
13 years 8 months ago
Emergence and Categorization of Coordinated Visual Behavior Through Embodied Interaction
This paper discusses the emergence of sensorimotor coordination for ESCHeR, a 4DOF redundant foveated robot-head, by interaction with its environment. A feedback-error-learning(FEL...
Luc Berthouze, Yasuo Kuniyoshi
JAIR
2008
148views more  JAIR 2008»
13 years 8 months ago
Learning Partially Observable Deterministic Action Models
We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...
Eyal Amir, Allen Chang
ICML
2010
IEEE
13 years 9 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...