Search Sciweavers | Sciweavers

166 search results - page 27 / 34

» Safe exploration for reinforcement learning

171

click to vote

AI
2006
Springer

197views Artificial Intelligence» more AI 2006»

Adaptive Fraud Detection Using Benford's Law

15 years 9 months ago

Download csc.lsu.edu

Abstract. Adaptive Benford's Law [1] is a digital analysis technique that specifies the probabilistic distribution of digits for many commonly occurring phenomena, even for in...

Fletcher Lu, J. Efrim Boritz, H. Dominic Covvey

claim paper

Read More »

129

click to vote

IROS
2006
IEEE

147views Robotics» more IROS 2006»

A Hybrid Control Architecture for Autonomous Robotic Fish

15 years 11 months ago

Download cswww.essex.ac.uk

— This paper presents a hybrid control architecture for autonomous robotic ﬁshes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...

Jindong Liu, Huosheng Hu, Dongbing Gu

claim paper

Read More »

165

click to vote

AROBOTS
1998

111views more AROBOTS 1998»

Emergence and Categorization of Coordinated Visual Behavior Through Embodied Interaction

15 years 5 months ago

Download www.informatics.sussex.ac.uk

This paper discusses the emergence of sensorimotor coordination for ESCHeR, a 4DOF redundant foveated robot-head, by interaction with its environment. A feedback-error-learning(FEL...

Luc Berthouze, Yasuo Kuniyoshi

claim paper

Read More »

209

Voted

JAIR
2008

148views more JAIR 2008»

Learning Partially Observable Deterministic Action Models

15 years 5 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

claim paper

Read More »

167

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 6 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

« Prev « First page 27 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers