Sciweavers

166 search results - page 30 / 34
» Safe exploration for reinforcement learning
Sort
View
ICRA
2005
IEEE
128views Robotics» more  ICRA 2005»
14 years 2 months ago
Vibration-based Terrain Analysis for Mobile Robots
—Safe, autonomous mobility in rough terrain is an important requirement for planetary exploration rovers. Knowledge of local terrain properties is critical to ensure a rover’s ...
Christopher A. Brooks, Karl Iagnemma, Steven Dubow...
COGSR
2011
71views more  COGSR 2011»
13 years 3 months ago
Psychological models of human and optimal performance in bandit problems
In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...
Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
COLT
2010
Springer
13 years 6 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
JMLR
2010
141views more  JMLR 2010»
13 years 3 months ago
Pinview: Implicit Feedback in Content-Based Image Retrieval
This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...
Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...
ISCAS
2002
IEEE
153views Hardware» more  ISCAS 2002»
14 years 1 months ago
Biological learning modeled in an adaptive floating-gate system
We have implemented an aspect of learning and memory in the nervous system using analog electronics. Using a simple synaptic circuit we realize networks with Hebbian type adaptati...
Christal Gordon, Paul E. Hasler