Sciweavers

495 search results - page 76 / 99
» Constructing States for Reinforcement Learning
Sort
View
AR
1998
106views more  AR 1998»
15 years 3 months ago
A cognitive robot architecture based on tactile and visual information
In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...
Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...
COGSR
2011
71views more  COGSR 2011»
14 years 11 months ago
Psychological models of human and optimal performance in bandit problems
In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...
Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
FORMATS
2004
Springer
15 years 9 months ago
Learning of Event-Recording Automata
Abstract. We extend Angluin’s algorithm for on-line learning of regular languages to the setting of timed systems. We consider systems that can be described by a class of determi...
Olga Grinchtein, Bengt Jonsson, Martin Leucker
ICPR
2010
IEEE
15 years 3 months ago
Learning Non-Linear Dynamical Systems by Alignment of Local Linear Models
Abstract—Learning dynamical systems is one of the important problems in many fields. In this paper, we present an algorithm for learning non-linear dynamical systems which works...
Masao Joko, Yoshinobu Kawahara, Takehisa Yairi
CDC
2008
IEEE
142views Control Systems» more  CDC 2008»
15 years 10 months ago
Convergence of rule-of-thumb learning rules in social networks
— We study the problem of dynamic learning by a social network of agents. Each agent receives a signal about an underlying state and communicates with a subset of agents (his nei...
Daron Acemoglu, Angelia Nedic, Asuman E. Ozdaglar