Sciweavers

NIPS
1998
13 years 10 months ago
Gradient Descent for General Reinforcement Learning
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...
Leemon C. Baird III, Andrew W. Moore
AIEDAM
1998
87 views
13 years 8 months ago
Learning to set up numerical optimizations of engineering designs
Gradient-based numerical optimization of complex engineering designs offers the promise of rapidly producing better designs. However, such methods generally assume that the object...
Mark Schwabacher, Thomas Ellman, Haym Hirsh
AR
1998
106 views
13 years 8 months ago
A cognitive robot architecture based on tactile and visual information
In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...
Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...
BC
1998
85 views
13 years 8 months ago
Spatial asymmetries in cat retinal ganglion cell responses
Abstract. Enroth-Cugell and Robson (1966) first proposed a classification of retinal ganglion cells into X cells, which exhibit approximate linear spatial summation and largely sus...
Paolo Gaudiano, Andrzej W. Przybyszewski, Richard ...
BC
1998
109 views
13 years 8 months ago
Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity
Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are difficult to achieve because the agents incur ...
Javier Zamora, José del R. Millán, A...