Search Sciweavers | Sciweavers

181

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 8 months ago

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

171

click to vote

AIEDAM
1998

87views more AIEDAM 1998»

Learning to set up numerical optimizations of engineering designs

15 years 6 months ago

Download ti.arc.nasa.gov

Gradient-based numerical optimization of complex engineering designs offers the promise of rapidly producing better designs. However, such methods generally assume that the object...

Mark Schwabacher, Thomas Ellman, Haym Hirsh

claim paper

Read More »

182

click to vote

AR
1998

106views more AR 1998»

A cognitive robot architecture based on tactile and visual information

15 years 6 months ago

Download www-kasm.nii.ac.jp

In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...

Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...

claim paper

Read More »

144

click to vote

BC
1998

85views more BC 1998»

Spatial asymmetries in cat retinal ganglion cell responses

15 years 6 months ago

Download web.mac.com

Abstract. Enroth-Cugell and Robson (1966) ®rst proposed a classi®cation of retinal ganglion cells into X cells, which exhibit approximate linear spatial summation and largely sus...

Paolo Gaudiano, Andrzej W. Przybyszewski, Richard ...

claim paper

Read More »

181

click to vote

BC
1998

109views more BC 1998»

Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity

15 years 6 months ago

Download lis.epfl.ch

Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are dicult to achieve because the agents incur ...

Javier Zamora, José del R. Millán, A...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers