Search Sciweavers | Sciweavers

4485 search results - page 751 / 897

» Equivalences on Observable Processes

152

Voted

ALENEX
2001

105views Algorithms» more ALENEX 2001»

A Probabilistic Spell for the Curse of Dimensionality

15 years 5 months ago

Download www.dcc.uchile.cl

Range searches in metric spaces can be very di cult if the space is \high dimensional", i.e. when the histogram of distances has a large mean and a small variance. The so-cal...

Edgar Chávez, Gonzalo Navarro

claim paper

Read More »

143

Voted

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

135

Voted

IJCAI
2001

174views Artificial Intelligence» more IJCAI 2001»

Complexity of Probabilistic Planning under Average Rewards

15 years 5 months ago

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen

claim paper

Read More »

124

Voted

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

120

Voted

NIPS
2001

111views Information Technology» more NIPS 2001»

Orientation-Selective aVLSI Spiking Neurons

15 years 5 months ago

Download books.nips.cc

We describe a programmable multi-chip VLSI neuronal system that can be used for exploring spike-based information processing models. The system consists of a silicon retina, a PIC...

Shih-Chii Liu, Jörg Kramer, Giacomo Indiveri,...

claim paper

Read More »

« Prev « First page 751 / 897 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers