Search Sciweavers | Sciweavers

50 search results - page 7 / 10

» Convergence and Divergence in Standard and Averaging Reinfor...

TNN
2010

On the weight convergence of Elman networks

Abstract: An Elman network (EN) can be viewed as a feedforward (FF) neural network with an additional set of inputs from the context layer (feedback from the hidden layer). Therefo...

Qing Song

AAAI
2006

Action Selection in Bayesian Reinforcement Learning

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

CVPR
2011
IEEE

Shape Grammar Parsing via Reinforcement Learning

This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...

Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...

UAI
2003

On the Convergence of Bound Optimization Algorithms

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

EUROCAST
2007
Springer

A k-NN Based Perception Scheme for Reinforcement Learning

Abstract: Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...
