Search Sciweavers | Sciweavers

43 search results - page 7 / 9

» Training Reinforcement Neurocontrollers Using the Polytope A...

218

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 6 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

197

Voted

WWW
2009
ACM

200views Internet Technology» more WWW 2009»

Learning to recognize reliable users and content in social media with coupled mutual reinforcement

16 years 7 months ago

Download www.mathcs.emory.edu

Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yah...

Jiang Bian, Yandong Liu, Ding Zhou, Eugene Agichte...

claim paper

Read More »

210

click to vote

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

196

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 6 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

217

Voted

ICML
2005
IEEE

123views Machine Learning» more ICML 2005»

A model for handling approximate, noisy or incomplete labeling in text classification

16 years 7 months ago

Download www.cse.iitb.ac.in

We introduce a Bayesian model, BayesANIL, that is capable of estimating uncertainties associated with the labeling process. Given a labeled or partially labeled training corpus of...

Ganesh Ramakrishnan, Krishna Prasad Chitrapura, Ra...

claim paper

Read More »

« Prev « First page 7 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers