Search Sciweavers | Sciweavers

50 search results - page 6 / 10

» Convergence and Divergence in Standard and Averaging Reinfor...

click to vote

JMLR
2010

161views more JMLR 2010»

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

claim paper

Read More »

click to vote

BMCBI
2006

124views more BMCBI 2006»

Detection of divergent genes in microbial aCGH experiments

13 years 7 months ago

Download www.biomedcentral.com

Background: Array-based comparative genome hybridization (aCGH) is a tool for rapid comparison of genomes from different bacterial strains. The purpose of such analysis is to dete...

Lars Snipen, Dirk Repsilber, Ludvig Nyquist, &Arin...

claim paper

Read More »

click to vote

EMNLP
2008

104views Natural Language Processing» more EMNLP 2008»

Soft-Supervised Learning for Text Classification

13 years 9 months ago

Download ssli.ee.washington.edu

We propose a new graph-based semisupervised learning (SSL) algorithm and demonstrate its application to document categorization. Each document is represented by a vertex within a ...

Amarnag Subramanya, Jeff Bilmes

claim paper

Read More »

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 7 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

14 years 8 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

« Prev « First page 6 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers