Search Sciweavers | Sciweavers

564 search results - page 65 / 113

» Improving distant supervision using inference learning

198

Voted

JMLR
2010

192views more JMLR 2010»

Inducing Tree-Substitution Grammars

15 years 26 days ago

Download jmlr.csail.mit.edu

Inducing a grammar from text has proven to be a notoriously challenging learning task despite decades of research. The primary reason for its difficulty is that in order to induce...

Trevor Cohn, Phil Blunsom, Sharon Goldwater

claim paper

Read More »

196

click to vote

ICDAR
2011
IEEE

238views Document Analysis» more ICDAR 2011»

OCR-Driven Writer Identification and Adaptation in an HMM Handwriting Recognition System

14 years 5 months ago

Download www.icdar2011.org

—We present an OCR-driven writer identification algorithm in this paper. Our algorithm learns writer-specific characteristics more precisely from explicit character alignment usi...

Huaigu Cao, Rohit Prasad, Prem Natarajan

claim paper

Read More »

150

click to vote

LREC
2008

88views Education» more LREC 2008»

A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization

15 years 7 months ago

Download www.lrec-conf.org

Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...

Oana Frunza

claim paper

Read More »

151

click to vote

ML
2010
ACM

86views Machine Learning» more ML 2010»

Semi-supervised local Fisher discriminant analysis for dimensionality reduction

15 years 4 months ago

Download www.trl.ibm.com

When only a small number of labeled samples are available, supervised dimensionality reduction methods tend to perform poorly due to overﬁtting. In such cases, unlabeled samples ...

Masashi Sugiyama, Tsuyoshi Idé, Shinichi Na...

claim paper

Read More »

162

click to vote

ICTIR
2009
Springer

129views Information Technology» more ICTIR 2009»

Training Data Cleaning for Text Classification

15 years 3 months ago

Download nmis.isti.cnr.it

Abstract. In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain; strategies are thus needed for maximizing t...

Andrea Esuli, Fabrizio Sebastiani

claim paper

Read More »

« Prev « First page 65 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers