character n-grams | Sciweavers

181

LREC
2010

159views Education» more LREC 2010»

The Web Library of Babel: evaluating genre collections

15 years 5 months ago

We present experiments in automatic genre classiﬁcation on web corpora, comparing a wide variety of features on several different genreannotated datasets (HGC, I-EN, KI-04, KRYS...

Serge Sharoff, Zhili Wu, Katja Markert

claim paper

Read More »

172

Voted

ICWSM
2008

113views Internet Technology» more ICWSM 2008»

A Shallow Approach to Subjectivity Classification

15 years 8 months ago

Download www.aaai.org

We present a shallow linguistic approach to subjectivity classification. Using multinomial kernel machines, we demonstrate that a data representation based on counting character n...

Stephan Raaijmakers, Wessel Kraaij

claim paper

Read More »

149

click to vote

CLEF
2008
Springer

98views Information Technology» more CLEF 2008»

JHU Ad Hoc Experiments at CLEF 2008

15 years 8 months ago

Download clef.isti.cnr.it

For CLEF 2008 JHU conducted monolingual and bilingual experiments in the ad hoc TEL and Persian tasks. The TEL task involved focused on searching electronic card catalog records i...

Paul McNamee

claim paper

Read More »

170

click to vote

CLEF
2006
Springer

110views Information Technology» more CLEF 2006»

A First Approach to CLIR Using Character N -Grams Alignment

15 years 10 months ago

Download www.grupocole.org

Abstract. This paper describes the technique for translation of character n-grams we developed for our participation in CLEF 2006. This solution avoids the need for word normalizat...

Jesús Vilares, Michael P. Oakes, John Tait

claim paper

Read More »

155

click to vote

AIMSA
2006
Springer

122views Artificial Intelligence» more AIMSA 2006»

N-Gram Feature Selection for Authorship Identification

15 years 10 months ago

Download www.icsd.aegean.gr

Automatic authorship identification offers a valuable tool for supporting crime investigation and security. It can be seen as a multi-class, single-label text categorization task. ...

John Houvardas, Efstathios Stamatatos

claim paper

Read More »

187

click to vote

NLDB
2007
Springer

113views Natural Language Processing» more NLDB 2007»

Character N-Grams Translation in Cross-Language Information Retrieval

16 years 27 days ago

Download www.grupocole.org

Abstract. This paper describes a new technique for the direct translation of character n-grams for use in Cross-Language Information Retrieval systems. This solution avoids the nee...

Jesús Vilares, Michael P. Oakes, Manuel Vil...

claim paper

Read More »

183

click to vote

SIGIR
2009
ACM

134views Information Technology» more SIGIR 2009»

Addressing morphological variation in alphabetic languages

16 years 1 months ago

Download web.jhu.edu

The selection of indexing terms for representing documents is a key decision that limits how eﬀective subsequent retrieval can be. Often stemming algorithms are used to normaliz...

Paul McNamee, Charles K. Nicholas, James Mayfield

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers