Search Sciweavers | Sciweavers

269 search results - page 17 / 54

» Indexing text data under space constraints

ICDM 2002, IEEE

162 views, Data Mining

Phrase-based Document Similarity Based on an Index Graph Model

15 years 10 months ago

Download pami.uwaterloo.ca

Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...

Khaled M. Hammouda, Mohamed S. Kamel

KDD 2009, ACM

269 views, Data Mining

Extracting discriminative concepts for domain adaptation in text mining

16 years 6 months ago

Download 140.123.102.14

One common predictive modeling challenge in text mining problems is that the training data and the operational (testing) data are drawn from different underlying distributi...

Bo Chen, Wai Lam, Ivor Tsang, Tak-Lam Wong

ECIR 2008, Springer

185 views, Information Technology

Filaments of Meaning in Word Space

15 years 7 months ago

Download www.sics.se

Word space models, in the sense of vector space models built on distributional data taken from texts, are used to model semantic relations between words. We argue that the high dim...

Jussi Karlgren, Anders Holst, Magnus Sahlgren

DEXA 2006, Springer

193 views, Database

Understanding and Enhancing the Folding-In Method in Latent Semantic Indexing

15 years 9 months ago

Download bayou.cs.ucdavis.edu

Latent Semantic Indexing (LSI) has proved effective at capturing the semantic structure of document collections. It is widely used in content-based text retrieval...

Xiang Wang 0002, Xiaoming Jin

DASFAA 2007, IEEE

143 views, Database

Using Redundant Bit Vectors for Near-Duplicate Image Detection

16 years 7 days ago

Download goanna.cs.rmit.edu.au

Images are amongst the most widely proliferated forms of digital information due to affordable imaging technologies and the Web. In such an environment, the use of digital watermar...

Jun Jie Foo, Ranjan Sinha
