Sciweavers

672 search results - page 72 / 135
» Link-based Approaches for Text Retrieval
Sort
View
145
Voted
KDD
2005
ACM
125views Data Mining» more  KDD 2005»
16 years 4 months ago
Email data cleaning
Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang
127
Voted
ACL
2003
15 years 5 months ago
Unsupervised Learning of Arabic Stemming Using a Parallel Corpus
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
Monica Rogati, J. Scott McCarley, Yiming Yang
124
Voted
WWW
2006
ACM
16 years 4 months ago
Selective hypertext induced topic search
We address the problem of answering broad-topic queries on the World Wide Web. We present a link based analysis algorithm SelHITS, which is an improvement over Kleinberg's HI...
Amit C. Awekar, Pabitra Mitra, Jaewoo Kang
133
Voted
BTW
2005
Springer
91views Database» more  BTW 2005»
15 years 9 months ago
Element Relationship: Exploiting Inline Markup for Better XML Retrieval
: With the increasing popularity of semi-structured documents (particularly in the form of XML) for knowledge management, it is important to create tools that use the additional in...
Philipp Dopichaj
157
Voted
EACL
2003
ACL Anthology
15 years 5 months ago
NLP for Indexing and Retrieval of Captioned Photographs
We present a text-based approach for the automatic indexing and retrieval of digital photographs taken at crime scenes. Our research prototype, SOCIS, goes beyond keyword-based ap...
Horacio Saggion, Katerina Pastra, Yorick Wilks