document representation

316

IRFC
2011
Springer

373views Information Technology» more IRFC 2011»

Multilingual Document Clustering Using Wikipedia as External Knowledge

14 years 10 months ago

This paper presents Multilingual Document Clustering (MDC) on comparable corpora. Wikipedia, a structured multilingual knowledge base, has been highly exploited in many monolingual...

N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma

claim paper

Read More »

178

click to vote

IPM
2008

123views more IPM 2008»

Effectiveness of additional representations for the search result presentation on the web

15 years 6 months ago

Download www.dcs.gla.ac.uk

The presentation of search results on the web has been dominated by the textual form of document representation. On the other hand, the document's visual aspects such as the ...

Hideo Joho, Joemon M. Jose

claim paper

Read More »

177

click to vote

ECIR
2003
Springer

91views Information Technology» more ECIR 2003»

Taming Wild Phrases

15 years 8 months ago

Download www.cs.ru.nl

Abstract. In this paper the suitability of diﬀerent document representations for automatic document classiﬁcation is compared, investigating a whole range of representations be...

Cornelis H. A. Koster, Marc Seutter

claim paper

Read More »

189

click to vote

RIAO
2007

141views Information Technology» more RIAO 2007»

Effectiveness of Rich Document Representation in XML Retrieval

15 years 8 months ago

Download riao.free.fr

Information Retrieval (IR) systems are built with different goals in mind. Some IR systems target high precision that is to have more relevant documents on the first page of their...

Fahimeh Raja, Mostafa Keikha, Maseud Rahgozar, Far...

claim paper

Read More »

176

click to vote

CIKM
2006
Springer

159views Information Technology» more CIKM 2006»

Representing documents with named entities for story link detection (SLD)

15 years 10 months ago

Download www.unc.edu

Several information organization, access, and filtering systems can benefit from different kind of document representations than those used in traditional Information Retrieval (I...

Chirag Shah, W. Bruce Croft, David Jensen

claim paper

Read More »

182

click to vote

ICDAR
2003
IEEE

138views Document Analysis» more ICDAR 2003»

Classification of Web Documents Using a Graph Model

15 years 12 months ago

Download www.cse.salford.ac.uk

In this paper we describe work relating to classification of web documents using a graph-based model instead of the traditional vector-based model for document representation. We ...

Adam Schenker, Mark Last, Horst Bunke, Abraham Kan...

claim paper

Read More »

192

click to vote

SIGIR
2004
ACM

166views Information Technology» more SIGIR 2004»

Locality preserving indexing for document representation

16 years 25 min ago

Download research.microsoft.com

Document representation and indexing is a key problem for document analysis and processing, such as clustering, classification and retrieval. Conventionally, Latent Semantic Index...

Xiaofei He, Deng Cai, Haifeng Liu, Wei-Ying Ma

claim paper

Read More »

179

click to vote

CIS
2005
Springer

186views Applied Computing» more CIS 2005»

Concept Chain Based Text Clustering

16 years 3 days ago

Download dm.thss.tsinghua.edu.cn

Diﬀerent from familiar clustering objects, text documents have sparse data spaces. A common way of representing a document is as a bag of its component words, but the semantic re...

Shaoxu Song, Jian Zhang, Chunping Li

claim paper

Read More »

190

click to vote

KDD
2009
ACM

243views Data Mining» more KDD 2009»

Exploiting Wikipedia as external knowledge for document clustering

16 years 7 months ago

Download www-ai.cs.uni-dortmund.de

In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...

Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers