Sciweavers

2340 search results - page 124 / 468
» Speculative document evaluation
Sort
View
234
Voted
IRFC
2011
Springer
14 years 7 months ago
Multilingual Document Clustering Using Wikipedia as External Knowledge
This paper presents Multilingual Document Clustering (MDC) on comparable corpora. Wikipedia, a structured multilingual knowledge base, has been highly exploited in many monolingual...
N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma
136
Voted
ICDAR
2009
IEEE
15 years 10 months ago
Scalable Feature Extraction from Noisy Documents
We cope with the metadata recognition in layoutoriented documents. We address the problem as a classification task and propose a method for automatic extraction of relevant featu...
Loïc Lecerf, Boris Chidlovskii
136
Voted
IJCNN
2006
IEEE
15 years 9 months ago
A Self-Organising Map Approach for Clustering of XML Documents
— The number of XML documents produced and available on the Internet is steadily increasing. It is thus important to devise automatic procedures to extract useful information fro...
Francesca Trentini, Markus Hagenbuchner, Alessandr...
EEE
2005
IEEE
15 years 9 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
144
Voted
ADBIS
2006
Springer
165views Database» more  ADBIS 2006»
15 years 7 months ago
Fragmenting XML Documents via Structural Constraints
Abstract. XML query processors suffer from main-memory limitations that prevent them from processing large XML documents. While content-based predicates can be used to project down...
Angela Bonifati, Alfredo Cuzzocrea, Bruno Zinno