Sciweavers

» Template-Based Information Mining from HTML Documents
CORR
2006
Springer
13 years 7 months ago
A tool set for the quick and efficient exploration of large document collections
We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
IPM
2006
13 years 7 months ago
Text mining without document context
We consider a challenging clustering task: the clustering of multi-word terms without document co-occurrence information in order to form coherent groups of topics. For this task, ...
Eric SanJuan, Fidelia Ibekwe-Sanjuan
CIKM
2001
Springer
13 years 12 months ago
Mining the Web to Create Minority Language Corpora
The Web is a valuable source of language-specific resources, but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approa...
Rayid Ghani, Rosie Jones, Dunja Mladenic
MSR
2006
ACM
14 years 1 month ago
MAPO: mining API usages from open source repositories
To improve software productivity, when constructing new software systems, developers often reuse existing class libraries or frameworks by invoking their APIs. Those APIs, however...
Tao Xie, Jian Pei
AUSDM
2008
Springer
13 years 9 months ago
Combining Structure and Content Similarities for XML Document Clustering
This paper proposes a clustering approach that explores both the content and the structure of XML documents for determining similarity among them. Assuming that the content and th...
Tien Tran, Richi Nayak, Peter Bruza