Sciweavers

953 search results - page 29 / 191
» Using Clustering and Blade Clusters in the Terabyte Task
Sort
View
IMCSIT
2010
13 years 5 months ago
Using Self Organizing Map to Cluster Arabic Crime Documents
This paper presents a system that combines two text mining techniques; information extraction and clustering. A rulebased approach is used to perform the information extraction tas...
Meshrif Alruily, Aladdin Ayesh, Abdulsamad Al-Marg...
IM
2007
13 years 8 months ago
Cluster Generation and Labeling for Web Snippets: A Fast, Accurate Hierarchical Solution
This paper describes Armil, a meta-search engine that groups the web snippets returned by auxiliary search engines into disjoint labeled clusters. The cluster labels generated by A...
Filippo Geraci, Marco Pellegrini, Marco Maggini, F...
CLEF
2011
Springer
12 years 8 months ago
A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Document
Abstract. This paper presents a language-independent Multilingual Document Clustering (MDC) approach on comparable corpora. Named entites (NEs) such as persons, locations, organiza...
N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma
ICDE
2012
IEEE
252views Database» more  ICDE 2012»
11 years 11 months ago
Fuzzy Joins Using MapReduce
—Fuzzy/similarity joins have been widely studied in the research community and extensively used in real-world applications. This paper proposes and evaluates several algorithms f...
Foto N. Afrati, Anish Das Sarma, David Menestrina,...
IJCNN
2008
IEEE
14 years 2 months ago
Ranking and selecting clustering algorithms using a meta-learning approach
Abstract— We present a novel framework that applies a metalearning approach to clustering algorithms. Given a dataset, our meta-learning approach provides a ranking for the candi...
Marcílio Carlos Pereira de Souto, Ricardo B...