Sciweavers

328 search results - page 22 / 66
» A Multi-level Approach for Document Clustering
Sort
View
ICDAR
2009
IEEE
14 years 2 months ago
A Self-Adaptive Method for Extraction of Document-Specific Alphabets
Recognition and encoding of digitized historical documents is still a challenging and difficult task. A major problem is the occurrence of unknown glyphs and symbols which might n...
Stefan Pletschacher
ICDM
2002
IEEE
123views Data Mining» more  ICDM 2002»
14 years 15 days ago
Towards Automatic Generation of Query Taxonomy: A Hierarchical Query Clustering Approach
Previous works on automatic query clustering most generate a flat, un-nested partition of query terms. In this work, we are pursuing to organize query terms into a hierarchical s...
Shui-Lung Chuang, Lee-Feng Chien
CLEF
2010
Springer
13 years 8 months ago
Cross-document Coreference for WePS
A good clustering performance depends on the quality of the distance function used to asses similarity. In this paper we propose a pairwise document coreference model to improve pe...
Iustin Dornescu, Constantin Orasan, Tatiana Lesnik...
SAC
2011
ACM
12 years 10 months ago
Hierarchical comments-based clustering
Information resources on the Web like videos, images, and documents are increasingly becoming more “social” through user engagement via commenting systems. These commenting sy...
Chiao-Fang Hsu, James Caverlee, Elham Khabiri
ICASSP
2009
IEEE
14 years 2 months ago
Incorporating monolingual corpora into bilingual latent semantic analysis for crosslingual LM adaptation
The major limitation in bilingual latent semantic analysis (bLSA) is the requirement of parallel training corpora. Motivated by semi-supervised learning, we propose a clusterbased...
Yik-Cheung Tam, Tanja Schultz