Sciweavers

629 search results - page 79 / 126
» Document clustering based on cluster validation
Sort
View
LWA
2004
13 years 9 months ago
Dirichlet Enhanced Latent Semantic Analysis
This paper describes nonparametric Bayesian treatments for analyzing records containing occurrences of items. The introduced model retains the strength of previous approaches that...
Kai Yu, Shipeng Yu, Volker Tresp
WWW
2005
ACM
14 years 8 months ago
Disambiguating Web appearances of people in a social network
Say you are looking for information about a particular person. A search engine returns many pages for that person's name but which pages are about the person you care about, ...
Ron Bekkerman, Andrew McCallum
ICDAR
2009
IEEE
14 years 2 months ago
Word-Based Adaptive OCR for Historical Books
The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. Th...
Vladimir Kluzner, Asaf Tzadok, Yuval Shimony, Euge...
CIKM
2006
Springer
13 years 11 months ago
Multi-task text segmentation and alignment based on weighted mutual information
Text segmentation is important for text analysis, while text alignment is to determine shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
Bingjun Sun, Ding Zhou, Hongyuan Zha, John Yen
NLDB
2010
Springer
13 years 12 months ago
Semantic Content Access Using Domain-Independent NLP Ontologies
We present a lightweight, user-centred approach for document navigation and analysis that is based on an ontology of text mining results. This allows us to bring the result of exis...
René Witte, Ralf Krestel