Sciweavers

» Intelligent Computation of Presentation Documents
LAWEB
2003
IEEE
15 years 9 months ago
Syntactic Similarity of Web Documents
This paper presents and compares two methods for evaluating the syntactic similarity between documents. The first method uses the Patricia tree, constructed from the original doc...
Álvaro R. Pereira Jr., Nivio Ziviani
ICDAR
2003
IEEE
15 years 9 months ago
Improved Nearest Neighbor Based Approach to Accurate Document Skew Estimation
The nearest-neighbor based document skew detection methods do not require the presence of a predominant text area, and are not subject to skew angle limitation. However, the accur...
Yue Lu, Chew Lim Tan
KDD
2005
ACM
15 years 9 months ago
Web mining from competitors' websites
This paper presents a framework for user-oriented text mining. It is then illustrated with an example of discovering knowledge from competitors’ websites. The knowledge to be di...
Xin Chen, Yi-fang Brook Wu
AAAI
2010
15 years 5 months ago
A Two-Dimensional Topic-Aspect Model for Discovering Multi-Faceted Topics
This paper presents the Topic-Aspect Model (TAM), a Bayesian mixture model which jointly discovers topics and aspects. We broadly define an aspect of a document as a characteristi...
Michael Paul, Roxana Girju
SETN
2004
Springer
15 years 9 months ago
Exploiting Cross-Document Relations for Multi-document Evolving Summarization
This paper presents a methodology for summarization from multiple documents which are about a specific topic. It is based on the specification and identification of the cross-document...
Stergos D. Afantenos, Irene Doura, Eleni Kapellou,...