Sciweavers

808 search results - page 54 / 162
» Keyword-based document clustering
ICDAR
2007
IEEE
14 years 2 months ago
Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents
Logical entity recognition in heterogeneous collections of document page images remains a challenging problem since the performance of traditional supervised methods degrades drama...
S. Chen, S. Mao, G. Thoma
SCIENTOMETRICS
2010
13 years 6 months ago
The 12th International conference on scientometrics and informetrics
This paper presents an approach for identifying similar documents that can be used to assist scientists in finding related work. The approach called Citation Proximity Analysis (C...
Jacqueline Leta, Birger Larsen, Ronald Rousseau, W...
BMCBI
2006
13 years 8 months ago
Automatic document classification of biological literature
Background: Document classification is a widespread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, ...
David Chen, Hans-Michael Müller, Paul W. Ster...
MHCI
2004
Springer
14 years 1 months ago
Automatic Partitioning of Web Pages Using Clustering
This paper introduces a method for automatically partitioning richly-formatted electronic documents. An automatic partitioning system has many potential uses, but we focus here on ...
Richard Romero, Adam Berger
ICDAR
2003
IEEE
14 years 1 months ago
Postal address block location by contour clustering
We have developed a well-performing algorithm for locating address blocks in postal parcel images. Both machine-printed and handwritten addresses are processed by the algorithm. T...
Venu Govindaraju, Sergey Tulyakov