Sciweavers

DIAL
2006
IEEE

Tree clustering for layout-based document image retrieval

We describe a system that retrieves document images from collections stored in digital libraries on the basis of layout similarity. Layout regions are extracted and represented with an XY tree. The proposed indexing method combines a new tree clustering algorithm (based on Self-Organizing Maps) with Principal Component Analysis. Combining these techniques allows us to retrieve the most similar pages from large collections without directly comparing the query page with each indexed document.
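As a rough illustration of the XY tree representation mentioned in the abstract, the sketch below builds a tree by recursively cutting a page along empty gaps between region bounding boxes. It is not the authors' implementation: the box format and the names `Node`, `split`, and `build_xy_tree` are assumptions made for this example, and the clustering and Principal Component Analysis steps are not shown.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical region format: (x0, y0, x1, y1) in page coordinates.
Box = Tuple[int, int, int, int]


@dataclass
class Node:
    cut: str                      # "V" = vertical cuts, "H" = horizontal cuts, "leaf"
    boxes: List[Box]              # regions covered by this node
    children: List["Node"] = field(default_factory=list)


def split(boxes: List[Box], axis: int) -> List[List[Box]]:
    """Group boxes separated by empty gaps along one axis (0 = x, 1 = y)."""
    ordered = sorted(boxes, key=lambda b: b[axis])
    groups: List[List[Box]] = []
    current = [ordered[0]]
    reach = ordered[0][axis + 2]  # furthest extent covered so far
    for b in ordered[1:]:
        if b[axis] > reach:       # b starts after everything seen so far ends
            groups.append(current)
            current = [b]
        else:
            current.append(b)
        reach = max(reach, b[axis + 2])
    groups.append(current)
    return groups


def build_xy_tree(boxes: List[Box], axis: int = 0) -> Node:
    """Recursively cut the region set, alternating the preferred cut direction."""
    if len(boxes) <= 1:
        return Node("leaf", boxes)
    for a in (axis, 1 - axis):    # try the preferred axis first, then the other
        groups = split(boxes, a)
        if len(groups) > 1:
            label = "V" if a == 0 else "H"
            return Node(label, boxes, [build_xy_tree(g, 1 - a) for g in groups])
    return Node("leaf", boxes)    # no clean cut separates these regions


# Example page: a full-width header above two text columns.
header, left_col, right_col = (0, 0, 100, 10), (0, 20, 45, 100), (55, 20, 100, 100)
tree = build_xy_tree([header, left_col, right_col])
```

For this example page, the root is a horizontal cut separating the header from the body, and the body is then split by a vertical cut into the two columns. Trees of this kind are the objects that an indexing method would subsequently encode and cluster.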
Simone Marinai, Emanuele Marino, Giovanni Soda
Added: 11 Jun 2010
Updated: 11 Jun 2010
Type: Conference
Year: 2006
Where: DIAL
Authors: Simone Marinai, Emanuele Marino, Giovanni Soda