Sciweavers

112 search results - page 10 / 23
» Clustering Template Based Web Documents
Sort
View
ICDT
2007
ACM
143views Database» more  ICDT 2007»
14 years 1 months ago
Hierarchical Summarizing and Evaluating for Web Pages
In this investigation we propose a novel summarization method of Web pages using hierarchical expression. We discuss close relationship between summarization and hierarchical clust...
Kou Takahashi, Takao Miura, Isamu Shioya
CVPR
2011
IEEE
13 years 1 months ago
Registration of Camera Captured Documents Under Non-rigid Deformation
Document registration is a problem where the image of a template document whose layout is known is registered with a test document image. Given the registration parameters, layout...
Venkata Edupuganti, Suryaprakash Kompalli, Vinayak...
IADIS
2004
13 years 11 months ago
'surfing for knowledge' finding semantically similar Web clusters
In this paper we present our technique for finding semantically similar clusters within web documents obtained from a set of queries retrieved from the Google search engine. This ...
David Cleary, Diarmuid O'Donoghue
WWW
2008
ACM
14 years 10 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
COLING
2010
13 years 4 months ago
Open Entity Extraction from Web Search Query Logs
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mi...
Alpa Jain, Marco Pennacchiotti