Sciweavers

708 search results - page 13 / 142
» Identifying Content Blocks from Web Documents
WWW
2005
ACM
14 years 9 months ago
Browsing fatigue in handhelds: semantic bookmarking spells relief
Focused Web browsing activities such as periodically looking up headline news, weather reports, etc., which require only selective fragments of particular Web pages, can be made m...
Saikat Mukherjee, I. V. Ramakrishnan
WWW
2001
ACM
14 years 9 months ago
Content Request Markup Language (CRML): a Distributed Framework for XML-based Content Publishing
Constructing web applications that provide dynamic, personalized web content with high scalability and performance is a challenge to the software industry in the new Internet era. In ...
Chi-Huang Chiu, Kai-Chih Liang, Shyan-Ming Yuan
HT
2000
ACM
14 years 28 days ago
Clustering hypertext with applications to web searching
Clustering separates unrelated documents and groups related documents, and is useful for discrimination, disambiguation, summarization, organization, and navigation of unstructure...
Dharmendra S. Modha, W. Scott Spangler
RIAO
2007
13 years 10 months ago
From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations
Many documents on the Web are formatted in a weakly structured format. Because of their weak semantics and because of the heterogeneity of their formats, the information conveyed by...
Guillaume Wisniewski, Patrick Gallinari
WWW
2008
ACM
14 years 9 months ago
Web graph similarity for anomaly detection (poster)
Web graphs are approximate snapshots of the web, created by search engines. Their creation is an error-prone procedure that relies on the availability of Internet nodes and the fa...
Panagiotis Papadimitriou, Ali Dasdan, Hector ...