Sciweavers

15 search results - page 2 / 3
» A graph-theoretic approach to webpage segmentation
WIRI
2005
IEEE
14 years 27 days ago
Postal Address Detection from Web Documents
An approach to postal address detection from webpages is proposed. The webpages are first segmented into text blocks based on their visual similarity. The text content in each bl...
Lin Can, Zhang Qian, Xiaofeng Meng, Wenyin Liu
WWW
2006
ACM
14 years 8 months ago
Browsing on small screens: recasting web-page segmentation into an efficient machine learning framework
Fitting enough information from webpages to make browsing on small screens compelling is a challenging task. One approach is to present the user with a thumbnail image of the full...
Shumeet Baluja
CIKM
2005
ACM
14 years 26 days ago
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing web page classification. This approach is magnitudes faster than typical web page classific...
Min-Yen Kan, Hoang Oanh Nguyen Thi
AIRS
2010
Springer
13 years 4 months ago
Event Recognition from News Webpages through Latent Ingredients Extraction
We investigate the novel problem of event recognition from news webpages. "Events" are basic text units containing news elements. We observe that a news article is always...
Rui Yan, Yu Li, Yan Zhang, Xiaoming Li
WWW
2007
ACM
14 years 8 months ago
Page-level template detection via isotonic smoothing
We develop a novel framework for the page-level template detection problem. Our framework is built on two main ideas. The first is the automatic generation of training data for a ...
Deepayan Chakrabarti, Ravi Kumar, Kunal Punera