Sciweavers

468 search results - page 18 / 94
» Automatic Data Extraction from Data-Rich Web Pages
ICDAR
2003
IEEE
14 years 28 days ago
Identifying Story and Preview Images in News Web Pages
The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. Th...
Jianying Hu, Amit Bagga
ECAI
2006
Springer
13 years 11 months ago
Disambiguating Personal Names on the Web Using Automatically Extracted Key Phrases
Abstract. When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. Ho...
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuk...
ICDE
2006
IEEE
14 years 1 months ago
Automatic Extraction of Publication Time from News Search Results
The publication time of a page can have a big impact on its relevance to a query, especially for time-sensitive pages such as news items. For news search engines, the publication ...
Yiyao Lu, Weiyi Meng, Wanjing Zhang, King-Lup Liu,...
KDD
2002
ACM
14 years 8 months ago
Automatic Categorization of Web Pages and User Clustering with Mixtures of Hidden Markov Models
We propose mixtures of hidden Markov models for modelling clickstreams of web surfers. Hence, the page categorization is learned from the data without the need for a (possibly cumb...
Alexander Ypma, Tom Heskes
ACL
2009
13 years 5 months ago
Mining Bilingual Data from the Web with Adaptively Learnt Patterns
Mining bilingual data (including bilingual sentences and terms) from the Web can benefit many NLP applications, such as machine translation and cross language information retrie...
Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, ...