Sciweavers

2677 search results - page 86 / 536
» Extracting Structured Data from Web Pages
Sort
View
ICDE
2006
IEEE
153views Database» more  ICDE 2006»
14 years 4 months ago
Automatic Extraction of Publication Time from News Search Results
The publication time of a page can have a big impact on its relevance to a query, especially for time-sensitive pages such as news items. For news search engines, the publication ...
Yiyao Lu, Weiyi Meng, Wanjing Zhang, King-Lup Liu,...
ICCBR
2005
Springer
14 years 3 months ago
Extending jCOLIBRI for Textual CBR
Abstract. This paper summarises our work in textual Case-Based Reasoning within jCOLIBRI. We use Information Extraction techniques to annotate web pages to facilitate semantic retr...
Juan A. Recio-García, Belén Dí...
KDD
2002
ACM
293views Data Mining» more  KDD 2002»
14 years 10 months ago
Automatic Categorization of Web Pages and User Clustering with Mixtures of Hidden Markov Models
We propose mixtures of hidden Markov models for modelling clickstreams of web surfers. Hence, the page categorization is learned from the data without the need for a (possibly cumb...
Alexander Ypma, Tom Heskes
INTERNET
2007
182views more  INTERNET 2007»
13 years 9 months ago
Analysis of Caching and Replication Strategies for Web Applications
Replication and caching mechanisms are often employed to enhance the performance of Web applications. In this article, we present a qualitative and quantitative analysis of state-...
Swaminathan Sivasubramanian, Guillaume Pierre, Maa...
CIKM
2008
Springer
13 years 12 months ago
Closing the loop in webpage understanding
The two most important tasks in information extraction from the Web are webpage structure understanding and natural language sentences processing. However, little work has been don...
Chunyu Yang, Yong Cao, Zaiqing Nie, Jie Zhou, Ji-R...