Sciweavers

318 search results - page 10 / 64
» Mining data records in Web pages
Sort
View
AAAI
2006
13 years 9 months ago
Automatic Wrapper Generation Using Tree Matching and Partial Tree Alignment
This paper is concerned with the problem of structured data extraction from Web pages. The objective of the research is to automatically segment data records in a page, extract da...
Yanhong Zhai, Bing Liu
ACL
2006
13 years 9 months ago
A DOM Tree Alignment Model for Mining Parallel Data from the Web
This paper presents a new web mining scheme for parallel data acquisition. Based on the Document Object Model (DOM), a web page is represented as a DOM tree. Then a DOM tree align...
Lei Shi, Cheng Niu, Ming Zhou, Jianfeng Gao
ICDE
2006
IEEE
124views Database» more  ICDE 2006»
14 years 9 months ago
Segmentation of Publication Records of Authors from the Web
Publication records are often found in the authors' personal home pages. If such a record is partitioned into a list of semantic fields of authors, title, date, etc., the uns...
Wei Zhang, Clement T. Yu, Neil R. Smalheiser, Vetl...
AAAI
2007
13 years 10 months ago
Mining Web Query Hierarchies from Clickthrough Data
In this paper, we propose to mine query hierarchies from clickthrough data, which is within the larger area of automatic acquisition of knowledge from the Web. When a user submits...
Dou Shen, Min Qin, Weizhu Chen, Qiang Yang, Zheng ...
PKDD
2009
Springer
269views Data Mining» more  PKDD 2009»
14 years 3 months ago
Enhanced Web Page Content Visualization with Firefox
This paper aims at presenting how natural language processing and machine learning techniques can help the internet surfer to get a better overview of the pages he is reading. The ...
Lorand Dali, Delia Rusu, Dunja Mladenic