Sciweavers

1541 search results
» Extracting Web Data Using Instance-Based Learning
VLDB
2001
ACM
14 years 1 month ago
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
The paper investigates techniques for extracting data from HTML sites through the use of automatically generated wrappers. To automate the wrapper generation and the data extracti...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
COMPSAC
2003
IEEE
14 years 2 months ago
A Supervised Visual Wrapper Generator for Web-Data Extraction
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interest. In this paper, we propose a novel sch...
Xiaofeng Meng, Haiyan Wang, Dongdong Hu, Chen Li
EMNLP
2006
13 years 10 months ago
Boosting Unsupervised Relation Extraction by Using NER
Web extraction systems attempt to use the immense amount of unlabeled text in the Web in order to create large lists of entities and relations. Unlike traditional IE methods, the ...
Ronen Feldman, Benjamin Rosenfeld
PVLDB
2008
13 years 8 months ago
Learning to extract form labels
In this paper we describe a new approach to extract element labels from Web form interfaces. Having these labels is a requirement for several techniques that attempt to retrieve a...
Hoa Nguyen, Thanh Hoang Nguyen, Juliana Freire
AI
2007
Springer
14 years 3 months ago
Learning the Semantic Meaning of a Concept from the Web
Many researchers have used text classification method in solving the ontology mapping problem. Their mapping results heavily depend on the availability of quality exemplars used as...
Yang Yu, Yun Peng