Sciweavers

391 search results - page 9 / 79
» Finding and Extracting Data Records from Web Pages
Sort
View
WISE
2005
Springer
14 years 2 months ago
Extracting Web Data Using Instance-Based Learning
This paper studies structured data extraction from Web pages, e.g., online product description pages. Existing approaches to data extraction include wrapper induction and automatic...
Yanhong Zhai, Bing Liu
BMCBI
2011
13 years 1 days ago
Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library
Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...
Roderic D. M. Page
KDD
1999
ACM
147views Data Mining» more  KDD 1999»
14 years 21 days ago
Text Mining: Finding Nuggets in Mountains of Textual Data
Text mining appliesthe sameanalytical functions of datamining to the domainof textual information, relying on sophisticatedtext analysis techniques that distill information from f...
Jochen Dörre, Peter Gerstl, Roland Seiffert
JCDL
2004
ACM
198views Education» more  JCDL 2004»
14 years 1 months ago
Finding authoritative people from the web
Today’s web is so huge and diverse that it arguably reflects the real world. For this reason, searching the web is a promising approach to find things in the real world. This ...
Masanori Harada, Shin-ya Sato, Kazuhiro Kazama
AAAI
2008
13 years 10 months ago
An Unsupervised Approach for Product Record Normalization across Different Web Sites
An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...
Tak-Lam Wong, Tik-Shun Wong, Wai Lam