Search Sciweavers | Sciweavers

391 search results - page 9 / 79

» Finding and Extracting Data Records from Web Pages

click to vote

WISE
2005
Springer

151views Internet Technology» more WISE 2005»

Extracting Web Data Using Instance-Based Learning

14 years 2 months ago

Download www.cs.uic.edu

This paper studies structured data extraction from Web pages, e.g., online product description pages. Existing approaches to data extraction include wrapper induction and automatic...

Yanhong Zhai, Bing Liu

claim paper

Read More »

click to vote

BMCBI
2011

219views Artificial Intelligence» more BMCBI 2011»

Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library

13 years 1 days ago

Download www.biomedcentral.com

Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...

Roderic D. M. Page

claim paper

Read More »

click to vote

KDD
1999
ACM

147views Data Mining» more KDD 1999»

Text Mining: Finding Nuggets in Mountains of Textual Data

14 years 21 days ago

Download maya.cs.depaul.edu

Text mining appliesthe sameanalytical functions of datamining to the domainof textual information, relying on sophisticatedtext analysis techniques that distill information from f...

Jochen Dörre, Peter Gerstl, Roland Seiffert

claim paper

Read More »

click to vote

JCDL
2004
ACM

198views Education» more JCDL 2004»

Finding authoritative people from the web

14 years 1 months ago

Download www.ingrid.org

Today’s web is so huge and diverse that it arguably reﬂects the real world. For this reason, searching the web is a promising approach to ﬁnd things in the real world. This ...

Masanori Harada, Shin-ya Sato, Kazuhiro Kazama

claim paper

Read More »

click to vote

AAAI
2008

109views Intelligent Agents» more AAAI 2008»

An Unsupervised Approach for Product Record Normalization across Different Web Sites

13 years 10 months ago

Download www.aaai.org

An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...

Tak-Lam Wong, Tik-Shun Wong, Wai Lam

claim paper

Read More »

« Prev « First page 9 / 79 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers