Sciweavers

2849 search results - page 53 / 570
» Extracting Objects from the Web
Sort
View
WEBI
2005
Springer
14 years 1 months ago
ITPilot: A Toolkit for Industrial-Strength Web Data Extraction
In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today’s Web sources are “human-readable...
Alberto Pan, Juan Raposo, Manuel Álvarez, P...
SIGKDD
2010
111views more  SIGKDD 2010»
13 years 2 months ago
Unexpected results in automatic list extraction on the web
The discovery and extraction of general lists on the Web continues to be an important problem facing the Web mining community. There have been numerous studies that claim to autom...
Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei ...
COLING
2010
13 years 2 months ago
A Novel Method for Bilingual Web Page Acquisition from Search Engine Web Records
A new approach has been developed for acquiring bilingual web pages from the result pages of search engines, which is composed of two challenging tasks. The first task is to detec...
Yanhui Feng, Yu Hong, Zhenxiang Yan, Jian-Min Yao,...
AAAI
2006
13 years 9 months ago
Using Semantics to Identify Web Objects
Many common web tasks can be automated by algorithms that are able to identify web objects relevant to the user's needs. This paper presents a novel approach to web object id...
Nathanael Chambers, James F. Allen, Lucian Galescu...
BMCBI
2011
12 years 11 months ago
Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library
Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...
Roderic D. M. Page