Search Sciweavers | Sciweavers

2849 search results - page 53 / 570

» Extracting Objects from the Web

click to vote

WEBI
2005
Springer

94views Internet Technology» more WEBI 2005»

ITPilot: A Toolkit for Industrial-Strength Web Data Extraction

15 years 8 months ago

Download www.tic.udc.es

In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today’s Web sources are “human-readable...

Alberto Pan, Juan Raposo, Manuel Álvarez, P...

claim paper

Read More »

124

click to vote

SIGKDD
2010

111views more SIGKDD 2010»

Unexpected results in automatic list extraction on the web

14 years 10 months ago

Download www.sigkdd.org

The discovery and extraction of general lists on the Web continues to be an important problem facing the Web mining community. There have been numerous studies that claim to autom...

Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei ...

claim paper

Read More »

139

click to vote

COLING
2010

187views Computational Linguistics» more COLING 2010»

A Novel Method for Bilingual Web Page Acquisition from Search Engine Web Records

14 years 10 months ago

Download www.aclweb.org

A new approach has been developed for acquiring bilingual web pages from the result pages of search engines, which is composed of two challenging tasks. The first task is to detec...

Yanhui Feng, Yu Hong, Zhenxiang Yan, Jian-Min Yao,...

claim paper

Read More »

103

click to vote

AAAI
2006

101views Intelligent Agents» more AAAI 2006»

Using Semantics to Identify Web Objects

15 years 4 months ago

Download cs.stanford.edu

Many common web tasks can be automated by algorithms that are able to identify web objects relevant to the user's needs. This paper presents a novel approach to web object id...

Nathanael Chambers, James F. Allen, Lucian Galescu...

claim paper

Read More »

164

click to vote

BMCBI
2011

219views Artificial Intelligence» more BMCBI 2011»

Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library

14 years 6 months ago

Download www.biomedcentral.com

Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, an...

Roderic D. M. Page

claim paper

Read More »

« Prev « First page 53 / 570 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers