Search Sciweavers | Sciweavers

119 search results - page 4 / 24

» Learning to Extract Text-Based Information from the World Wi...


CIS
2005
Springer

101 views, Applied Computing

A Method for Automating the Extraction of Specialized Information from the Web

15 years 11 months ago

Download www.as.uky.edu

The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer sys...

Ling Lin, Antonio Liotta, Andrew Hippisley



RIAO
1997

350 views, Information Technology

Coupling information retrieval and information extraction: A new text technology for gathering information from the web

15 years 6 months ago

Download reference.kfupm.edu.sa

The techniques of information retrieval and information extraction are complementary, but to date there has been little concrete work aimed at integrating the two. We describe how...

Robert J. Gaizauskas, Alexander M. Robertson



WWW
2007
ACM

194 views, Internet Technology

Answering bounded continuous search queries in the world wide web

16 years 6 months ago

Download www2007.org

Search queries applied to extract relevant information from the World Wide Web over a period of time may be denoted as continuous search queries. The improvement of continuous sea...

Dirk Kukulenz, Alexandros Ntoulas



WWW
2003
ACM

133 views, Internet Technology

Efficient URL caching for world wide web crawling

16 years 6 months ago

Download research.microsoft.com

Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)-(c). Howev...

Andrei Z. Broder, Marc Najork, Janet L. Wiener
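
The basic loop quoted in the abstract above, steps (a) to (c), can be sketched roughly as follows. This is only an illustration under stated assumptions: `crawl`, `fetch_links`, `max_pages`, and `TOY_WEB` are hypothetical names, the "web" is a small in-memory dictionary rather than real HTTP fetching, and the "seen" check is a plain Python set, not the caching scheme the paper itself studies.

```python
from collections import deque
from typing import Callable, Iterable, List


def crawl(seed: str,
          fetch_links: Callable[[str], Iterable[str]],
          max_pages: int = 1000) -> List[str]:
    """Breadth-first crawl; returns URLs in the order they were fetched.

    fetch_links is a hypothetical callback that combines step (a), fetching
    a page, with step (b), parsing out its linked URLs.
    """
    seen = {seed}              # every URL discovered so far
    frontier = deque([seed])   # URLs discovered but not yet fetched
    fetched: List[str] = []
    while frontier and len(fetched) < max_pages:
        url = frontier.popleft()
        fetched.append(url)
        for link in fetch_links(url):   # steps (a) and (b)
            if link not in seen:        # step (c): only URLs not seen before
                seen.add(link)
                frontier.append(link)
    return fetched


# Toy stand-in for the web: page name -> pages it links to (assumed data).
TOY_WEB = {
    "a": ["b", "c"],
    "b": ["a", "d"],
    "c": ["d"],
    "d": [],
}

if __name__ == "__main__":
    print(crawl("a", lambda url: TOY_WEB.get(url, [])))
```

The membership test on `seen` is the part that becomes expensive once the set of known URLs no longer fits comfortably in memory; the sketch sidesteps that by keeping everything in one in-process set.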



WWW
2010
ACM

193 views, Internet Technology

Web-scale knowledge extraction from semi-structured tables

15 years 10 months ago

Download www.patrickpantel.com

A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...

Eric Crestan, Patrick Pantel

