Search Sciweavers | Sciweavers

1947 search results - page 6 / 390

» On the Automatic Extraction of Data from the Hidden Web

149

click to vote

SIGIR
2004
ACM

135views Information Technology» more SIGIR 2004»

15 years 11 months ago

Query-related data extraction of hidden web documents

Download dis.shef.ac.uk

The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...

Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...

claim paper

Read More »

174

click to vote

DEBU
2000

95views more DEBU 2000»

Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach

15 years 5 months ago

Download www.isi.edu

A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...

Craig A. Knoblock, Kristina Lerman, Steven Minton,...

claim paper

Read More »

138

click to vote

CIKM
2003
Springer

129views Information Technology» more CIKM 2003»

Extracting unstructured data from template generated web documents

15 years 11 months ago

Download www.ir.iit.edu

We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...

claim paper

Read More »

221

click to vote

SEMCO
2009
IEEE

277views Applied Computing» more SEMCO 2009»

An Algebraic Language for Semantic Data Integration on the Hidden Web

16 years 17 days ago

Download integra.cs.wayne.edu

Semantic integration in the hidden Web is an emerging area of research where traditional assumptions do not always hold. Frequent changes, conﬂicts and the sheer size of the hid...

Shazzad Hosain, Hasan M. Jamil

claim paper

Read More »

177

click to vote

PAKM
2004

148views Knowledge Management» more PAKM 2004»

Automatic Generation of Taxonomies from the WWW

15 years 7 months ago

Download deim.urv.cat

In this paper we present a methodology to extract information from the Web to build a taxonomy of terms and Web resources for a given domain. This taxonomy represents a hierarchy o...

David Sánchez, Antonio Moreno

claim paper

Read More »

« Prev « First page 6 / 390 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers