Search Sciweavers | Sciweavers

2876 search results - page 30 / 576

» A Conceptual-Modeling Approach to Extracting Data from the W...

153

click to vote

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

16 years 6 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

146

click to vote

WWW
2005
ACM

108views Internet Technology» more WWW 2005»

Using visual cues for extraction of tabular data from arbitrary HTML documents

16 years 6 months ago

Download www.dbai.tuwien.ac.at

We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extrac...

Bernhard Krüpl, Marcus Herzog, Wolfgang Gatte...

claim paper

Read More »

143

click to vote

SIGIR
2004
ACM

135views Information Technology» more SIGIR 2004»

15 years 11 months ago

Query-related data extraction of hidden web documents

Download dis.shef.ac.uk

The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...

Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...

claim paper

Read More »

135

click to vote

ICDE
2006
IEEE

143views Database» more ICDE 2006»

Using Data-Extraction Ontologies to Foster Automating Semantic Annotation

15 years 11 months ago

Download ir.iit.edu

Semantic annotation adds formal metadata to web pages to link web data with ontology concepts. Automated semantic annotation is a primary way of enabling the semantic web. A main ...

Yihong Ding, David W. Embley

claim paper

Read More »

141

click to vote

WIDM
2003
ACM

97views Internet Technology» more WIDM 2003»

Schema-guided wrapper maintenance for web-data extraction

15 years 10 months ago

Download www.ics.uci.edu

Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant t...

Xiaofeng Meng, Dongdong Hu, Chen Li

claim paper

Read More »

« Prev « First page 30 / 576 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers