Search Sciweavers | Sciweavers

2677 search results - page 33 / 536

» Extracting Structured Data from Web Pages

152

click to vote

UIST
2006
ACM

97views Software Engineering» more UIST 2006»

Enabling web browsers to augment web sites' filtering and sorting functionalities

16 years 25 days ago

Download people.csail.mit.edu

Existing augmentations of web pages are mostly small cosmetic changes (e.g., removing ads) and minor addition of third-party content (e.g., product prices from competing sites). N...

David F. Huynh, Robert C. Miller, David R. Karger

claim paper

Read More »

154

click to vote

LREC
2010

216views Education» more LREC 2010»

BlogBuster: A Tool for Extracting Corpora from the Blogosphere

15 years 8 months ago

Download www.lrec-conf.org

This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...

Georgios Petasis, Dimitrios Petasis

claim paper

Read More »

253

click to vote

ISEC
2001
Springer

180views ECommerce» more ISEC 2001»

i-Cube: A Tool-Set for the Dynamic Extraction and Integration of Web Data Content

15 years 11 months ago

Download www.swen.uwaterloo.ca

Over the past decade the Internet has evolved into the largest public community in the world. It provides a wealth of data content and services in almost every field of science, t...

Frankie Poon, Kostas Kontogiannis

claim paper

Read More »

177

click to vote

WWW
2001
ACM

116views Internet Technology» more WWW 2001»

16 years 7 months ago

Finding Related Web Pages Based on Connectivity Information from a Search Engine

Download www10.org

This paper proposes a method for finding related Web pages based on connectivity information of hyperlinks. As claimed by Kumar, a complete bipartite graph of Web pages can be reg...

Tsuyoshi Murata

claim paper

Read More »

185

click to vote

IJMMS
2008

108views more IJMMS 2008»

Ontology-based information extraction and integration from heterogeneous data sources

15 years 6 months ago

Download www.cl.uni-heidelberg.de

In this paper we present the design, implementation and evaluation of SOBA, a system for ontology-based information extraction from heterogeneous data resources, including plain t...

Paul Buitelaar, Philipp Cimiano, Anette Frank, Mat...

claim paper

Read More »

« Prev « First page 33 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers