Search Sciweavers | Sciweavers

2677 search results - page 70 / 536

» Extracting Structured Data from Web Pages

211

ICDE
2010
IEEE

273 views, Database

WikiAnalytics: Ad-hoc Querying of Highly Heterogeneous Structured Data

16 years 7 months ago

Download cseweb.ucsd.edu

Searching and extracting meaningful information out of highly heterogeneous datasets is a hot topic that has received a lot of attention. However, the existing solutions are based on e...

Andrey Balmin, Emiran Curtmola

200

COMPSAC
2002
IEEE

139 views, Software Engineering

An Approach to Identify Duplicated Web Pages

16 years 4 days ago

Download www.cse.dmu.ac.uk

A relevant consequence of the unceasing expansion of the Web and e-commerce is the growth in demand for new Web sites and Web applications. The software industry is facing the ...

Giuseppe A. Di Lucca, Massimiliano Di Penta, Anna ...

194

CIKM
2009
Springer

115 views, Information Technology

Data extraction from the web using wild card queries

15 years 11 months ago

Download webdocs.cs.ualberta.ca

This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...

Davood Rafiei, Haobin Li

248

JOT
2008

136 views

The Stock Statistics Parser

15 years 7 months ago

Download www.jot.fm

This paper describes how to use the HTMLEditorKit to perform web data mining on stock statistics for listed firms. Our focus is on making use of the web to get information about comp...

Douglas Lyon

170

WWW
2008
ACM

143 views, Internet Technology

Visualizing historical content of web pages

16 years 7 months ago

Download www2008.org

Recently, along with the rapid growth of the Web, preservation efforts have also increased. As a consequence, large amounts of past Web data are stored in Web archives. This h...

Adam Jatowt, Yukiko Kawai, Katsumi Tanaka
