Search Sciweavers | Sciweavers

468 search results - page 30 / 94

» Automatic Data Extraction from Data-Rich Web Pages

IJCAI
2003

120 views, Artificial Intelligence

Information Extraction from Tree Documents by Learning Subtree Delimiters

15 years 8 months ago

Download www.isi.edu

Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...

Boris Chidlovskii

CIKM
2009
Springer

153 views, Information Technology

Improving search engines using human computation games

15 years 8 months ago

Download research.microsoft.com

Work on evaluating and improving the relevance of web search engines typically uses human relevance judgments or clickthrough data. Both these methods look at the problem of learni...

Hao Ma, Raman Chandrasekar, Chris Quirk, Abhishek ...

SPIRE
1999
Springer

178 views, Information Technology

Top-down Extraction of Semi-Structured Data

15 years 11 months ago

Download homepages.dcc.ufmg.br

In this paper, we propose an innovative approach to extracting semi-structured data from Web sources. The idea is to collect a couple of example objects from the user and to use t...

Berthier A. Ribeiro-Neto, Alberto H. F. Laender, A...

WWW
2004
ACM

134 views, Internet Technology

Time-based contextualized-news browser (t-cnb)

16 years 8 months ago

Download www.iw3c2.org

We propose a new way of browsing contextualized-news articles. Our prototype browser system is called a Time-based Contextualized-News Browser (T-CNB). The T-CNB concurrently and a...

Akiyo Nadamoto, Katsumi Tanaka

VLDB
2011
ACM

251 views, Database

Harvesting relational tables from lists on the web

15 years 2 months ago

Download www.vldb.org

A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...

Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy
