Search Sciweavers | Sciweavers

2876 search results - page 27 / 576

» A Conceptual-Modeling Approach to Extracting Data from the W...

188

Voted

WWW
2010
ACM

188views Internet Technology» more WWW 2010»

Exploiting content redundancy for web information extraction

15 years 5 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

claim paper

Read More »

181

click to vote

DEXAW
2007
IEEE

124views Database» more DEXAW 2007»

A Process Improvement Approach to Improve Web Form Design and Usability

15 years 7 months ago

Download www.nostin.com

The research presented in this paper is an examination of how the concepts used in process improvement may be applied to a web form to improve design and usability. Although much ...

Sean Thompson, Torab Torabi

claim paper

Read More »

167

click to vote

KDD
2007
ACM

189views Data Mining» more KDD 2007»

Corroborate and learn facts from the web

16 years 6 months ago

Download delivery.acm.org

The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...

Shubin Zhao, Jonathan Betz

claim paper

Read More »

249

click to vote

ICDE
2008
IEEE

153views Database» more ICDE 2008»

Automatically Extracting Form Labels

16 years 7 months ago

Download www.cs.utah.edu

We describe a machine-learning-based approach for extracting attribute labels from Web form interfaces. Having these labels is a requirement for several techniques that attempt to ...

Hoa Nguyen, Eun Yong Kang, Juliana Freire

claim paper

Read More »

175

click to vote

NAACL
2010

182views Computational Linguistics» more NAACL 2010»

Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment

15 years 3 months ago

Download research.microsoft.com

The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...

Jason R. Smith, Chris Quirk, Kristina Toutanova

claim paper

Read More »

« Prev « First page 27 / 576 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers