Search Sciweavers | Sciweavers

2137 search results - page 36 / 428

» Extraction of Structural Information from the Web

155

click to vote

SIGIR
2004
ACM

135views Information Technology» more SIGIR 2004»

15 years 11 months ago

Query-related data extraction of hidden web documents

Download dis.shef.ac.uk

The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...

Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...

claim paper

Read More »

155

click to vote

CIKM
2009
Springer

115views Information Technology» more CIKM 2009»

Data extraction from the web using wild card queries

15 years 10 months ago

Download webdocs.cs.ualberta.ca

This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...

Davood Rafiei, Haobin Li

claim paper

Read More »

174

click to vote

ACL
2010

150views Computational Linguistics» more ACL 2010»

Extraction and Approximation of Numerical Attributes from the Web

15 years 4 months ago

Download aclweb.org

We present a novel framework for automated extraction and approximation of numerical object attributes such as height and weight from the Web. Given an object-attribute pair, we d...

Dmitry Davidov, Ari Rappoport

claim paper

Read More »

178

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 11 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

171

Voted

SYNASC
2006
IEEE

211views Algorithms» more SYNASC 2006»

HTML Pattern Generator--Automatic Data Extraction from Web Pages

16 years 5 days ago

Download www.informatik.tu-cottbus.de

Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...

Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...

claim paper

Read More »

« Prev « First page 36 / 428 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers