Search Sciweavers | Sciweavers

232 search results - page 7 / 47

» Query-related data extraction of hidden web documents

144

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 9 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

132

Voted

DL
2000
Springer

162views Digital Library» more DL 2000»

Snowball: extracting relations from large plain-text collections

15 years 7 months ago

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...

Eugene Agichtein, Luis Gravano

claim paper

Read More »

130

click to vote

WWW
2007
ACM

183views Internet Technology» more WWW 2007»

Extraction and search of chemical formulae in text documents on the web

16 years 4 months ago

Download chemxseer.ist.psu.edu

Often scientists seek to search for articles on the Web related to a particular chemical. When a scientist searches for a chemical formula using a search engine today, she gets ar...

Bingjun Sun, Qingzhao Tan, Prasenjit Mitra, C. Lee...

claim paper

Read More »

133

click to vote

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

16 years 4 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

132

click to vote

DMKD
2000
ACM

110views Data Mining» more DMKD 2000»

Combining Strategies for Extracting Relations from Text Collections

15 years 7 months ago

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...

Eugene Agichtein, Eleazar Eskin, Luis Gravano

claim paper

Read More »

« Prev « First page 7 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers