Search Sciweavers | Sciweavers

543 search results - page 58 / 109

» Exploiting content redundancy for web information extraction

130

click to vote

PKDD
2007
Springer

120views Data Mining» more PKDD 2007»

Site-Independent Template-Block Detection

15 years 11 months ago

Download research.microsoft.com

Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...

Aleksander Kolcz, Wen-tau Yih

claim paper

Read More »

150

click to vote

SIGMOD
2004
ACM

150views Database» more SIGMOD 2004»

When one Sample is not Enough: Improving Text Database Selection Using Shrinkage

16 years 5 months ago

Download qprober.cs.columbia.edu

Database selection is an important step when searching over large numbers of distributed text databases. The database selection task relies on statistical summaries of the databas...

Panagiotis G. Ipeirotis, Luis Gravano

claim paper

Read More »

140

click to vote

CIKM
2009
Springer

140views Information Technology» more CIKM 2009»

Easiest-first search: towards comprehension-based web search

15 years 12 months ago

Download www.dl.kuis.kyoto-u.ac.jp

Although Web search engines have become information gateways to the Internet, for queries containing technical terms, search results often contain pages that are difﬁcult to be ...

Makoto Nakatani, Adam Jatowt, Katsumi Tanaka

claim paper

Read More »

144

click to vote

WWW
2010
ACM

209views Internet Technology» more WWW 2010»

Shout out: integrating news and reader comments

16 years 8 days ago

Download infolab.northwestern.edu

A useful approach for enabling computers to automatically create new content is utilizing the text, media, and information already present on the World Wide Web. The newly created...

Lisa M. Gandy, Nathan D. Nichols, Kristian J. Hamm...

claim paper

Read More »

124

click to vote

JASIS
2006

106views more JASIS 2006»

Web unit-based mining of homepage relationships

15 years 5 months ago

Download www.cais.ntu.edu.sg

Abstract Homepages usually describe important semantic information about conceptual or physical entities, and are hence the main targets for searching and browsing. To facilitate s...

Aixin Sun, Ee-Peng Lim

claim paper

Read More »

« Prev « First page 58 / 109 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers