Search Sciweavers | Sciweavers

498 search results - page 15 / 100

» Robust web content extraction

159

click to vote

SIGIR
2006
ACM

117views Information Technology» more SIGIR 2006»

Getting work done on the web: supporting transactional queries

16 years 20 days ago

Download www.eecs.umich.edu

Many searches on the web have a transactional intent. We argue that pages satisfying transactional needs can be distinguished from the more common pages that have some information...

Yunyao Li, Rajasekar Krishnamurthy, Shivakumar Vai...

claim paper

Read More »

177

Voted

CIKM
2006
Springer

186views Information Technology» more CIKM 2006»

A fast and robust method for web page template detection and removal

15 years 10 months ago

Download www.cs.utah.edu

The widespread use of templates on the Web is considered harmful for two main reasons. Not only do they compromise the relevance judgment of many web IR and web mining methods suc...

Karane Vieira, Altigran Soares da Silva, Nick Pint...

claim paper

Read More »

170

Voted

SEMWEB
2009
Springer

153views Internet Technology» more SEMWEB 2009»

Policy-Aware Content Reuse on the Web

16 years 1 months ago

Download dig.csail.mit.edu

The Web allows users to share their work very eﬀectively leading to the rapid re-use and remixing of content on the Web including text, images, and videos. Scientiﬁc research d...

Oshani Seneviratne, Lalana Kagal, Tim Berners-Lee

claim paper

Read More »

165

click to vote

LAWEB
2003
IEEE

83views Internet Technology» more LAWEB 2003»

On the Image Content of the Chilean Web

15 years 12 months ago

Download www.ciw.cl

In this paper we perform a study of the image contents of the Chilean web (.cl domain) using automatic feature extraction, content-based analysis and face detection algorithms. In...

Alejandro Jaimes, Javier Ruiz-del-Solar, Rodrigo V...

claim paper

Read More »

151

click to vote

ICDE
2010
IEEE

255views Database» more ICDE 2010»

On supporting effective web extraction

16 years 1 months ago

Download rosaec.snu.ac.kr

— Commercial tuple extraction systems have enjoyed some success to extract tuples by regarding HTML pages as tree structures and exploiting XPath queries to ﬁnd attributes of t...

Wook-Shin Han, Wooseong Kwak, Hwanjo Yu

claim paper

Read More »

« Prev « First page 15 / 100 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers