Search Sciweavers | Sciweavers

708 search results - page 34 / 142

» Identifying Content Blocks from Web Documents

130

click to vote

SAC
2006
ACM

165views Applied Computing» more SAC 2006»

Template detection for large scale search engines

15 years 9 months ago

Download wwwcsif.cs.ucdavis.edu

Templates in web sites hurt search engine retrieval performance, especially in content relevance and link analysis. Current template removal methods suﬀer from processing speed ...

Liang Chen, Shaozhi Ye, Xing Li

claim paper

Read More »

159

click to vote

INTERACT
2003

132views Human Computer Interaction» more INTERACT 2003»

A Granular Approach to Web Search Result Presentation

15 years 4 months ago

Download www.idemployee.id.tue.nl

: In this paper we propose and evaluate interfaces for presenting the results of web searches. Sentences, taken from the top retrieved documents, are used as fine-grained represent...

Ryen W. White

claim paper

Read More »

136

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

15 years 10 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

125

click to vote

WWW
2006
ACM

179views Internet Technology» more WWW 2006»

Detecting spam web pages through content analysis

16 years 3 months ago

Download research.microsoft.com

In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...

Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...

claim paper

Read More »

116

click to vote

APWEB
2003
Springer

119views Internet Technology» more APWEB 2003»

Mining "Hidden Phrase" Definitions from the Web

15 years 6 months ago

Download www.public.asu.edu

Keyword searching is the most common form of document search on the Web. Many Web publishers manually annotate the META tags and titles of their pages with frequently queried phras...

Hung V. Nguyen, P. Velamuru, Deepak Kolippakkam, H...

claim paper

Read More »

« Prev « First page 34 / 142 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers