Search Sciweavers | Sciweavers

591 search results - page 6 / 119

» Extracting Route Directions from Web Pages

252

Voted

WWW
2010
ACM

300views Internet Technology» more WWW 2010»

Automatic extraction of clickable structured web contents for name entity queries

16 years 2 months ago

Download research.microsoft.com

Today the major web search engines answer queries by showing ten result snippets, which need to be inspected by users for identifying relevant results. In this paper we investigat...

Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu

claim paper

Read More »

218

click to vote

AIRWEB
2007
Springer

214views Internet Technology» more AIRWEB 2007»

Extracting Link Spam using Biased Random Walks from Spam Seed Sets

16 years 1 months ago

Download airweb.cse.lehigh.edu

Link spam deliberately manipulates hyperlinks between web pages in order to unduly boost the search engine ranking of one or more target pages. Link based ranking algorithms such ...

Baoning Wu, Kumar Chellapilla

claim paper

Read More »

198

click to vote

DEEC
2006
IEEE

113views Information Technology» more DEEC 2006»

Optimization of Automatic Navigation to Hidden Web Pages by Ranking-Based Browser Preloading

16 years 1 months ago

Download www.tic.udc.es

Web applications have become an invaluable source of information for many different vertical solutions, but their complex navigation and semistructured format make their informatio...

Justo Hidalgo, José Losada, Manuel Á...

claim paper

Read More »

176

click to vote

SOFSEM
2007
Springer

156views Theoretical Computer Science» more SOFSEM 2007»

Creating Permanent Test Collections of Web Pages for Information Extraction Research

16 years 1 months ago

Download www.dbai.tuwien.ac.at

In the research area of automatic web information extraction, there is a need for permanent and annotated web page collections enabling objective performance evaluation of differen...

Bernhard Pollak, Wolfgang Gatterbauer

claim paper

Read More »

199

click to vote

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

16 years 7 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

« Prev « First page 6 / 119 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers