Search Sciweavers | Sciweavers

38 search results - page 3 / 8

» The indexable web is more than 11.5 billion pages

click to vote

APWEB
2004
Springer

172views Internet Technology» more APWEB 2004»

A Query-Dependent Duplicate Detection Approach for Large Scale Search Engines

14 years 2 months ago

Download www.net-glyph.org

Duplication of Web pages greatly hurts the perceived relevance of a search engine. Existing methods for detecting duplicated Web pages can be classified into two categories, i.e. o...

Shaozhi Ye, Ruihua Song, Ji-Rong Wen, Wei-Ying Ma

claim paper

Read More »

click to vote

CIKM
2006
Springer

150views Information Technology» more CIKM 2006»

Knowing a web page by the company it keeps

14 years 2 months ago

Download www.cse.lehigh.edu

Web page classification is important to many tasks in information retrieval and web mining. However, applying traditional textual classifiers on web data often produces unsatisfyi...

Xiaoguang Qi, Brian D. Davison

claim paper

Read More »

click to vote

INAP
2001
Springer

181views Information Technology» more INAP 2001»

A Modern Approach to Searching the World Wide Web: Ranking Pages by Inference over Content

14 years 2 months ago

Download homepage.univie.ac.at

The Hypertext-based Webs such as Intranets contain a vast amount of information pertaining to an enormous number of subjects. It is, however, an organically grown and thus essentia...

Bronson Trevor, Edgar Weippl, Werner Winiwarter

claim paper

Read More »

click to vote

PVLDB
2010

161views more PVLDB 2010»

Annotating and Searching Web Tables Using Entities, Types and Relationships

13 years 8 months ago

Download www.comp.nus.edu.sg

Tables are a universal idiom to present relational data. Billions of tables on Web pages express entity references, attributes and relationships. This representation of relational...

Girija Limaye, Sunita Sarawagi, Soumen Chakrabarti

claim paper

Read More »

click to vote

CIKM
2008
Springer

146views Information Technology» more CIKM 2008»

Indexing and retrieval of a Greek corpus

14 years 10 days ago

Download www.mendeley.com

Greek is one of the most difficult languages to handle in Web Information Retrieval (IR) related tasks. Its difficulty stems from the fact that it is grammatically, morphologicall...

Georgios Paltoglou, Michail Salampasis, Fotis Laza...

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers