Search Sciweavers | Sciweavers


» Digital Documents and Media

SIGIR
2011
ACM

Information Technology

Faster top-k document retrieval using block-max indexes

14 years 9 months ago

Download cis.poly.edu

Large search engines process thousands of queries per second over billions of documents, making query processing a major performance bottleneck. An important class of optimization...

Shuai Ding, Torsten Suel

WWW
2008
ACM

Internet Technology

As we may perceive: finding the boundaries of compound documents on the web

16 years 6 months ago

Download www2008.org

This paper considers the problem of identifying on the Web compound documents (cDocs): groups of web pages that in aggregate constitute semantically coherent information entities...

Pavel Dmitriev

KDD
2007
ACM

Data Mining

XProj: a framework for projected structural clustering of XML documents

16 years 6 months ago

Download www.cs.rpi.edu

XML has become a popular method of data representation both on the web and in databases in recent years. One of the reasons for the popularity of XML has been its ability to encod...

Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua F...

SIGIR
2005
ACM

Information Technology

Boosted decision trees for word recognition in handwritten document retrieval

15 years 11 months ago

Download maven.smith.edu

Recognition and retrieval of historical handwritten material is an unsolved problem. We propose a novel approach to recognizing and retrieving handwritten manuscripts, based upon ...

Nicholas R. Howe, Toni M. Rath, R. Manmatha

CIKM
2004
Springer

Information Technology

Hierarchical document categorization with support vector machines

15 years 11 months ago

Download www.cs.brown.edu

Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques ...

Lijuan Cai, Thomas Hofmann
