Search Sciweavers | Sciweavers

2764 search results - page 549 / 553

» Information Retrieval by Semantic Similarity

click to vote

WWW
2007
ACM

175views Internet Technology» more WWW 2007»

Efficient search in large textual collections with redundancy

14 years 8 months ago

Download www2007.org

Current web search engines focus on searching only the most recent snapshot of the web. In some cases, however, it would be desirable to search over collections that include many ...

Jiangong Zhang, Torsten Suel

claim paper

Read More »

click to vote

SIGMOD
2008
ACM

142views Database» more SIGMOD 2008»

Cost-based variable-length-gram selection for string collections to support approximate queries efficiently

14 years 7 months ago

Download www.db-infotech.cn

Approximate queries on a collection of strings are important in many applications such as record linkage, spell checking, and Web search, where inconsistencies and errors exist in...

Xiaochun Yang, Bin Wang, Chen Li

claim paper

Read More »

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

14 years 2 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

click to vote

JCDL
2006
ACM

176views Education» more JCDL 2006»

A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books

14 years 1 months ago

Download ciir.cs.umass.edu

A number of projects are creating searchable digital libraries of printed books. These include the Million Book Project, the Google Book project and similar eﬀorts from Yahoo an...

Shaolei Feng, R. Manmatha

claim paper

Read More »

click to vote

DL
1998
Springer

159views Digital Library» more DL 1998»

CiteSeer: An Automatic Citation Indexing System

13 years 12 months ago

Download clgiles.ist.psu.edu

We present CiteSeer: an autonomous citation indexing system which indexes academic literature in electronic format (e.g. Postscript files on the Web). CiteSeer understands how to ...

C. Lee Giles, Kurt D. Bollacker, Steve Lawrence

claim paper

Read More »

« Prev « First page 549 / 553 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers