Search Sciweavers | Sciweavers

1353 search results - page 218 / 271

» Text Indexing with Errors

192

click to vote

CN
1998

207views more CN 1998»

The Anatomy of a Large-Scale Hypertextual Web Search Engine

15 years 5 months ago

Download infolab.stanford.edu

In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the...

Sergey Brin, Lawrence Page

claim paper

Read More »

147

click to vote

IJDAR
2010

110views more IJDAR 2010»

Locating and parsing bibliographic references in HTML medical articles

15 years 4 months ago

Download archive.nlm.nih.gov

The set of references that typically appear toward the end of journal articles is sometimes, though not always, a ﬁeld in bibliographic (citation) databases. But even if referenc...

Jie Zou, Daniel X. Le, George R. Thoma

claim paper

Read More »

167

click to vote

ACL
2006

146views Computational Linguistics» more ACL 2006»

An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition

15 years 7 months ago

Download acl.ldc.upenn.edu

This paper shows that a simple two-stage approach to handle non-local dependencies in Named Entity Recognition (NER) can outperform existing approaches that handle non-local depen...

Vijay Krishnan, Christopher D. Manning

claim paper

Read More »

173

Voted

WWW
2008
ACM

124views Internet Technology» more WWW 2008»

Improving relevance judgment of web search results with image excerpts

16 years 6 months ago

Download www2008.org

Current web search engines return result pages containing mostly text summary even though the matched web pages may contain informative pictures. A text excerpt (i.e. snippet) is ...

Zhiwei Li, Shuming Shi, Lei Zhang

claim paper

Read More »

167

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 6 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

« Prev « First page 218 / 271 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers