Search Sciweavers | Sciweavers

308 search results - page 57 / 62

» Syntactic Similarity of Web Documents

197

click to vote

DASFAA
2007
IEEE

138views Database» more DASFAA 2007»

An Original Semantics to Keyword Queries for XML Using Structural Patterns

16 years 1 months ago

Download web.njit.edu

XML is by now the de facto standard for exporting and exchanging data on the web. The need for querying XML data sources whose structure is not fully known to the user and the need...

Dimitri Theodoratos, Xiaoying Wu

claim paper

Read More »

236

Voted

CIKM
2004
Springer

128views Information Technology» more CIKM 2004»

Exploiting hierarchical relationships in conceptual search

16 years 25 days ago

Download citeseer.uark.edu

As the number of available Web pages grows, users experience increasing difficulty finding documents relevant to their interests. One of the underlying reasons for this is that mo...

Devanand Ravindran, Susan Gauch

claim paper

Read More »

208

click to vote

SIGIR
2000
ACM

112views Information Technology» more SIGIR 2000»

Evaluating evaluation measure stability

15 years 11 months ago

Download www-lipn.univ-paris13.fr

: This paper presents a novel way of examining the accuracy of the evaluation measures commonly used in information retrieval experiments. It validates several of the rules-of-thum...

Chris Buckley, Ellen M. Voorhees

claim paper

Read More »

211

click to vote

ICAIL
2007
ACM

147views Artificial Intelligence» more ICAIL 2007»

Essential deduplication functions for transactional databases in law firms

15 years 11 months ago

Download www.conradweb.org

As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...

Jack G. Conrad, Edward L. Raymond

claim paper

Read More »

215

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 7 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

« Prev « First page 57 / 62 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers