Search Sciweavers | Sciweavers

45 search results - page 6 / 9

» An Efficient Similarity Join Algorithm with Cosine Similarit...

190

Voted

CIKM
2008
Springer

133views Information Technology» more CIKM 2008»

Achieving both high precision and high recall in near-duplicate detection

15 years 9 months ago

Download www.infomall.cn

To find near-duplicate documents, fingerprint-based paradigms such as Broder's shingling and Charikar's simhash algorithms have been recognized as effective approaches a...

Lian'en Huang, Lei Wang, Xiaoming Li

claim paper

Read More »

341

Voted

ICDE
2006
IEEE

156views Database» more ICDE 2006»

Reasoning About Approximate Match Query Results

16 years 8 months ago

Download www.yorku.ca

Join techniques deploying approximate match predicates are fundamental data cleaning operations. A variety of predicates have been utilized to quantify approximate match in such o...

Sudipto Guha, Nick Koudas, Divesh Srivastava, Xiao...

claim paper

Read More »

226

click to vote

WWW
2004
ACM

128views Internet Technology» more WWW 2004»

Web data integration using approximate string join

16 years 8 months ago

Download www.iw3c2.org

Web data integration is an important preprocessing step for web mining. It is highly likely that several records on the web whose textual representations differ may represent the ...

Yingping Huang, Gregory R. Madey

claim paper

Read More »

241

click to vote

WWW
2008
ACM

221views Internet Technology» more WWW 2008»

Contextual advertising by combining relevance with click feedback

16 years 8 months ago

Download www2008.org

Contextual advertising supports much of the Web's ecosystem today. User experience and revenue (shared by the site publisher ad the ad network) depend on the relevance of the...

Deepayan Chakrabarti, Deepak Agarwal, Vanja Josifo...

claim paper

Read More »

310

click to vote

ICDE
2003
IEEE

144views Database» more ICDE 2003»

Scalable template-based query containment checking for web semantic caches

16 years 8 months ago

Download www-2.cs.cmu.edu

Semantic caches, originally proposed for client-server database systems, are being recently deployed to accelerate the serving of dynamic web content by transparently caching data...

Khalil Amiri, Sanghyun Park, Renu Tewari, Sriram P...

claim paper

Read More »

« Prev « First page 6 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers