Search Sciweavers | Sciweavers

367 search results - page 1 / 74

» Duplicate detection in probabilistic data

185

click to vote

ICDE
2010
IEEE

208views Database» more ICDE 2010»

Duplicate detection in probabilistic data

15 years 6 months ago

Download eprints.eemcs.utwente.nl

Abstract— Collected data often contains uncertainties. Probabilistic databases have been proposed to manage uncertain data. To combine data from multiple autonomous probabilistic...

Fabian Panse, Maurice van Keulen, Ander de Keijzer...

claim paper

Read More »

172

click to vote

ICDE
2010
IEEE

204views Database» more ICDE 2010»

ProbClean: A probabilistic duplicate detection system

16 years 1 months ago

Download www.cs.uwaterloo.ca

— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...

George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...

claim paper

Read More »

173

click to vote

P2P
2010
IEEE

202views Communications» more P2P 2010»

Optimizing Near Duplicate Detection for P2P Networks

15 years 5 months ago

Download www.l3s.de

—In this paper, we propose a probabilistic algorithm for detecting near duplicate text, audio, and video resources efﬁciently and effectively in large-scale P2P systems. To thi...

Odysseas Papapetrou, Sukriti Ramesh, Stefan Siersd...

claim paper

Read More »

147

click to vote

KDD
2005
ACM

104views Data Mining» more KDD 2005»

A hit-miss model for duplicate detection in the WHO drug safety database

16 years 7 months ago

Download www.comp.nus.edu.sg

The WHO Collaborating Centre for International Drug Monitoring in Uppsala, Sweden, maintains and analyses the world's largest database of reports on suspected adverse drug re...

Andrew Bate, G. Niklas Norén, Roland Orre

claim paper

Read More »

166

click to vote

ESWS
2010
Springer

138views Internet Technology» more ESWS 2010»

Efficient Semantic-Aware Detection of Near Duplicate Resources

15 years 10 months ago

Download www.l3s.de

Abstract. Efficiently detecting near duplicate resources is an important task when integrating information from various sources and applications. Once detected, near duplicate reso...

Ekaterini Ioannou, Odysseas Papapetrou, Dimitrios ...

claim paper

Read More »

« Prev « First page 1 / 74 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers