Sciweavers

DILS
2007
Springer

Fast Approximate Duplicate Detection for 2D-NMR Spectra

2D nuclear magnetic resonance (NMR) spectroscopy is a powerful analytical method for elucidating the chemical structure of molecules. In contrast to 1D-NMR spectra, 2D-NMR spectra correlate the chemical shifts of ¹H and ¹³C simultaneously. Curating or merging large spectral libraries requires duplicate detection that is both robust and fast. We propose a definition of duplicates with the robustness properties that 2D-NMR experiments require. A major gain in runtime performance over previously proposed heuristics is achieved by mapping the spectra to simple discrete objects. We propose several appropriate data transformations for this task. To compensate for slight variations in the mapped spectra, we use hashing functions that follow the locality-sensitive hashing scheme and identify duplicates by hash collisions.

1 Motivation

Nuclear magnetic resonance (NMR) spectra are important for analyzing unknown natural products. In contrast to standard one-dimensional NMR spec...
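The idea summarized in the abstract (map each spectrum to a simple discrete object, then detect duplicates through hash collisions) can be illustrated with a minimal sketch. This is not the authors' implementation: the grid widths, the function names, and the choice of MinHash with banding as the locality-sensitive hashing family are all assumptions made for illustration. A spectrum is assumed to be a non-empty list of (¹H ppm, ¹³C ppm) peak positions.

```python
import random
from collections import defaultdict

PRIME = 2_147_483_647  # Mersenne prime 2^31 - 1, modulus for the linear hashes


def discretize(peaks, h_step=0.05, c_step=0.5):
    """Map (1H ppm, 13C ppm) peaks to a set of integer grid-cell ids.

    The bin widths are hypothetical defaults, not values from the paper.
    """
    cells = set()
    for h_ppm, c_ppm in peaks:
        i = int(h_ppm // h_step)  # grid index along the 1H axis
        j = int(c_ppm // c_step)  # grid index along the 13C axis
        cells.add(i * 10_000 + j)
    return cells


def make_hashes(n, seed=0):
    """Draw n random linear hash functions x -> (a*x + b) mod PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(n)]


def signature(cells, hashes):
    """MinHash signature of a non-empty set of cell ids."""
    return tuple(min((a * x + b) % PRIME for x in cells) for a, b in hashes)


def find_candidates(spectra, n_hashes=20, band_size=2, seed=0):
    """Return pairs of spectrum names that collide in at least one band."""
    hashes = make_hashes(n_hashes, seed)
    buckets = defaultdict(set)
    for name, peaks in spectra.items():
        sig = signature(discretize(peaks), hashes)
        for start in range(0, n_hashes, band_size):
            # Each band of the signature acts as one hash key.
            buckets[(start, sig[start:start + band_size])].add(name)
    pairs = set()
    for names in buckets.values():
        ordered = sorted(names)
        for idx, first in enumerate(ordered):
            for second in ordered[idx + 1:]:
                pairs.add((first, second))
    return pairs


# Made-up example data: "b" is "a" with small shift deviations, "c" is unrelated.
spectra = {
    "a": [(7.26, 128.40), (3.71, 55.20), (1.22, 14.10)],
    "b": [(7.27, 128.45), (3.72, 55.25), (1.23, 14.15)],
    "c": [(2.12, 30.20), (5.53, 101.30)],
}
print(find_candidates(spectra))  # → {('a', 'b')}
```

In this sketch, small deviations that stay within the same grid cell produce identical signatures, and spectra that share most of their cells are still likely to collide in some band. Peaks that lie close to a cell boundary can fall into different cells, so a practical system would need a more careful transformation than this fixed grid.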
Björn Egert, Steffen Neumann, Alexander Hinneburg
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where DILS
Authors Björn Egert, Steffen Neumann, Alexander Hinneburg