MM
2015
ACM

Accelerating Large-scale Image Retrieval on Heterogeneous Architectures with Spark

Apache Spark is a general-purpose cluster computing system for big data processing that has recently drawn much attention from fields such as pattern recognition and machine learning. Unlike MapReduce, Spark is especially suitable for iterative and interactive computations. Building on the computing power of Spark, this work proposes a utility library, referred to as IRlib, to accelerate large-scale image retrieval applications by jointly harnessing the power of GPUs. Similar to MLlib, the built-in machine learning library of Spark, IRlib fits into the Spark APIs and benefits from the powerful functionalities of Spark. The main contributions of IRlib are twofold. First, IRlib provides a uniform set of APIs for programming image retrieval applications. Second, it dramatically boosts the computational performance of Spark equipped with multiple GPUs by providing high-performance modules for common image retrieval algorithms. Comparative exper...
Hanli Wang, Bo Xiao, Lei Wang, Jun Wu
Added 14 Apr 2016
Updated 14 Apr 2016
Type Journal
Year 2015
Where MM