The Internet brings us access to multimedia databases with billions of data instances. The massive amount of data available to researchers and application developers brings both opportunities and challenges. In particular, massive amount of data makes data-driven approach feasible, but at the same time, it demands scalable algorithms. In this tutorial we present a range of algorithms and approaches that make it easy/easier to scale our work to Internetsized collections of multimedia data. The tutorial will start by providing attendees an overview and pointers to the tools that will allow them to scale their work to massive datasets. The tutorial discusses the theoretical and practical problem with large data, applications where large amounts of data are important to consider, types of algorithms that are practical with such large datasets, and examples of implementation techniques that make these algorithms practical. Many real-world examples and results illustrate the tutorial. Categ...
Malcolm Slaney, Edward Y. Chang