Multimedia Retrieval and Analysis with Cottontail DB

Authors
Ralph Gasser, Luca Rossetto, Silvan Heller, Heiko Schuldt
Type
Article
Date
2021/3
Appears in
ACM SIGMM Records
Publisher
ACM SIGMM
Abstract

Analysis and retrieval of media collections get more and more challenging the larger the collections become. Keeping everything in the main memory becomes less feasible, and more and more time and effort have to be spent to deal with the data management. However, traditional relational databases do not support primitives often used in multimedia workloads, such as the nearest-neighbour search on vectors. In this column, we introduce Cottontail DB, an open-source database management system for multimedia features. Cottontail DB supports traditional relational database operations and text retrieval based on Lucene and, most importantly, efficient vector-space retrieval operations for large datasets.