TY - JOUR
T1 - Investigating music collections at different scales with audioDB
AU - Rhodes, Christophe
AU - Crawford, Tim
AU - Casey, Michael
AU - d'Inverno, Mark
N1 - This work was supported by EPSRC (grant number EP/E02274X/1).
Publisher Copyright:
© 2010 Taylor & Francis
PY - 2010/12/1
Y1 - 2010/12/1
N2 - Content-based search of music collections presents differing challenges at different scales and according to the task at hand. In this paper, we consider a number of different use cases for content-based similarity search, at scales ranging between a detailed investigation of a single track to searching for fragments of a track against a collection of millions of media items. We pay particular attention to the varying tradeoff between precision and recall in these contexts, both from the point of view of system evaluation and from the point of view of a user of a system searching an unknown collection. We present the audioDB software for content-based search, and describe how it has been used to address use cases across these different collection sizes; in addition we show that the interpretation of similarity as a distance which can be modelled statistically, initially motivated by our desire to achieve sublinear retrieval time on large databases, can be used to improve the precision of searches over small and medium-sized collections.
AB - Content-based search of music collections presents differing challenges at different scales and according to the task at hand. In this paper, we consider a number of different use cases for content-based similarity search, at scales ranging between a detailed investigation of a single track to searching for fragments of a track against a collection of millions of media items. We pay particular attention to the varying tradeoff between precision and recall in these contexts, both from the point of view of system evaluation and from the point of view of a user of a system searching an unknown collection. We present the audioDB software for content-based search, and describe how it has been used to address use cases across these different collection sizes; in addition we show that the interpretation of similarity as a distance which can be modelled statistically, initially motivated by our desire to achieve sublinear retrieval time on large databases, can be used to improve the precision of searches over small and medium-sized collections.
UR - http://www.scopus.com/inward/record.url?scp=79955440261&partnerID=8YFLogxK
U2 - 10.1080/09298215.2010.516832
DO - 10.1080/09298215.2010.516832
M3 - Journal article
AN - SCOPUS:79955440261
SN - 0929-8215
VL - 39
SP - 337
EP - 348
JO - Journal of New Music Research
JF - Journal of New Music Research
IS - 4
ER -