On the robustness of cover version identification models: a study using cover versions from YouTube

Authors

  • Simon Hachmeier Humboldt-Universität zu Berlin, Germany
  • Robert Jäschke Humboldt-Universität zu Berlin, Germany

DOI:

https://doi.org/10.47989/ir30iConf47077

Keywords:

cover song, music, retrieval longtail

Abstract

Introduction. Recent advances in cover version identification have shown great success. However, models are usually tested on a fixed set of datasets which are relying on the online cover version database SecondHandSongs. It is unclear how well models perform on cover versions on online video platforms, which might exhibit alterations that are not expected.

Method. We annotate a subset of versions from YouTube sampled by a multi-modal uncertainty sampling approach and evaluate state-of-the-art cover version identification models.

Results. We find that existing models achieve significantly lower ranking performance on our dataset compared to a community dataset. We additionally measure the performance of different types of versions (e.g., instrumental versions) and find several types that are particularly hard to rank. Lastly, we provide a taxonomy of alterations in cover versions on the web.

Conclusions. We found that research in cover version identification shall be less dependent on SecondHandSongs but rather on more diverse datasets.

Published

2025-03-11

How to Cite

Hachmeier, S., & Jäschke, R. (2025). On the robustness of cover version identification models: a study using cover versions from YouTube. Information Research an International Electronic Journal, 30(iConf), 1103–1122. https://doi.org/10.47989/ir30iConf47077

Issue

Section

Peer-reviewed papers

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.