Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic detection of cue points for the emulation of DJ mixing
Umeå University, Faculty of Science and Technology, Department of Computing Science. (HPAC)ORCID iD: 0000-0001-5022-1686
Department of Music, Universidad EAFIT, Medellín 050022, Colombia.
Umeå University, Faculty of Science and Technology, Department of Computing Science. Umeå University, Faculty of Science and Technology, High Performance Computing Center North (HPC2N).ORCID iD: 0000-0002-4972-7097
2022 (English)In: Computer music journal, ISSN 0148-9267, E-ISSN 1531-5169, Vol. 46, no 3, p. 67-82Article in journal (Refereed) Published
Abstract [en]

The automatic identification of cue points is a central task in applications as diverse as music thumbnailing, generation of mash ups, and DJ mixing. Our focus lies in electronic dance music and in a specific kind of cue point, the “switch point,” that makes it possible to automatically construct transitions between tracks, mimicking what professional DJs do. We present two approaches for the detection of switch points. One embodies a few general rules we established from interviews with professional DJs, the other models a manually annotated dataset that we curated. Both approaches are based on feature extraction and novelty analysis. From an evaluation conducted on previously unknown tracks, we found that about 90 percent of the points generated can be reliably used in the context of a DJ mix.

Place, publisher, year, edition, pages
MIT Press, 2022. Vol. 46, no 3, p. 67-82
National Category
Signal Processing Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-216393DOI: 10.1162/comj_a_00652ISI: 001101195600004Scopus ID: 2-s2.0-85177430629OAI: oai:DiVA.org:umu-216393DiVA, id: diva2:1811100
Available from: 2023-11-10 Created: 2023-11-10 Last updated: 2025-04-24Bibliographically approved
In thesis
1. Towards automatic DJ mixing: cue point detection and drum transcription
Open this publication in new window or tab >>Towards automatic DJ mixing: cue point detection and drum transcription
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Mot automatisk DJ-mixning : cue point-detektering och trumtranskription
Abstract [en]

With this thesis, we aim to automate the creation of DJ mixes. A DJ mix consists of an uninterrupted sequence of music, constructed by playing tracks one after the other, to improve the listening experience for the audience. Thus, to be able to build mixes automatically, we first need to understand the tracks we want to mix. This is done by extracting information from the audio signal. Specifically, we retrieve two pieces of information that are essential for DJs: cue points and drum transcription. In the field of music information retrieval, the two associated tasks are cue point detection and automatic drum transcription.

With cue point detection, we identify the positions in the tracks that can be used to create pleasant transitions in the mix. DJs have a good intuition on how to detect these positions. However, it is not straightforward to transform their intuition into a computer program because of the semantic gap between the two. To solve this problem we propose multiple approaches based on either expert knowledge or machine learning. Further, by interpreting the resulting models from our approaches, we also reflect on the musical content that is linked to the presence of cue points.

With automatic drum transcription, we aim to retrieve the position and the instrument of the notes played on the drumkit, to characterize the musical content of the tracks. To create the transcription, the most promising method is based on supervised deep learning. That is, models trained on labeled datasets. However, because of the difficulty of creating the annotations, the datasets available for training are usually limited in size or diversity. Thus, we propose novel methods to create better training data, either with real-world or synthetic music tracks. Further, by investigating thoroughly the performance of the models resulting from the training data, we deduce the most relevant characteristics of a dataset that help train models.

The solutions we proposed for both tasks of cue point detection and automatic drum transcription achieve high levels of accuracy. By investigating how these tasks reach this accuracy, we further our understanding of music information retrieval. And by open-sourcing our contributions, we make these findings reproducible. With the software resulting from this research, we created a proof of concept for automatic DJ mixing.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2024. p. 34
Series
Report / UMINF, ISSN 0348-0542 ; 24.08
Keywords
Music Information Retrieval, Cue Point Detection, Automatic Drum Transcription
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-228266 (URN)9789180704533 (ISBN)9789180704540 (ISBN)
Public defence
2024-09-02, MIT.C.343, MIT-huset, Umeå, 13:00 (English)
Opponent
Supervisors
Available from: 2024-08-15 Created: 2024-08-07 Last updated: 2024-08-09Bibliographically approved

Open Access in DiVA

fulltext(1100 kB)450 downloads
File information
File name FULLTEXT02.pdfFile size 1100 kBChecksum SHA-512
81ec8176cfef8f27af7072c169f9e350ea7751d7fa467daa4929c24d7e5d5196b35bdeae82517621ad2f1b0fe7712f524d375162f61ba343599c8d2da43a987e
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Zehren, MickaëlBientinesi, Paolo

Search in DiVA

By author/editor
Zehren, MickaëlBientinesi, Paolo
By organisation
Department of Computing ScienceHigh Performance Computing Center North (HPC2N)
In the same journal
Computer music journal
Signal ProcessingComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 450 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 411 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf