AIMAIM
Back to publications

AAAI · 2025 · Vol. 39, Issue 22 · pp. 23687–23695

Detecting Music Performance Errors with Transformers

Abstract

Beginner musicians often struggle to identify specific errors in their performances, such as playing incorrect notes or rhythms. There are two limitations in existing tools for music error detection: (1) Existing approaches rely on automatic alignment; therefore, they are prone to errors caused by small deviations between alignment targets. (2) There is a lack of sufficient datasets to train music error detection models. In this work, we introduce a transformer-based approach that takes both audio and aligned score representations as inputs and learns to identify pitch and rhythm errors without depending on perfect alignment. We also release a synthesized dataset of erroneous performances built on top of the MusicNet corpus to enable training and evaluation of error detection models.

Authors

Benjamin Shiue-Hal Chou, Purvish Jajal, Nicholas John Eliopoulos, Tim Nadolsky, Cheng-Yun Yang, Nikita Ravi, James C. Davis, Kristen Yeon-Ji Yun, Yung-Hsiang Lu

Venue

Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 39, Issue 22, pp. 23687–23695 · 2025

Cite this paper

Chou, B. S.-H., Jajal, P., Eliopoulos, N. J., Nadolsky, T., Yang, C.-Y., Ravi, N., et al. (2025). Detecting Music Performance Errors with Transformers. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23687–23695. https://doi.org/10.1609/aaai.v39i22.34539

Part of

Related work

  • LadderSym: A Multimodal Interleaved Transformer for Music Practice Error Detection

    ICLR · 2026

    Read publication
  • Real-Time Cellist Postural Evaluation With On-Device Computer Vision

    HCII · 2026

    Read publication