Skip to main content
Research

Publications: DR Emmanouil Benetos

( 2026 ) . Domain-invariant representation learning of bird sounds . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Barcelona, Spain ) from: 04/05/2026 to: 08/05/2026 ,
( 2026 ) . Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Barcelona, Spain ) from: 04/05/2026 to: 08/05/2026 ,
( 2026 ) . OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,
Mitcheltree C, Lostanlen V, Benetos E, Lagrange M ( 2026 ) . SCRAPL: Scattering Transform with Random Paths for Machine Learning . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,
( 2026 ) . YuE: Scaling Open Foundation Models for Long-Form Music Generation . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,
( 2026 ) . Computational hermeneutics: evaluating generative AI as a cultural technology . Frontiers in Artificial Intelligence vol. 9 , Article 1753041 ,
Tang X, Lei X, Zhu C, Chen S, Yuan R, Li Y, Oh C, Zhang G et al. ( 2025 ) . AutoMV: An Automatic Multi-Agent System for Music Video Generation .
Ma Z, Ma Y, Zhu Y, Yang C, Chao Y-W, Xu R, Chen W, Chen Y et al. ( 2025 ) . MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix . Conference: The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) from: 02/12/2025 to: 07/12/2025 ,
Li Y, Ma Y, Ma Y, Yuan R, Zhu K, Guo H, Liang Y, Liu J et al. ( 2025 ) . OmniBench: Towards The Future of Universal Omni-Language Models . Conference: The Thirty-Ninth Annual Conference on Neural Information Processing Systems. (NeurIPS 2025) from: 02/12/2025 to: 07/12/2025 ,
Kim H, Benetos E, Serra X ( 2025 ) . Velocity2DMs: A contextual modeling approach to dynamics marking prediction in piano performance . IEEE Signal Processing Letters vol. 32 ,
Chang SK, Dixon S, Benetos E ( 2025 ) . RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection . Conference: 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2025) ( Granlibakken Tahoe, Toahoe City, CA ) from: 12/10/2025 to: 15/10/2025 ,
Ma Y, Li S, Yu J, Benetos E, Maezawa A ( 2025 ) . CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,
Sarkar S, Moomjian V, Woods B, Benetos E, Sandler M ( 2025 ) . Perceptual errors in music source separation: looking beyond SDR averages . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,
Bhattacharjee A, Meresman Higgs I, Sandler M, Benetos E ( 2025 ) . Refining music sample identification with a self-supervised graph neural network . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR 2025) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,
Papaioannou C, Benetos E, Potamianos A ( 2025 ) . Universal Music Representations? Evaluating Foundation Models on World Music Corpora . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,
Zhang H, Liang J, Phan QH, Wang W, Benetos E ( 2025 ) . From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems . Conference: IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2025) ( Istanbul, Turkey ) from: 31/08/2025 to: 03/09/2025 ,
Huang J, Sousa F, Demirel E, Benetos E, Gadelha I ( 2025 ) . Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss . Conference: Interspeech 2025 ( Rotterdam, The Netherlands ) from: 21/08/2025 to: 17/08/2025 ,
Plachouras C, Guinot J, Fazekas G, Quinton E, Benetos E, Pauwels J ( 2025 ) . Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks . Conference: International Joint Conference on Neural Networks (IJCNN) ( Rome, Italy ) from: 30/06/2025 to: 05/07/2025 ,
Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al. ( 2025 ) . MuPT: A Generative Symbolic Music Pretrained Transformer . https://openreview.net/forum?id=iAK9oHp4Zz . Conference: International Conference on Learning Representations (ICLR) ( Singapore ) from: 24/04/2025 to: 28/04/2025 ,
Peeters G, Rafii Z, Fuentes M, Duan Z, Benetos E, Nam J, Mitsufuji Y ( 2025 ) . Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges . Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) vol. 00 , 1 - 5 .
De Almeida Nolasco IS, Stowell D, Benetos E ( 2025 ) . Acoustic identification of individual animals based on hierarchical contrastive learning . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,
Singh S, Bhattacharjee A, Benetos E ( 2025 ) . GraFPrint: A GNN-Based Approach for Audio Identification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,
Singh S, Benetos E, Phan H, Stowell D ( 2025 ) . LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,
Plachouras C, Benetos E, Pauwels J ( 2025 ) . Learning Music Audio Representations With Limited Data . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,
Huang J ( 2025 ) . Singing to speech conversion with generative flow . EURASIP Journal on Audio, Speech, and Music Processing vol. 2025 , Article 12 ,
Liang J, Liu X, Wang W, Plumbley M, Phan H, Benetos E ( 2025 ) . Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities . IEEE Transactions on Audio, Speech and Language Processing vol. 33 , 949 - 961 .
Papaioannou C, Benetos E ( 2025 ) . LC-Protonets: Multi-label Few-shot learning for world music audio tagging . IEEE Open Journal of Signal Processing vol. 6 , 138 - 146 .
Elisha S, McDowell A, Beguerisse-Díaz M ( 2024 ) . Classification of spontaneous and scripted speech for multilingual audio . Conference: IEEE Spoken Language Technology Workshop 2024 ( Macao, China ) from: 02/12/2024 to: 05/12/2024 , 489 - 495 .
Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Xue W ( 2024 ) . Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Franscisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,
Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J et al. ( 2024 ) . ComposerX: Multi-Agent Symbolic Music Composition with LLMs . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR), ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,
Weck B, Manco I, Benetos E, QUINTON E ( 2024 ) . MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,
Steinmetz C, Singh S, Comunit� M, Ibnyahya I, Yuan S, Benetos E, Reiss J ( 2024 ) . ST-ITO: Controlling audio effects for style transfer with inference-time optimization . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,
Chang SK, Benetos E, KIRCHHOFF H, Dixon S ( 2024 ) . ËœYourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation . Conference: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) ( London, UK ) from: 25/09/2024 to: 22/09/2024 ,
Torrisi A, De Almeida Nolasco IS, Versace E, Benetos E ( 2024 ) . Exploratory analysis of early-life chick calls . Conference: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) ( Kos, Greece ) from: 06/09/2024 to: 06/09/2024 ,
Ma Y, Øland A, Ragni A, Del Sette BM, Saitis C, Donahue C, Lin C, Plachouras C et al. ( 2024 ) . Foundation Models for Music: A Survey .
Liang J, Nolasco I, Ghani B, Phan H, Benetos E, Stowell D ( 2024 ) . Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection . Conference: 32nd European Signal Processing Conference (EUSIPCO 2024) ( Lyon, France ) from: 26/08/2024 to: 30/08/2024 ,
Huang J, Benetos E ( 2024 ) . Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model . Conference: 32nd European Signal Processing Conference (EUSIPCO) ( Lyon, France ) from: 26/08/2024 to: 30/08/2024 ,
Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y et al. ( 2024 ) . ChatMusician: Understanding and Generating Music Intrinsically with LLM . Conference: 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) ( Bangkok, Thailand ) from: 11/08/2024 to: 16/08/2024 ,
Xompero A, Bontonou M, Arbona J-M, Benetos E ( 2024 ) . Explaining models relating objects and privacy . Proceedings of CVPR 2024 Workshops . Conference: 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 ( Seattle Convention Center, Seattle WA, USA ) from: 18/06/2024 to: 18/06/2024 ,
Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W, Benetos E ( 2024 ) . MusiLingo: bridging music and text with pre-trained language models for music captioning and query response . Conference: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) ( Mexico City, Mexico ) from: 16/06/2024 to: 21/06/2024 , 3643 - 3655 .
Ozaki Y, Tierney A, Pfordresher PQ, McBride JM, Benetos E, Proutskova P, Chiba G, Liu F et al. ( 2024 ) . Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report . Science Advances vol. 10 , ( 20 )
Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD et al. ( 2024 ) . WavCraft: audio editing and generation with large language models . Conference: ICLR 2024 Workshop on LLM Agents ( Vienna, Austria ) from: 11/05/2024 to: 11/05/2024 ,
Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C et al. ( 2024 ) . MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training . Conference: International Conference on Learning Representations (ICLR) ( Vienna, Austria ) from: 07/05/2024 to: 11/05/2024 ,
Postolache E, Mariani G, Cosmo L ( 2024 ) . Generalized multi-source inference for text conditioned music diffusion models . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 6980 - 6984 .
Liang J, Phan QH, Benetos E ( 2024 ) . Learning from taxonomy: multi-label few-shot classification for everyday sound recognition . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 771 - 775 .
Li D, Ma Y, Wei W, KONG Q, Wu Y, Che M, Xia F, Benetos E et al. ( 2024 ) . MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 521 - 525 .
EDWARDS D, Dixon S, Benetos E, Maezawa A, Kusaka Y ( 2024 ) . A Data-Driven Analysis of Robust Automatic Piano Transcription . IEEE Signal Processing Letters vol. 31 , 681 - 685 .
Singh S, Steinmetz C, Benetos E, Phan QH, Stowell D ( 2024 ) . ATGNN: audio tagging graph neural network . IEEE Signal Processing Letters vol. 31 , 825 - 829 .
Deb O, Torr P ( 2023 ) . Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines . Conference: NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization ( New Orleans, USA ) from: 16/12/2023 to: 16/12/2023 ,
Manco I, Weck B, Doh S, Won M, Bodganov D, Wu Y, Tovstogan P, Benetos E et al. ( 2023 ) . The Song Describer dataset: a corpus of audio captions for music-and-language evaluation . Conference: NeurIPS Machine Learning for Audio Workshop ( New Orleans, USA ) from: 16/12/2023 to: 16/12/2023 ,
Yuan R, Ma Y, Li Y, Zhang G, Chen X, Yin H, Zhuo L, Liu Y et al. ( 2023 ) . MARBLE: Music Audio Representation Benchmark for Universal Evaluation . Conference: 37th Conference on Neural Information Processing Systems (NeurIPS) from: 10/12/2023 to: 16/12/2023 ,
Ragano A, Benetos E ( 2023 ) . Learning Music Representations with wav2vec 2.0 . Conference: 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) ( Letterkenny, Ireland ) from: 07/12/2023 to: 07/12/2023 ,
Papaioannou C, Benetos E, Potamianos A ( 2023 ) . From West to East: Who can understand the music of the others better? . Conference: 24th International Society for Music Information Retrieval Conference (ISMIR) ( Milan, Italy ) from: 05/11/2023 to: 09/11/2023 ,
Zhuo L, Yuan R, Pan J, Ma Y, Li Y, Zhang G, Liu S, Dannenberg R et al. ( 2023 ) . LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT . Conference: 24th International Society for Music Information Retrieval Conference (ISMIR) ( Milan, Italy ) from: 05/11/2023 to: 09/11/2023 ,
Sarkar S, Thorpe L, Benetos E, Sandler M ( 2023 ) . Leveraging Synthetic Data for Improving Chamber Ensemble Separation . Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) vol. 00 , 1 - 5 .
Vahidi C, Singh S, Benetos E, Phan H, Stowell D, Fazekas G, Lagrange M ( 2023 ) . Perceptual Musical Similarity Metric Learning with Graph Neural Networks . Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) vol. 00 , 1 - 5 .
Edwards D, Dixon S, Benetos E ( 2023 ) . PiJAMA: Piano Jazz with Automatic MIDI Annotations . Transactions of the International Society for Music Information Retrieval vol. 6 , ( 1 ) 89 - 102 .
Liang J, Liu X, Liu H, Phan H, Benetos E, Plumbley M, Wang W ( 2023 ) . Adapting Language-Audio Models as Few-Shot Audio Learners . Conference: 24th Annual Conference of the International Speech Communication Association (INTERSPEECH) ( Dublin, Ireland ) from: 20/08/2023 to: 24/08/2023 ,
Ma Y, Yuan R, Li Y, Zhang G, Chen X, Yin H, Lin C, Benetos E et al. ( 2023 ) . On the Effectiveness of Speech Self-supervised Learning for Music .
Ragano A, Benetos E, Chinen M, Becerra H, Chandan Karadagur Ananda R ( 2023 ) . A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality . Conference: Irish Signals & Systems Conference 2023 ( Dublin, Ireland ) from: 13/06/2023 to: 14/06/2023 ,
Ragano A, Benetos E ( 2023 ) . Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning . Conference: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) from: 04/06/2023 to: 10/06/2023 , 1 - 5 .
Li Y, Cao W, Xie W ( 2023 ) . Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes . IEEE Transactions on Multimedia vol. 26 , 1346 - 1360 .
Wang C, Benetos E, Wang S, Versace E . Joint Scattering for Automatic Chick Call Recognition . 2015 23rd European Signal Processing Conference (EUSIPCO) . Conference: 2022 30th European Signal Processing Conference (EUSIPCO)195 - 199 .
Li Y, Yuan R, Zhang G, Ma Y, Lin C, Chen X, Ragni A, Yin H et al. ( 2022 ) . Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning . Conference: DMRN+17: Digital Music Research Network One-day Workshop 2022 ( London, UK ) from: 20/12/2022 to: 20/12/2022 ,
Liu L, KONG Q, Morfi G-V, Benetos E ( 2022 ) . Performance MIDI-to-score conversion by neural beat tracking . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 04/12/2022 to: 08/12/2022 ,
Sarkar S, Benetos E, Sandler M ( 2022 ) . EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 05/12/2022 to: 08/12/2022 ,
Manco I, Benetos E, Fazekas G ( 2022 ) . Contrastive audio-language learning for music . https://ismir2022.ismir.net/ . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 04/12/2022 to: 08/12/2022 ,
Mai KT, Davies T ( 2022 ) . Explaining the decisions of anomalous sound detectors . https://dcase.community/workshop2022/ . Conference: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) ( Nancy, France ) from: 03/11/2022 to: 04/11/2022 ,
Liang J, Phan QH, Benetos E ( 2022 ) . Leveraging label hierarchies for few-shot everyday sound recognition . Conference: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) ( Nancy, France ) from: 03/11/2022 to: 04/11/2022 ,
Ozaki Y, Kuroyanagi J, McBride J, Proutskova P, Tierney A, Benetos E ( 2022 ) . Similarities and differences in a cross-linguistic sample of song and speech recordings . Conference: Joint Conference on Language Evolution ( Kanazawa, Japan ) from: 05/09/2022 to: 08/09/2022 ,
Singh S, Benetos E, Phan QH ( 2022 ) . Hypernetworks for sound event detection: a proof-of-concept . Conference: 30th European Signal Processing Conference (EUSIPCO 2022) ( Belgrade, Serbia ) from: 29/08/2022 to: 03/09/2022 , 429 - 433 .
Daikoku H, Ding S, Benetos E, Wood ALC, Shimizono T, Sanne US ( 2022 ) . Agreement among human and automated estimates of similarity in a global music sample . Conference: 10th International Workshop on Folk Music Analysis (FMA 2022) ( Sheffield, UK ) from: 14/06/2022 to: 17/06/2022 ,
Ou L, Guo Z, Benetos E, Han J, Wang Y ( 2022 ) . Exploring Transformer’s Potential on Automatic Piano Transcription . Conference: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) vol. 00 , 776 - 780 .
Huang J, Benetos E, Ewert S ( 2022 ) . Improving lyrics Alignment through Joint Pitch Detection . Conference: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing ( Singapore ) from: 22/05/2022 to: 27/05/2022 , 451 - 455 .
Manco I, Benetos E, Quinton E ( 2022 ) . Learning music audio representations via weak language supervision . Conference: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing ( Singapore ) from: 22/05/2022 to: 27/05/2022 , 456 - 460 .
Ragano A, Benetos E, Hines A ( 2022 ) . Automatic Quality Assessment of Digitized and Restored Sound Archives . Journal of the Audio Engineering Society vol. 70 , ( 4 ) 252 - 270 .
Wang C, Benetos E, Lostanlen V ( 2022 ) . Adaptive Scattering Transforms for Playing Technique Recognition . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 30 , 1407 - 1421 .
Benetos E, Ragano A, Sgroi D, Tuckwell A ( 2022 ) . Measuring national mood with music: using machine learning to construct a measure of national valence from audio data . Behavior Research Methods vol. 54 , ( 6 ) 3085 - 3092 .
Terenzi A, Nolasco I, Benetos E ( 2021 ) . Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity . IEEE Transactions on Audio Speech and Language Processing vol. 30 , 112 - 122 .
Bodo RPP, Benetos E ( 2021 ) . A framework for music similarity and cover song identification . Conference: 15th International Symposium on Computer Music Multidisciplinary Research (CMMR) ( Tokyo, Japan ) from: 15/11/2021 to: 19/11/2021 , 205 - 214 .
Liu L, Morfi V, Benetos E ( 2021 ) . ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription . Conference: Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference
Ozaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, T. Tierney A, Proutskova P, Fukatsu H et al. ( 2021 ) . Agreement among human and annotated transcriptions of global songs . Conference: 22nd International Society for Music Information Retrieval Conference (ISMIR) from: 09/11/2021 to: 12/11/2021 , 500 - 508 .
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S ( 2021 ) . Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes . Conference: 22nd International Society for Music Information Retrieval Conference (ISMIR) from: 09/11/2021 to: 12/11/2021 , 389 - 395 .
O'Hanlon K, Benetos E, Dixon S ( 2021 ) . Detecting Cover Songs with Pitch Class Key-Invariant Networks . Conference: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) vol. 00 , 1 - 6 .
Holzapfel A, Benetos E, Killick A, Widdess R ( 2021 ) . Humanities and engineering perspectives on music transcription . Digital Scholarship in the Humanities vol. 37 , ( 3 ) 747 - 764 .
Bear HL, Morfi V, Benetos E . An Evaluation of Data Augmentation Methods for Sound Scene Geotagging . Conference: Interspeech 2021581 - 585 .
Sarkar S, Benetos E, Sandler M ( 2021 ) . Vocal Harmony Separation using Time-domain Neural Networks . Conference: 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) ( Brno, Czech Republic ) from: 30/08/2021 to: 03/09/2021 , 3515 - 3519 .
Zhao Y, Wang C, Fazekas G, Benetos E, Sandler M ( 2021 ) . Violinist identification based on vibrato features . Conference: 2021 29th European Signal Processing Conference (EUSIPCO) vol. 00 , 381 - 385 .
Manco I, Benetos E, Quinton E ( 2021 ) . MusCaps: generating captions for music audio . Conference: International Joint Conference on Neural Networks (IJCNN) from: 18/07/2021 to: 22/07/2021 ,
Cheuk KW, Luo Y-J, Benetos E, Herremans D ( 2021 ) . Revisiting the onsets and frames model with additive attention . Conference: International Joint Conference on Neural Networks (IJCNN) from: 18/07/2021 to: 22/07/2021 ,
( 2021 ) . From Audio to Music Notation . Handbook of Artificial Intelligence for Music , Springer Nature
Ragano A, Benetos E, Hines A ( 2021 ) . More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations . Conference: 2021 13th International Conference on Quality of Multimedia Experience (QoMEX) vol. 00 , 103 - 108 .
Liu L, Morfi G-V, Benetos E ( 2021 ) . Joint multi-pitch detection and score transcription for polyphonic piano music . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Toronto, Canada ) from: 06/06/2021 to: 11/06/2021 ,
Singh S, Bear H, Benetos E ( 2021 ) . Prototypical Networks for Domain Adaptation in Acoustic Scene Classification . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Toronto, Canada ) from: 06/06/2021 to: 11/06/2021 ,
Subramanian V, Gururani S, Benetos E, Sandler M ( 2021 ) . Anomalous behaviour in loss-gradient based interpretability methods . Conference: RobustML workshop paper at ICLR 2021
Cheuk KW, Benetos E, Luo Y, Herremans D ( 2021 ) . The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy . Conference: 25th International Conference on Pattern Recognition (ICPR2020) ( Milan, Italy ) from: 10/01/2021 to: 15/01/2021 , 9091 - 9098 .
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S, Ohlsson P ( 2020 ) . Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation . IEEE Signal Processing Letters vol. 28 , 81 - 85 .
Liu L, Morfi G-V, Benetos E ( 2020 ) . Joint Piano-roll and Score Transcription for Polyphonic Piano Music . Conference: DMRN+15: Digital Music Research Network One-day Workshop ( London, UK ) from: 15/12/2020 to: 15/12/2020 ,
Chettri B, Benetos E, Sturm BLT ( 2020 ) . Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 28 , 3018 - 3028 .
Chettri B, Kinnunen T ( 2020 ) . Subband modeling for spoofing detection in automatic speaker verification . http://www.odyssey2020.org/ . Conference: Odyssey 2020: The Speaker and Language Recognition Workshop ( Tokyo, Japan ) from: 01/11/2020 to: 05/11/2020 , 341 - 348 .
Ragano A, Benetos E ( 2020 ) . Development of a Speech Quality Database Under Uncontrolled Conditions . Conference: 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) ( Shanghai, China ) from: 25/10/2020 to: 29/10/2020 ,
Pankajakshan A, Bear H, Benetos E ( 2020 ) . Memory Controlled Sequential Self Attention for Sound Recognition . Conference: 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) ( Shanghai, China ) from: 25/10/2020 to: 29/10/2020 ,
MISHRA S, Benetos E, Sturm B, Dixon S ( 2020 ) . Reliable Local Explanations for Machine Listening . Conference: International Joint Conference on Neural Networks (IJCNN) ( Glasgow, UK ) from: 19/07/2020 to: 24/07/2020 ,
Ycart A, Liu L, Benetos E, Pearce M ( 2020 ) . Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription . Transactions of the International Society for Music Information Retrieval vol. 3 , ( 1 ) 68 - 81 .
Ragano A, Benetos E ( 2020 ) . Audio impairment recognition using a correlation-based feature representation . http://qomex2020.ie/ . Conference: 12th International Conference on Quality of Multimedia Experience (QoMEX) ( Athlone, Ireland ) from: 26/05/2020 to: 28/05/2020 ,
SUBRAMANIAN V, Pankajakshan A, Benetos E, Xu N, McDonald S, Sandler M ( 2020 ) . A Study on the Transferability of Adversarial Attacks in Sound Event Classification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 301 - 305 .
Wei W, Zhu H, Benetos E ( 2020 ) . A-CRNN: a domain adaptation model for sound event detection . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 276 - 280 .
Martinez Ramirez M, Benetos E, Reiss J ( 2020 ) . Modeling plate and spring reverberation using a DSP-informed deep neural network . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 241 - 245 .
Wang C, Lostanlen V, Benetos E ( 2020 ) . Playing Technique Recognition by Joint Time–Frequency Scattering . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 881 - 885 .
Ycart A, Liu L, Benetos E ( 2020 ) . Musical Features for Automatic Music Transcription Evaluation .
Ycart A, Benetos E ( 2020 ) . Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 28 , ( 1 ) 1328 - 1341 .
Chettri B, Kinnunen T ( 2020 ) . Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification . Computer Speech and Language vol. 63 , Article 101092 ,
Martinez Ramirez M, Benetos E, Reiss J ( 2020 ) . Deep Learning for Black-Box Modeling of Audio Effects . Applied Sciences vol. 10 , ( 2 ) Article 638 ,
Liu L, Benetos E ( 2019 ) . Automatic Music Accompaniment with a Chroma-based Music Data Representation . Conference: DMRN+14: Digital Music Research Network One-day Workshop
Ycart A, Stoller D ( 2019 ) . A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 470 - 477 .
Wang C, Benetos E, Lostanlen V ( 2019 ) . Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals . Conference: International Society for Music Information Retrieval Conference ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 809 - 815 .
Holzapfel A ( 2019 ) . Automatic music transcription and ethnomusicology: a user study . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 678 - 684 .
Ycart A, McLeod A, Benetos E ( 2019 ) . Blending acoustic and language model predictions for automatic music transcription . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 454 - 461 .
Wang C, Benetos E ( 2019 ) . CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis . Conference: International Society for Music Information Retrieval Conference Late-Breaking Demo Session ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 ,
Singh S, Pankajakshan A ( 2019 ) . Audio tagging using a linear noise modelling layer . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 234 - 238 .
Pankajakshan A, Benetos E ( 2019 ) . Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 174 - 178 .
SUBRAMANIAN V, Benetos E, Sandler M ( 2019 ) . Robustness of Adversarial Attacks in Sound Event Classification . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 239 - 243 .
Bear H, Heittola T, Mesaros A, Virtanen T ( 2019 ) . City classification from multiple real-world sound scenes . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 11 - 15 .
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S ( 2019 ) . Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 40 - 44 .
Pankajakshan A, Bear H ( 2019 ) . Polyphonic sound event and sound activity detection: a multi-task approach . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 318 - 322 .
Chettri B, Stoller D, Morfi V, Martinez Ramirez M ( 2019 ) . Ensemble Models for Spoofing Detection in Automatic Speaker Verification . Conference: 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ( Graz, Austria ) from: 15/07/2019 to: 19/09/2019 , 1018 - 1022 .
Bear H, Nolasco I ( 2019 ) . Towards joint sound scene and polyphonic sound event recognition . Conference: 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ( Graz, Austria ) from: 15/09/2019 to: 19/09/2019 , 4594 - 4598 .
Martinez Ramirez M, Benetos E, Reiss J ( 2019 ) . A general-purpose deep learning approach to model time-varying audio effects . Conference: International Conference on Digital Audio Effects (DAFx-19) ( Birmingham, UK ) from: 02/09/2019 to: 06/09/2019 ,
Zhou Q, Feng Z ( 2019 ) . Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF . Sensors vol. 19 , ( 14 ) Article 3206 ,
Subramanian V, Benetos E, Xu N, McDonald S, Sandler MB ( 2019 ) . Adversarial Attacks in Sound Event Classification .
Covas E ( 2019 ) . Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting . Chaos vol. 29 , ( 6 ) Article 063111 ,
Ragano A, BENETOS E ( 2019 ) . Adapting the Quality of Experience Framework for Audio Archive Evaluation . https://www.qomex2019.de/ . Conference: 11th International Conference on Quality of Multimedia Experience ( Berlin, Germany ) from: 05/06/2019 to: 07/06/2019 ,
WANG C, BENETOS E, MENG X ( 2019 ) . HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute . Proceedings of Sound and Music Computing Conference . Conference: Sound and Music Computing Conference ( Malaga, Spain ) from: 28/05/2019 to: 31/05/2019 , 545 - 550 .
Lins F, Johann M, BENETOS E ( 2019 ) . Automatic Transcription of Diatonic Harmonica Recordings . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,
Phaye SSR, BENETOS E, Wang Y ( 2019 ) . SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,
MISHRA S, STOLLER D, BENETOS E, STURM B, DIXON S ( 2019 ) . GAN-based Generation and Automatic Selection of Explanations for Neural Networks . https://sites.google.com/view/safeml-iclr2019 . Conference: SafeML ICLR 2019 Workshop ( New Orleans, USA ) from: 06/05/2019 to: 06/05/2019 ,
Nolasco I, Terenzi A, Cecchi S, Orcioni S ( 2019 ) . Audio-based identification of beehive states . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,
BENETOS E, DIXON S, Duan Z, EWERT S ( 2019 ) . Automatic Music Transcription: An Overview . IEEE Signal Processing Magazine vol. 36 , ( 1 ) 20 - 30 .
CHETTRI B, MISHRA S, STURM B, BENETOS E ( 2018 ) . Analysing the predictions of a CNN-based replay spoofing detection system . http://www.slt2018.org/ . Conference: 2018 IEEE Workshop on Spoken Language Technology ( Athens, Greece ) from: 18/12/2018 to: 21/12/2018 , 92 - 97 .
BEAR H ( 2018 ) . An extensible cluster-graph taxonomy for open set sound scene analysis . http://dcase.community/workshop2018/ . Conference: Workshop on Detection and Classification of Acoustic Scenes and Events ( Surrey, UK ) from: 19/11/2018 to: 20/11/2018 ,
Nolasco I, BENETOS E ( 2018 ) . To bee or not to bee: Investigating machine learning approaches for beehive sound recognition . http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Nolasco_131.pdf . Conference: 2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018) ( Surrey, UK ) from: 19/11/2018 to: 20/11/2018 ,
YCART A ( 2018 ) . A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations . Conference: 19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session ( Paris ) from: 23/09/2018 to: 27/09/2018 ,
WANG C, BENETOS E, MENG X ( 2018 ) . Towards HMM-based glissando detection for recordings of Chinese bamboo flute . http://ismir2018.ircam.fr/pages/events-lbd.html . Conference: International Society for Music Information Retrieval Conference Late-Breaking Demos Session ( Paris, France ) from: 23/09/2018 to: 27/09/2018 ,
CHETTRI B, STURM BLT, BENETOS E ( 2018 ) . Analysing replay spoofing countermeasure performance under varied conditions . Conference: IEEE International Workshop on Machine Learning for Signal Processing ( Aalborg, Denmark ) from: 17/09/2018 to: 20/09/2018 ,
Ali H, Tran SN, d'Avila Garcez AS ( 2018 ) . Speaker recognition with hybrid features from a deep belief network . Neural Computing and Applications vol. 29 , ( 6 ) 13 - 19 .
Chettri B, Mishra S, Sturm BL ( 2018 ) . A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing .
YCART A ( 2018 ) . Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Calgary, Canada ) from: 15/04/2018 to: 20/04/2018 , 386 - 390 .
Nakamura E, BENETOS E, Yoshii K, DIXON S ( 2018 ) . Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Calgary, Canada ) from: 15/04/2018 to: 20/04/2018 , 101 - 105 .
Valero-Mas JJ, BENETOS E, Iñesta JM ( 2018 ) . A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription . Journal of New Music Research vol. 47 , ( 3 ) 249 - 263 .
Mesaros A, Heittola T, Benetos E, Foster P, Lagrange M ( 2018 ) . Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 26 , ( 2 ) 379 - 393 .
PANTELI M, BENETOS E, DIXON S ( 2018 ) . A review of manual and computational approaches for the study of world music corpora . Journal of New Music Research vol. 47 , ( 2 ) 176 - 189 .
BENETOS E, STOWELL D, PLUMBLEY M, Virtanen T, PLUMBLEY M, Ellis D ( 2018 ) . Approaches to complex sound scene analysis . Computational Analysis of Sound Scenes and Events , Edition. 1 , Springer International Publishing
( 2018 ) . Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018 . ISMIR .
PANTELI M, BENETOS E, DIXON S ( 2017 ) . A computational study on outliers in world music . PLoS ONE vol. 12 , ( 12 ) Article e0189399 , 1 - 28 .
McLeod A, Steedman M, BENETOS E ( 2017 ) . Automatic Transcription of Polyphonic Vocal Music . Applied Sciences vol. 7 , ( 12 ) Article 1285 ,
Ycart A, Benetos E ( 2017 ) . A study on LSTM networks for polyphonic music sequence modelling . Conference: 18th International Society for Music Information Retrieval Conference (ISMIR 2017) ( Suzhou, China ) from: 23/10/2017 to: 27/10/2017 , 421 - 427 .
Schramm R, McLeod A, Benetos E ( 2017 ) . Multi-pitch detection and voice assignment for a cappella recordings of multiple singers . Conference: 18th International Society for Music Information Retrieval Conference (ISMIR 2017) ( Suzhou, China ) from: 23/10/2017 to: 27/10/2017 , 552 - 559 .
Lafay G, Lagrange M ( 2017 ) . Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017) ( New Paltz, NY, USA ) from: 18/10/2017 to: 15/10/2017 , 11 - 15 .
YCART A, BENETOS E ( 2017 ) . Neural Music Language Models: investigating the training process . Conference: International Conference of Students of Systematic Musicology
Valero-Mas JJ, Benetos E ( 2017 ) . Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,
Schramm R ( 2017 ) . Automatic Transcription of a Cappella Recordings from Multiple Singers . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,
Benetos E ( 2017 ) . Polyphonic note and instrument tracking using linear dynamical systems . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,
Stowell D, Benetos E, Gill LF ( 2017 ) . On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts . IEEE/ACM Trans. Audio, Speech & Language Processing vol. 25 , ( 6 ) 1193 - 1206 .
Stowell D, Benetos E, Gill LF ( 2017 ) . On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 25 , ( 6 ) 1193 - 1206 .
Benetos E, Lafay G, Plumbley MD ( 2017 ) . Polyphonic Sound Event Tracking using Linear Dynamical Systems . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 25 , ( 6 ) 1266 - 1277 .
Russell AJ, Benetos E ( 2017 ) . On the Memory Properties of Recurrent Neural Models . Conference: International Joint Conference on Neural Networks (IJCNN 2017) ( Anchorage, Alaska, USA ) from: 19/05/2017 to: 14/05/2017 , 2596 - 2603 .
Abdallah S, Benetos E, Gold N, Hargreaves S ( 2017 ) . The Digital Music Lab: A Big Data Infrastructure for Digital Musicology . ACM Journal on Computing and Cultural Heritage vol. 10 , ( 1 )
BENETOS E ( 2016 ) . Automatic Transcription of Vocal Quartets . DMRN+11: Digital Music Research Network Workshop Proceedings 2016 . Conference: DMRN+11: Digital Music Research Network One-day Workshop 2016 ( Centre for Digital Music, Queen Mary University of London ) from: 20/12/2016 to: 20/12/2016 ,
YCART A, Benetos E ( 2016 ) . Towards a Music Language Model for Audio Analysis . DMRN+11: Digital Music Research Network Workshop Proceedings 2016 . Conference: DMRN+11: Digital Music Research Network One-day Workshop 2016 ( Centre for Digital Music, Queen Mary University of London ) from: 20/12/2016 to: 20/12/2016 ,
Valero-Mas JJ, Benetos E ( 2016 ) . Classification-based Note Tracking for Automatic Music Transcription . https://sites.google.com/site/musicmachinelearning16/proceedings . Conference: 9th International Workshop on Machine Learning and Music ( Riva del Garda, Italy ) from: 23/09/2016 to: 23/09/2016 , 61 - 65 .
Abdallah S, Gold N, Hargreaves S, Weyde T, Wolff D ( 2016 ) . Digital Music Lab: A Framework for Analysing Big Music Data . Conference: 24th European Signal Processing Conference ( Budapest, Hungary ) from: 29/08/2016 to: 02/09/2016 , 1118 - 1122 .
Cheng T, Mauch M, Benetos E, Dixon S ( 2016 ) . An attack/decay model for piano transcription . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 584 - 590 .
Panteli M, Benetos E, Dixon S ( 2016 ) . Learning a feature space for similarity in world music . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 538 - 544 .
Holzapfel A, Benetos E ( 2016 ) . The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 531 - 537 .
Lafay G, Lagrange M, Rossignol M, Benetos E ( 2016 ) . A morphological model for simulating acoustic scenes and its application to sound event detection . IEEE/ACM Transactions on Audio, Speech, and Language Processing vol. 24 , ( 10 ) 1854 - 1864 .
Panteli M, Benetos E, Dixon S ( 2016 ) . Automatic detection of outliers in world music collections . Conference: Fourth International Conference on Analytical Approaches to World Music (AAWM 2016) ( New York, USA ) from: 11/06/2016 to: 08/06/2016 ,
Benetos E, Lafay G, Lagrange M, Plumbley MD ( 2016 ) . Detection of Overlapping Acoustic Events Using a Temporally-Constrained Probabilistic Model . Conference: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)6450 - 6454 .
Sigtia S, Benetos E, Dixon S ( 2016 ) . An End-to-End Neural Network for Polyphonic Piano Music Transcription . IEEE/ACM Transactions on Audio, Speech, and Language Processing vol. 24 , ( 5 ) 927 - 939 .
Benetos E ( 2015 ) . An efficient temporally-constrained probabilistic model for multiple-instrument music transcription . http://ismir2015.uma.es/docs/ISMIR2015_Proceedings.pdf . Conference: 16th International Society for Music Information Retrieval Conference (ISMIR) ( Malaga, Spain ) from: 26/10/2015 to: 30/10/2015 , 701 - 707 .
BENETOS E, Holzapfel A ( 2015 ) . Automatic transcription of Turkish microtonal music . Journal of the Acoustical Society of America vol. 138 , ( 4 ) 2118 - 2130 .
Stowell D, Giannoulis D, Benetos E, Lagrange M, Plumbley MD ( 2015 ) . Detection and Classification of Acoustic Scenes and Events . IEEE Transactions on Multimedia vol. 17 , ( 10 ) 1733 - 1746 .
Rossignol M, Lagrange M, Lafay G ( 2015 ) . Alternate level clustering for drum transcription . Conference: 23rd European Signal Processing Conference (EUSIPCO) ( Nice, France ) from: 04/09/2015 to: 31/08/2015 , 2068 - 2072 .
Abdallah S, Alencar-Brayner A, BENETOS E, Cottrell S, Dykes J, Gold N, Kachkaev A, Tidhar D ( 2015 ) . Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection . http://fma2015.sciencesconf.org/conference/fma2015/FMA2015_OfficialProceedings.pdf . Conference: 5th International Workshop on Folk Music Analysis ( Paris, France ) from: 10/06/2015 to: 12/06/2015 , 10 - 12 .
Sigtia S, Benetos E, Boulanger-Lewandowski N, Weyde T, Garcez ASDA, Dixon S ( 2015 ) . A Hybrid Recurrent Neural Network for Music Transcription . IEEE International Conference on Acoustics Speech and Signal Processing . Conference: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ( Brisbane, Australia ) from: 19/04/2015 to: 24/04/2015 , 2061 - 2065 .
Benetos E, Badeau R, Weyde T ( 2014 ) . Template Adaptation for Improving Automatic Music Transcription . http://www.terasoft.com.tw/conf/ismir2014//proceedings%5CISMIR2014_Proceedings.pdf . Conference: 15th International Society for Music Information Retrieval Conference (ISMIR) ( Taipei, Taiwan ) from: 27/10/2014 to: 31/10/2014 , 175 - 180 .
Tidhar D, Dixon S, Benetos E, Weyde T ( 2014 ) . The temperament police . Early Music vol. 42 , ( 4 ) 579 - 590 .
Weyde T, Cottrell S, Dykes J, Benetos E, Wolff D, Tidhar D, Gold N, Abdallah S et al. ( 2014 ) . Big Data for Musicology . Conference: 1st International Digital Libraries for Musicology workshop ( London, UK ) from: 12/09/2014 to: 12/09/2014 ,
Wolff D, Tidhar D, Benetos E, Dumon E, Cherla S, Page K, Fields B ( 2014 ) . Incremental dataset definition for large scale musicological research . Conference: 1st International Digital Libraries for Musicology workshop ( London, UK ) from: 12/09/2014 to: 12/09/2014 ,
Tran S, Benetos E, d Avila Garcez A ( 2014 ) . Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition . Conference: 2014 International Joint Conference on Neural Networks (IJCNN) ( Beijing, China ) from: 06/07/2014 to: 11/07/2014 , 2123 - 2129 .
Benetos E, Holzapfel A, Holzapfel A ( 2014 ) . Incorporating pitch class profiles for improving automatic transcription of Turkish makam music . Proceedings of the Fourth International Workshop on Folk Music Analysis (FM . Conference: 4th International Workshop on Folk Music Analysis ( Istanbul, Turkey ) from: 12/06/2014 to: 13/06/2014 , 15 - 20 .
Giannoulis D, Benetos E, Klapuri A ( 2014 ) . Improving instrument recognition in polyphonic music through system integration . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Florence, Italy ) from: 04/05/2014 to: 09/05/2014 , 5259 - 5263 .
Benetos E, Weyde T ( 2014 ) . Improving automatic music transcription through key detection . http://www.aes.org/conferences/53/technical_programme.cfm . Conference: AES 53rd International Conference on Semantic Audio ( London, UK ) from: 27/01/2014 to: 29/01/2014 ,
Benetos E, Ewert S, Weyde T ( 2014 ) . Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music . Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) . 3131 - 3135 .
Sigtia S, Benetos E, Cherla S, Weyde T, Garcez A, Dixon S ( 2014 ) . RNN-based Music Language Models for Improving Automatic Music Transcription . 15th International Society for Music Information Retrieval Conference . 53 - 58 .
BARTHET M, Benetos E, Cottrell S, Dixon S, Dykes J, Gold N, Mahey M, Plumbley MD et al. ( 2014 ) . The DML Research Project: Digital Music Lab - Analysing Big Music Data . Presented at: Workshop on "Big Data: Challenges and Applications", Imperial College, London ,
Benetos E, Holzapfel A ( 2013 ) . Automatic transcription of Turkish makam music . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 355 - 360 .
Benetos E, Weyde T ( 2013 ) . Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 269 - 274 .
de Valk R, Weyde T, Britto AS, Gouyon F, Dixon S ( 2013 ) . A machine learning approach to voice separation in lute tablature . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 555 - 560 .
Giannoulis D, Benetos E, Stowell D, Rossignol M, Lagrange M, Plumbley MD ( 2013 ) . DETECTION AND CLASSIFICATION OF ACOUSTIC SCENES AND EVENTS: AN IEEE AASP CHALLENGE . Conference: 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics1 - 4 .
Giannoulis D, Stowell D, Benetos E, Rossignol M, Lagrange M, Plumbley MD ( 2013 ) . A database and challenge for acoustic scene classification and event detection . Conference: 21st European Signal Processing Conference ( Marrakech, Morocco )
Benetos E, Cherla S ( 2013 ) . An efficient shift-invariant model for polyphonic music transcription . Conference: 6th International Workshop on Machine Learning and Music ( Prague, Czech Republic )
Benetos E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A ( 2013 ) . Automatic music transcription: challenges and future directions . Journal of Intelligent Information Systems vol. 41 , ( 3 ) 407 - 434 .
Serra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gomez E, Gouyon F et al. ( 2013 ) . Roadmap for Music Information ReSearch . The MIReS Consortium
Benetos E, Dixon S ( 2013 ) . Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model . The Journal of the Acoustical Society of America vol. 133 , ( 3 ) 1727 - 1741 .
Benetos E, Dixon S ( 2012 ) . A Shift-Invariant Latent Variable Model for Automatic Music Transcription . Computer Music Journal vol. 36 , ( 4 ) 81 - 94 .
BENETOS E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A ( 2012 ) . Automatic Music Transcription: Breaking the Glass Ceiling . Conference: 13th International Society for Music Information Retrieval Conference (ISMIR 2012) ( Porto, Portugal ) from: 08/10/2012 to: 12/10/2012 , 379 - 384 .
Zijlstra A, Mancini M, Lindemann U, Chiari L, Zijlstra W ( 2012 ) . Sit-stand and stand-sit transitions in older adults and patients with Parkinson’s disease: event detection based on motion sensors versus force plates . Journal of NeuroEngineering and Rehabilitation vol. 9 , ( 1 )
Benetos E, Lagrange M, Dixon S ( 2012 ) . Characterisation of acoustic scenes using a temporally-constrained shift-invariant model . 15th International Conference on Digital Audio Effects, DAFx 2012 Proceedings .
Benetos E, Klapuri A, Dixon S ( 2012 ) . Score-informed transcription for automatic piano tutoring . Conference: 20th European Signal Processing Conference ( Bucharest, Romania ) 2153 - 2157 .
Benetos E, Dixon S ( 2012 ) . Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection . Lecture Notes in Computer Science . vol. 7191 , 364 - 371 .
Benetos E, Dixon S ( 2011 ) . A TEMPORALLY-CONSTRAINED CONVOLUTIVE PROBABILISTIC MODEL FOR PITCH DETECTION . Conference: 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)133 - 136 .
Benetos E, Dixon S ( 2011 ) . Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription . IEEE Journal of Selected Topics in Signal Processing vol. 5 , ( 6 ) 1111 - 1123 .
Mearns L, Benetos E, Dixon S ( 2011 ) . Automatically detecting key modulations in J.S. Bach chorale recordings . 8th Sound and Music Computing Conference . 25 - 32 .
Benetos E, Dixon S ( 2011 ) . Multiple-instrument polyphonic music transcription using a convolutive probabilistic model . Conference: 8th Sound and Music Computing Conference ( Padova, Italy ) from: 06/07/2011 to: 09/07/2011 , 19 - 24 .
Benetos E, Dixon S ( 2011 ) . Polyphonic Music Transcription Using Note Onset and Offset Detection . Conference: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)37 - 40 .
Dixon S, Tidhar D, Benetos E ( 2011 ) . The temperament police: The truth, the ground truth, and nothing but the truth . Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011 . Conference: 12th International Society for Music Information Retrieval Conference ( Miami, Florida, USA ) from: 24/10/2011 to: 28/10/2011 , 281 - 286 .
Anglade A, Benetos E, Mauch M, Dixon S ( 2010 ) . Improving Music Genre Classification Using Automatically Induced Harmony Rules . Journal of New Music Research vol. 39 , ( 4 ) 349 - 361 .
Benetos E, Dixon S ( 2010 ) . Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution . ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition . 13 - 18 .
Benetos E, Stylianou Y ( 2010 ) . Auditory Spectrum-Based Pitched Instrument Onset Detection . IEEE Transactions on Audio Speech and Language Processing vol. 18 , ( 8 ) 1968 - 1977 .
Benetos E, Kotropoulos C ( 2010 ) . Non-Negative Tensor Factorization Applied to Music Genre Classification . IEEE Transactions on Audio Speech and Language Processing vol. 18 , ( 8 ) 1955 - 1967 .
Benetos E, Holzapfel A ( 2009 ) . Pitched instrument onset detection based on auditory spectra . Proceedings of the 10th International Society for Music Information Retrieval Conference, ISMIR 2009 . 105 - 110 .
Benetos E, Kotropoulos C ( 2008 ) . A tensor-based approach for automatic music genre classification . 16th European Signal Processing Conference .
Spachos D, Zlantintsi A, Moschou V, Antonopoulos P, Benetos E, Kotti M, Tzimouli K, Kotropoulos C et al. ( 2008 ) . MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection . 6th Language Resources and Evaluation Conference . 16 - 19 .
BENETOS E, Siatras S, Kotropoulos C, Nikolaidis N ( 2008 ) . Movie analysis with emphasis to dialogue and action scene detection . Multimodal Processing and Interaction , vol. 33 , Springer
Panagakis I, Benetos E, Kotropoulos C, Bello JP, Chew E, Turnbull D ( 2008 ) . Music Genre Classification: A Multilinear Approach . ISMIR . 583 - 588 .
Kotti M, Benetos E, Kotropoulos C, Pitas I ( 2007 ) . A neural network approach to audio-assisted movie dialogue detection . Neurocomputing vol. 71 , ( 1-3 ) 157 - 166 .
Moschou V, Kotti M, Benetos E, Kotropoulos C ( 2007 ) . Systematic comparison of BIC-based speaker segmentation systems . Conference: 2007 IEEE 9th Workshop on Multimedia Signal Processing66 - 69 .
Kotti M, Benetos E ( 2007 ) . Neural network-based movie dialogue detection . 10th International Conference on Engineering Applications of Neural Networks .
Benetos E, Kotti M, Kotropoulos C ( 2007 ) . Large scale musical instrument identification . 4th Sound and Music Computing Conference . 283 - 286 .
Benetos E, Kotti M, Kotropoulos C ( 2006 ) . Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification . 2006 IEEE International Conference on Multimedia and Expo . Conference: 2006 IEEE International Conference on Multimedia and Expo2105 - 2108 .
Kotti M, Martins LGPM, Benetos E, Cardoso JS, Kotropoulos C ( 2006 ) . Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches . 2006 IEEE International Conference on Multimedia and Expo . Conference: 2006 IEEE International Conference on Multimedia and Expo1101 - 1104 .
Kotti M, Benetos E, Kotropoulos C ( 2006 ) . Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme . 2005 IEEE International Symposium on Circuits and Systems (ISCAS) . Conference: 2006 IEEE International Symposium on Circuits and Systems4 - pp. .
Benetos E, Kotti M, Kotropoulos C ( 2006 ) . Musical instrument classification using non-negative matrix factorization algorithms . 2005 IEEE International Symposium on Circuits and Systems (ISCAS) . Conference: 2006 IEEE International Symposium on Circuits and Systems4 - pp. .
Benetos E, Kotti M ( 2006 ) . Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings . vol. 5 ,
Benetos E, Kotropoulos C, Lidy T ( 2006 ) . Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification . European Signal Processing Conference .
Benetos E, Kotti M, Kotropoulos C, Burred JJ, Eisenberg G, Sikora T ( 2005 ) . Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification . 2nd Workshop On Immersive Communication And Broadcast Systems .
Liang J, Benetos E, Phan H . Adapting Language-Audio Models as Few-Shot Audio Learners . Conference: INTERSPEECH 2023
Savage PE, Ampiah-Bonney A, Arabadjiev A, Arhine A, Ariza JF, Bamford JS, Barbosa BS, Beck A-K et al. . Does synchronised singing enhance social bonding more than speaking does? A global experimental Stage 1 Registered Report .
. From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems . Conference: 35th IEEE International Workshop on Machine Learning for Signal Processing
de Fleurian R, Clemente A, Benetos E, Pearce MT . Melodic expectation as an elicitor of music-evoked chills .
Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al. . MuPT: A Generative Symbolic Music Pretrained Transformer . Conference: The Thirteenth International Conference on Learning Representations ( Singapore ) from: 23/04/2025 to: 28/04/2025 ,