Publications: Emmanouil Benetos

Loth J, Riley J, Dixon S, Benetos E ( 2026 ) . Velocity Prediction in Automatic Guitar Transcription . Conference: 34th European Signal Processing Conference (EUSIPCO) ( Bruges, Belgium ) from: 31/08/2026 to: 04/09/2026 ,

Ma Y, Xia H, Gao H, Chen W, Ye Y, Yang Y, Chang S, Ding M et al. ( 2026 ) . CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction . Conference: 43rd International Conference on Machine Learning (ICML) ( Seoul, South Korea ) from: 06/07/2026 to: 11/07/2026 ,

10.48550/arxiv.2603.00610

Moummad I, Serizel R, Benetos E, Farrugia N ( 2026 ) . Domain-invariant representation learning of bird sounds . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Barcelona, Spain ) from: 04/05/2026 to: 08/05/2026 ,

10.1109/ICASSP55912.2026.11463533

https://qmro.qmul.ac.uk/xmlui/handle/123456789/119071

Bhattacharjee A, Pasini M, Benetos E ( 2026 ) . Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Barcelona, Spain ) from: 04/05/2026 to: 08/05/2026 ,

10.1109/ICASSP55912.2026.11460511

https://qmro.qmul.ac.uk/xmlui/handle/123456789/119035

Li C, Chen Y, Ji Y, Xu J, Cui Z, Li S, Zhang Y, Wang W et al. ( 2026 ) . OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,

10.48550/arxiv.2510.10689

https://qmro.qmul.ac.uk/xmlui/handle/123456789/125032

Mitcheltree C, Lostanlen V, Benetos E, Lagrange M ( 2026 ) . SCRAPL: Scattering Transform with Random Paths for Machine Learning . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,

10.48550/arXiv.2602.11145

https://qmro.qmul.ac.uk/xmlui/handle/123456789/124159

Yuan R, Lin H, Guo S, Zhang G, Pan J, Zang Y, Liu H, Liang Y et al. ( 2026 ) . YuE: Scaling Open Foundation Models for Long-Form Music Generation . Conference: 14th International Conference on Learning Representations (ICLR) ( Rio de Janeiro, Brazil ) from: 23/04/2026 to: 27/04/2026 ,

10.48550/arxiv.2503.08638

https://qmro.qmul.ac.uk/xmlui/handle/123456789/125031

Kommers C, Ahnert R, Antoniak M, Benetos E, Benford S, Bunz M, Caramiaux B, Concannon S et al. ( 2026 ) . Computational hermeneutics: evaluating generative AI as a cultural technology . Frontiers in Artificial Intelligence vol. 9 , Article 1753041 ,

10.3389/frai.2026.1753041

https://qmro.qmul.ac.uk/xmlui/handle/123456789/123012

Tang X, Lei X, Zhu C, Chen S, Yuan R, Li Y, Oh C, Zhang G et al. ( 2025 ) . AutoMV: An Automatic Multi-Agent System for Music Video Generation .

10.48550/arxiv.2512.12196

Ma Z, Ma Y, Zhu Y, Yang C, Chao Y-W, Xu R, Chen W, Chen Y et al. ( 2025 ) . MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix . Conference: The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) from: 02/12/2025 to: 07/12/2025 ,

10.48550/arxiv.2505.13032

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113995

Li Y, Ma Y, Ma Y, Yuan R, Zhu K, Guo H, Liang Y, Liu J et al. ( 2025 ) . OmniBench: Towards The Future of Universal Omni-Language Models . Conference: The Thirty-Ninth Annual Conference on Neural Information Processing Systems. (NeurIPS 2025) from: 02/12/2025 to: 07/12/2025 ,

10.48550/arxiv.2409.15272

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113994

Kim H, Benetos E, Serra X ( 2025 ) . Velocity2DMs: A contextual modeling approach to dynamics marking prediction in piano performance . IEEE Signal Processing Letters vol. 32 ,

10.1109/LSP.2025.3633579

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113699

Chang SK, Dixon S, Benetos E ( 2025 ) . RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection . Conference: 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2025) ( Granlibakken Tahoe, Toahoe City, CA ) from: 12/10/2025 to: 15/10/2025 ,

10.1109/WASPAA66052.2025.11230990

https://qmro.qmul.ac.uk/xmlui/handle/123456789/108341

Ma Y, Li S, Yu J, Benetos E, Maezawa A ( 2025 ) . CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107957

Sarkar S, Moomjian V, Woods B, Benetos E, Sandler M ( 2025 ) . Perceptual errors in music source separation: looking beyond SDR averages . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107959

Bhattacharjee A, Meresman Higgs I, Sandler M, Benetos E ( 2025 ) . Refining music sample identification with a self-supervised graph neural network . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR 2025) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,

10.48550/arxiv.2506.14684

https://qmro.qmul.ac.uk/xmlui/handle/123456789/108305

Papaioannou C, Benetos E, Potamianos A ( 2025 ) . Universal Music Representations? Evaluating Foundation Models on World Music Corpora . Conference: 26th International Society for Music Information Retrieval Conference (ISMIR) ( Daejeon, Korea ) from: 21/09/2025 to: 25/09/2025 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107958

Zhang H, Liang J, Phan QH, Wang W, Benetos E ( 2025 ) . From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems . Conference: IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2025) ( Istanbul, Turkey ) from: 31/08/2025 to: 03/09/2025 ,

10.1109/MLSP62443.2025.11204254

https://qmro.qmul.ac.uk/xmlui/handle/123456789/108302

Huang J, Sousa F, Demirel E, Benetos E, Gadelha I ( 2025 ) . Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss . Conference: Interspeech 2025 ( Rotterdam, The Netherlands ) from: 21/08/2025 to: 17/08/2025 ,

10.21437/Interspeech.2025-311

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107684

Plachouras C, Guinot J, Fazekas G, Quinton E, Benetos E, Pauwels J ( 2025 ) . Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks . Conference: International Joint Conference on Neural Networks (IJCNN) ( Rome, Italy ) from: 30/06/2025 to: 05/07/2025 ,

10.1109/IJCNN64981.2025.11229333

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106803

Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al. ( 2025 ) . MuPT: A Generative Symbolic Music Pretrained Transformer . https://openreview.net/forum?id=iAK9oHp4Zz . Conference: International Conference on Learning Representations (ICLR) ( Singapore ) from: 24/04/2025 to: 28/04/2025 ,

10.48550/arxiv.2404.06393

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106408

Peeters G, Rafii Z, Fuentes M, Duan Z, Benetos E ( 2025 ) . Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges . Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) vol. 00 , 1 - 5 .

10.1109/icassp49660.2025.10888947

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104350

De Almeida Nolasco IS, Stowell D, Benetos E ( 2025 ) . Acoustic identification of individual animals based on hierarchical contrastive learning . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,

10.1109/ICASSP49660.2025.10890076

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104358

Singh S, Bhattacharjee A, Benetos E ( 2025 ) . GraFPrint: A GNN-Based Approach for Audio Identification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,

10.1109/ICASSP49660.2025.10888557

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104700

Singh S, Benetos E, Phan H, Stowell D ( 2025 ) . LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,

10.1109/ICASSP49660.2025.10890467

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104699

Plachouras C, Benetos E, Pauwels J ( 2025 ) . Learning Music Audio Representations With Limited Data . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Hyderabad, India ) from: 06/04/2025 to: 11/04/2025 ,

10.1109/ICASSP49660.2025.10887766

https://qmro.qmul.ac.uk/xmlui/handle/123456789/103944

Huang J ( 2025 ) . Singing to speech conversion with generative flow . EURASIP Journal on Audio, Speech, and Music Processing vol. 2025 , Article 12 ,

10.1186/s13636-025-00400-x

https://qmro.qmul.ac.uk/xmlui/handle/123456789/105344

Liang J, Liu X, Wang W, Plumbley M, Phan H, Benetos E ( 2025 ) . Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities . IEEE Transactions on Audio, Speech and Language Processing vol. 33 , 949 - 961 .

10.1109/TASLPRO.2025.3533375

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104086

Papaioannou C, Benetos E ( 2025 ) . LC-Protonets: Multi-label Few-shot learning for world music audio tagging . IEEE Open Journal of Signal Processing vol. 6 , 138 - 146 .

10.1109/OJSP.2025.3529315

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104351

Elisha S, McDowell A, Beguerisse-Díaz M ( 2024 ) . Classification of spontaneous and scripted speech for multilingual audio . Conference: IEEE Spoken Language Technology Workshop 2024 ( Macao, China ) from: 02/12/2024 to: 05/12/2024 , 489 - 495 .

10.1109/SLT61566.2024.10832309

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102020

Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Xue W ( 2024 ) . Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Franscisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,

10.48550/arxiv.2407.21531

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98625

Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J et al. ( 2024 ) . ComposerX: Multi-Agent Symbolic Music Composition with LLMs . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR), ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,

10.48550/arxiv.2404.18081

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98627

Weck B, Manco I, Benetos E, QUINTON E ( 2024 ) . MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,

10.48550/arxiv.2408.01337

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98705

Steinmetz C, Singh S, Comunit� M, Ibnyahya I, Yuan S, Benetos E, Reiss J ( 2024 ) . ST-ITO: Controlling audio effects for style transfer with inference-time optimization . Conference: 25th International Society for Music Information Retrieval Conference (ISMIR) ( San Francisco, CA, USA ) from: 10/11/2024 to: 14/11/2024 ,

10.48550/arxiv.2410.21233

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98593

Chang SK, Benetos E, KIRCHHOFF H, Dixon S ( 2024 ) . ˜YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation . Conference: IEEE International Workshop on Machine Learning for Signal Processing (MLSP) ( London, UK ) from: 25/09/2024 to: 22/09/2024 ,

10.1109/MLSP58920.2024.10734819

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98710

Torrisi A, De Almeida Nolasco IS, Versace E, Benetos E ( 2024 ) . Exploratory analysis of early-life chick calls . Conference: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) ( Kos, Greece ) from: 06/09/2024 to: 06/09/2024 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98700

Ma Y, Øland A, Ragni A, Del Sette BM, Saitis C, Donahue C, Lin C, Plachouras C et al. ( 2024 ) . Foundation Models for Music: A Survey .

10.48550/arxiv.2408.14340

Liang J, Nolasco I, Ghani B, Phan H, Benetos E, Stowell D ( 2024 ) . Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection . Conference: 32nd European Signal Processing Conference (EUSIPCO 2024) ( Lyon, France ) from: 26/08/2024 to: 30/08/2024 ,

10.23919/EUSIPCO63174.2024.10714948

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97744

Huang J, Benetos E ( 2024 ) . Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model . Conference: 32nd European Signal Processing Conference (EUSIPCO) ( Lyon, France ) from: 26/08/2024 to: 30/08/2024 ,

10.23919/EUSIPCO63174.2024.10715045

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97337

Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y et al. ( 2024 ) . ChatMusician: Understanding and Generating Music Intrinsically with LLM . Conference: 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) ( Bangkok, Thailand ) from: 11/08/2024 to: 16/08/2024 ,

10.18653/v1/2024.findings-acl.373

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97871

Xompero A, Bontonou M, Arbona J-M, Benetos E ( 2024 ) . Explaining models relating objects and privacy . Proceedings of CVPR 2024 Workshops . Conference: 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 ( Seattle Convention Center, Seattle WA, USA ) from: 18/06/2024 to: 18/06/2024 ,

10.48550/arXiv.2405.01646

https://qmro.qmul.ac.uk/xmlui/handle/123456789/96444

Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W, Benetos E ( 2024 ) . MusiLingo: bridging music and text with pre-trained language models for music captioning and query response . Conference: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) ( Mexico City, Mexico ) from: 16/06/2024 to: 21/06/2024 , 3643 - 3655 .

10.18653/v1/2024.findings-naacl.231

https://qmro.qmul.ac.uk/xmlui/handle/123456789/96229

Ozaki Y, Tierney A, Pfordresher PQ, McBride JM, Benetos E, Proutskova P, Chiba G, Liu F et al. ( 2024 ) . Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report . Science Advances vol. 10 , ( 20 )

10.1126/sciadv.adm9797

https://qmro.qmul.ac.uk/xmlui/handle/123456789/96939

Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD et al. ( 2024 ) . WavCraft: audio editing and generation with large language models . Conference: ICLR 2024 Workshop on LLM Agents ( Vienna, Austria ) from: 11/05/2024 to: 11/05/2024 ,

10.48550/arxiv.2403.09527

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97150

Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C et al. ( 2024 ) . MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training . Conference: International Conference on Learning Representations (ICLR) ( Vienna, Austria ) from: 07/05/2024 to: 11/05/2024 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/95146

Postolache E, Mariani G, Cosmo L ( 2024 ) . Generalized multi-source inference for text conditioned music diffusion models . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 6980 - 6984 .

10.1109/ICASSP48485.2024.10447122

https://qmro.qmul.ac.uk/xmlui/handle/123456789/93927

Liang J, Phan QH, Benetos E ( 2024 ) . Learning from taxonomy: multi-label few-shot classification for everyday sound recognition . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 771 - 775 .

10.1109/ICASSP48485.2024.10446908

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97149

Li D, Ma Y, Wei W, KONG Q, Wu Y, Che M, Xia F, Benetos E et al. ( 2024 ) . MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ( Seoul, Korea ) from: 14/04/2024 to: 19/04/2024 , 521 - 525 .

10.1109/ICASSP48485.2024.10447445

https://qmro.qmul.ac.uk/xmlui/handle/123456789/93901

EDWARDS D, Dixon S, Benetos E, Maezawa A, Kusaka Y ( 2024 ) . A Data-Driven Analysis of Robust Automatic Piano Transcription . IEEE Signal Processing Letters vol. 31 , 681 - 685 .

10.1109/LSP.2024.3363646

https://qmro.qmul.ac.uk/xmlui/handle/123456789/94700

Singh S, Steinmetz C, Benetos E, Phan QH, Stowell D ( 2024 ) . ATGNN: audio tagging graph neural network . IEEE Signal Processing Letters vol. 31 , 825 - 829 .

10.1109/LSP.2024.3352514

https://qmro.qmul.ac.uk/xmlui/handle/123456789/93742

Deb O, Torr P ( 2023 ) . Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines . Conference: NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization ( New Orleans, USA ) from: 16/12/2023 to: 16/12/2023 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92326

Manco I, Weck B, Doh S, Won M, Bodganov D, Wu Y, Tovstogan P, Benetos E et al. ( 2023 ) . The Song Describer dataset: a corpus of audio captions for music-and-language evaluation . Conference: NeurIPS Machine Learning for Audio Workshop ( New Orleans, USA ) from: 16/12/2023 to: 16/12/2023 ,

10.48550/arxiv.2311.10057

https://qmro.qmul.ac.uk/xmlui/handle/123456789/93119

Yuan R, Ma Y, Li Y, Zhang G, Chen X, Yin H, Zhuo L, Liu Y et al. ( 2023 ) . MARBLE: Music Audio Representation Benchmark for Universal Evaluation . Conference: 37th Conference on Neural Information Processing Systems (NeurIPS) from: 10/12/2023 to: 16/12/2023 ,

10.48550/arxiv.2306.10548

https://qmro.qmul.ac.uk/xmlui/handle/123456789/93083

Ragano A, Benetos E ( 2023 ) . Learning Music Representations with wav2vec 2.0 . Conference: 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) ( Letterkenny, Ireland ) from: 07/12/2023 to: 07/12/2023 ,

10.48550/arxiv.2210.15310

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92418

Papaioannou C, Benetos E, Potamianos A ( 2023 ) . From West to East: Who can understand the music of the others better? . Conference: 24th International Society for Music Information Retrieval Conference (ISMIR) ( Milan, Italy ) from: 05/11/2023 to: 09/11/2023 ,

10.5281/zenodo.10265287

https://qmro.qmul.ac.uk/xmlui/handle/123456789/89661

Zhuo L, Yuan R, Pan J, Ma Y, Li Y, Zhang G, Liu S, Dannenberg R et al. ( 2023 ) . LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT . Conference: 24th International Society for Music Information Retrieval Conference (ISMIR) ( Milan, Italy ) from: 05/11/2023 to: 09/11/2023 ,

10.48550/arxiv.2306.17103

https://qmro.qmul.ac.uk/xmlui/handle/123456789/90411

Sarkar S, Thorpe L, Benetos E, Sandler M ( 2023 ) . Leveraging Synthetic Data for Improving Chamber Ensemble Separation . Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) vol. 00 , 1 - 5 .

10.1109/waspaa58266.2023.10248118

https://qmro.qmul.ac.uk/xmlui/handle/123456789/89844

Vahidi C, Singh S, Benetos E, Phan H, Stowell D, Fazekas G, Lagrange M ( 2023 ) . Perceptual Musical Similarity Metric Learning with Graph Neural Networks . Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) vol. 00 , 1 - 5 .

10.1109/waspaa58266.2023.10248151

https://qmro.qmul.ac.uk/xmlui/handle/123456789/90297

Edwards D, Dixon S, Benetos E ( 2023 ) . PiJAMA: Piano Jazz with Automatic MIDI Annotations . Transactions of the International Society for Music Information Retrieval vol. 6 , ( 1 ) 89 - 102 .

10.5334/tismir.162

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91025

Liang J, Liu X, Liu H, Phan H, Benetos E, Plumbley M, Wang W ( 2023 ) . Adapting Language-Audio Models as Few-Shot Audio Learners . Conference: 24th Annual Conference of the International Speech Communication Association (INTERSPEECH) ( Dublin, Ireland ) from: 20/08/2023 to: 24/08/2023 ,

10.21437/Interspeech.2023-1082

https://qmro.qmul.ac.uk/xmlui/handle/123456789/88692

Ma Y, Yuan R, Li Y, Zhang G, Chen X, Yin H, Lin C, Benetos E et al. ( 2023 ) . On the Effectiveness of Speech Self-supervised Learning for Music .

10.48550/arxiv.2307.05161

https://qmro.qmul.ac.uk/xmlui/handle/123456789/90410

Ragano A, Benetos E, Chinen M, Becerra H, Chandan Karadagur Ananda R ( 2023 ) . A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality . Conference: Irish Signals & Systems Conference 2023 ( Dublin, Ireland ) from: 13/06/2023 to: 14/06/2023 ,

10.1109/ISSC59246.2023.10162088

https://qmro.qmul.ac.uk/xmlui/handle/123456789/87841

Ragano A, Benetos E ( 2023 ) . Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning . Conference: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) from: 04/06/2023 to: 10/06/2023 , 1 - 5 .

10.1109/icassp49357.2023.10096274

https://qmro.qmul.ac.uk/xmlui/handle/123456789/87840

Li Y, Cao W, Xie W ( 2023 ) . Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes . IEEE Transactions on Multimedia vol. 26 , 1346 - 1360 .

10.1109/TMM.2023.3280011

https://qmro.qmul.ac.uk/xmlui/handle/123456789/88344

Wang C, Benetos E, Wang S, Versace E . Joint Scattering for Automatic Chick Call Recognition . 2015 23rd European Signal Processing Conference (EUSIPCO) . Conference: 2022 30th European Signal Processing Conference (EUSIPCO)195 - 199 .

10.23919/eusipco55093.2022.9909738

https://qmro.qmul.ac.uk/xmlui/handle/123456789/79120

Li Y, Yuan R, Zhang G, Ma Y, Lin C, Chen X, Ragni A, Yin H et al. ( 2022 ) . Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning . Conference: DMRN+17: Digital Music Research Network One-day Workshop 2022 ( London, UK ) from: 20/12/2022 to: 20/12/2022 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/83372

Liu L, KONG Q, Morfi G-V, Benetos E ( 2022 ) . Performance MIDI-to-score conversion by neural beat tracking . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 04/12/2022 to: 08/12/2022 ,

10.5281/zenodo.7316682

https://qmro.qmul.ac.uk/xmlui/handle/123456789/80694

Sarkar S, Benetos E, Sandler M ( 2022 ) . EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 05/12/2022 to: 08/12/2022 ,

10.5281/zenodo.7316740

https://qmro.qmul.ac.uk/xmlui/handle/123456789/80496

Manco I, Benetos E, Fazekas G ( 2022 ) . Contrastive audio-language learning for music . https://ismir2022.ismir.net/ . Conference: 23rd International Society for Music Information Retrieval Conference (ISMIR) ( Bengaluru, India ) from: 04/12/2022 to: 08/12/2022 ,

10.5281/zenodo.7316744

https://qmro.qmul.ac.uk/xmlui/handle/123456789/81922

Mai KT, Davies T ( 2022 ) . Explaining the decisions of anomalous sound detectors . https://dcase.community/workshop2022/ . Conference: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) ( Nancy, France ) from: 03/11/2022 to: 04/11/2022 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/82013

Liang J, Phan QH, Benetos E ( 2022 ) . Leveraging label hierarchies for few-shot everyday sound recognition . Conference: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) ( Nancy, France ) from: 03/11/2022 to: 04/11/2022 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/82109

Ozaki Y, Kuroyanagi J, McBride J, Proutskova P, Tierney A, Benetos E ( 2022 ) . Similarities and differences in a cross-linguistic sample of song and speech recordings . Conference: Joint Conference on Language Evolution ( Kanazawa, Japan ) from: 05/09/2022 to: 08/09/2022 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/78999

Singh S, Benetos E, Phan QH ( 2022 ) . Hypernetworks for sound event detection: a proof-of-concept . Conference: 30th European Signal Processing Conference (EUSIPCO 2022) ( Belgrade, Serbia ) from: 29/08/2022 to: 03/09/2022 , 429 - 433 .

10.23919/eusipco55093.2022.9909716

https://qmro.qmul.ac.uk/xmlui/handle/123456789/80502

Daikoku H, Ding S, Benetos E, Wood ALC, Shimizono T, Sanne US ( 2022 ) . Agreement among human and automated estimates of similarity in a global music sample . Conference: 10th International Workshop on Folk Music Analysis (FMA 2022) ( Sheffield, UK ) from: 14/06/2022 to: 17/06/2022 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/78967

Ou L, Guo Z, Benetos E ( 2022 ) . Exploring Transformer’s Potential on Automatic Piano Transcription . Conference: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) vol. 00 , 776 - 780 .

10.1109/icassp43922.2022.9746789

https://qmro.qmul.ac.uk/xmlui/handle/123456789/76838

Huang J, Benetos E, Ewert S ( 2022 ) . Improving lyrics Alignment through Joint Pitch Detection . Conference: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing ( Singapore ) from: 22/05/2022 to: 27/05/2022 , 451 - 455 .

10.1109/ICASSP43922.2022.9746460

https://qmro.qmul.ac.uk/xmlui/handle/123456789/76603

Manco I, Benetos E, Quinton E ( 2022 ) . Learning music audio representations via weak language supervision . Conference: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing ( Singapore ) from: 22/05/2022 to: 27/05/2022 , 456 - 460 .

10.1109/ICASSP43922.2022.9746996

https://qmro.qmul.ac.uk/xmlui/handle/123456789/77270

Ragano A, Benetos E ( 2022 ) . Automatic Quality Assessment of Digitized and Restored Sound Archives . Journal of the Audio Engineering Society vol. 70 , ( 4 ) 252 - 270 .

10.17743/jaes.2022.0002

https://qmro.qmul.ac.uk/xmlui/handle/123456789/76602

Wang C, Benetos E, Lostanlen V ( 2022 ) . Adaptive Scattering Transforms for Playing Technique Recognition . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 30 , 1407 - 1421 .

10.1109/TASLP.2022.3156785

https://qmro.qmul.ac.uk/xmlui/handle/123456789/77277

Benetos E, Ragano A, Sgroi D ( 2022 ) . Measuring national mood with music: using machine learning to construct a measure of national valence from audio data . Behavior Research Methods vol. 54 , ( 6 ) 3085 - 3092 .

10.3758/s13428-021-01747-7

https://qmro.qmul.ac.uk/xmlui/handle/123456789/75092

Terenzi A, Nolasco I, Benetos E ( 2021 ) . Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity . IEEE Transactions on Audio Speech and Language Processing vol. 30 , 112 - 122 .

10.1109/taslp.2021.3133194

https://qmro.qmul.ac.uk/xmlui/handle/123456789/75462

Bodo RPP, Benetos E ( 2021 ) . A framework for music similarity and cover song identification . Conference: 15th International Symposium on Computer Music Multidisciplinary Research (CMMR) ( Tokyo, Japan ) from: 15/11/2021 to: 19/11/2021 , 205 - 214 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/75043

Liu L, Morfi V, Benetos E ( 2021 ) . ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription . Conference: Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference

https://qmro.qmul.ac.uk/xmlui/handle/123456789/79136

Ozaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, T. Tierney A, Proutskova P, Fukatsu H et al. ( 2021 ) . Agreement among human and annotated transcriptions of global songs . Conference: 22nd International Society for Music Information Retrieval Conference (ISMIR) from: 09/11/2021 to: 12/11/2021 , 500 - 508 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73595

Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S ( 2021 ) . Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes . Conference: 22nd International Society for Music Information Retrieval Conference (ISMIR) from: 09/11/2021 to: 12/11/2021 , 389 - 395 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73591

O'Hanlon K, Benetos E, Dixon S ( 2021 ) . Detecting Cover Songs with Pitch Class Key-Invariant Networks . Conference: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) vol. 00 , 1 - 6 .

10.1109/mlsp52302.2021.9596389

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74336

Holzapfel A, Benetos E, Widdess R ( 2021 ) . Humanities and engineering perspectives on music transcription . Digital Scholarship in the Humanities vol. 37 , ( 3 ) 747 - 764 .

10.1093/llc/fqab074

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73459

Bear HL, Morfi V . An Evaluation of Data Augmentation Methods for Sound Scene Geotagging . Conference: Interspeech 2021581 - 585 .

10.21437/interspeech.2021-1837

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72840

Sarkar S, Benetos E, Sandler M ( 2021 ) . Vocal Harmony Separation using Time-domain Neural Networks . Conference: 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) ( Brno, Czech Republic ) from: 30/08/2021 to: 03/09/2021 , 3515 - 3519 .

10.21437/Interspeech.2021-1531

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72826

Zhao Y, Wang C, Fazekas G, Benetos E, Sandler M ( 2021 ) . Violinist identification based on vibrato features . Conference: 2021 29th European Signal Processing Conference (EUSIPCO) vol. 00 , 381 - 385 .

10.23919/eusipco54536.2021.9616197

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72433

Manco I, Benetos E, Quinton E ( 2021 ) . MusCaps: generating captions for music audio . Conference: International Joint Conference on Neural Networks (IJCNN) from: 18/07/2021 to: 22/07/2021 ,

10.1109/IJCNN52387.2021.9533461

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72068

Cheuk KW, Luo Y-J, Benetos E, Herremans D ( 2021 ) . Revisiting the onsets and frames model with additive attention . Conference: International Joint Conference on Neural Networks (IJCNN) from: 18/07/2021 to: 22/07/2021 ,

10.1109/IJCNN52387.2021.9533407

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72070

Liu L, Benetos E ( 2021 ) . From Audio to Music Notation . Handbook of Artificial Intelligence for Music , Springer Nature

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73211

Ragano A, Benetos E, Hines A ( 2021 ) . More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations . Conference: 2021 13th International Conference on Quality of Multimedia Experience (QoMEX) vol. 00 , 103 - 108 .

10.1109/qomex51781.2021.9465410

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72280

Liu L, Morfi G-V, Benetos E ( 2021 ) . Joint multi-pitch detection and score transcription for polyphonic piano music . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Toronto, Canada ) from: 06/06/2021 to: 11/06/2021 ,

10.1109/ICASSP39728.2021.9413601

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70432

Singh S, Bear H, Benetos E ( 2021 ) . Prototypical Networks for Domain Adaptation in Acoustic Scene Classification . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Toronto, Canada ) from: 06/06/2021 to: 11/06/2021 ,

10.1109/ICASSP39728.2021.9414876

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70431

Subramanian V, Gururani S, Benetos E, Sandler M ( 2021 ) . Anomalous behaviour in loss-gradient based interpretability methods . Conference: RobustML workshop paper at ICLR 2021

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72075

Cheuk KW, Benetos E, Luo Y, Herremans D ( 2021 ) . The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy . Conference: 25th International Conference on Pattern Recognition (ICPR2020) ( Milan, Italy ) from: 10/01/2021 to: 15/01/2021 , 9091 - 9098 .

10.1109/ICPR48806.2021.9412155

https://qmro.qmul.ac.uk/xmlui/handle/123456789/67744

Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S, Ohlsson P ( 2020 ) . Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation . IEEE Signal Processing Letters vol. 28 , 81 - 85 .

10.1109/LSP.2020.3045915

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69527

Liu L, Morfi G-V, Benetos E ( 2020 ) . Joint Piano-roll and Score Transcription for Polyphonic Piano Music . Conference: DMRN+15: Digital Music Research Network One-day Workshop ( London, UK ) from: 15/12/2020 to: 15/12/2020 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70433

Chettri B, Benetos E, Sturm BLT ( 2020 ) . Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 28 , 3018 - 3028 .

10.1109/TASLP.2020.3036777

https://qmro.qmul.ac.uk/xmlui/handle/123456789/67745

Chettri B, Kinnunen T ( 2020 ) . Subband modeling for spoofing detection in automatic speaker verification . http://www.odyssey2020.org/ . Conference: Odyssey 2020: The Speaker and Language Recognition Workshop ( Tokyo, Japan ) from: 01/11/2020 to: 05/11/2020 , 341 - 348 .

10.21437/Odyssey.2020-48

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64069

Ragano A, Benetos E ( 2020 ) . Development of a Speech Quality Database Under Uncontrolled Conditions . Conference: 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) ( Shanghai, China ) from: 25/10/2020 to: 29/10/2020 ,

10.21437/Interspeech.2020-1899

https://qmro.qmul.ac.uk/xmlui/handle/123456789/66680

Pankajakshan A, Bear H, Benetos E ( 2020 ) . Memory Controlled Sequential Self Attention for Sound Recognition . Conference: 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) ( Shanghai, China ) from: 25/10/2020 to: 29/10/2020 ,

10.21437/Interspeech.2020-1953

https://qmro.qmul.ac.uk/xmlui/handle/123456789/67665

MISHRA S, Benetos E, Sturm B, Dixon S ( 2020 ) . Reliable Local Explanations for Machine Listening . Conference: International Joint Conference on Neural Networks (IJCNN) ( Glasgow, UK ) from: 19/07/2020 to: 24/07/2020 ,

10.1109/IJCNN48605.2020.9207444

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64505

Ycart A, Liu L, Benetos E, Pearce M ( 2020 ) . Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription . Transactions of the International Society for Music Information Retrieval vol. 3 , ( 1 ) 68 - 81 .

10.5334/tismir.57

https://qmro.qmul.ac.uk/xmlui/handle/123456789/65069

Ragano A, Benetos E ( 2020 ) . Audio impairment recognition using a correlation-based feature representation . http://qomex2020.ie/ . Conference: 12th International Conference on Quality of Multimedia Experience (QoMEX) ( Athlone, Ireland ) from: 26/05/2020 to: 28/05/2020 ,

10.1109/QoMEX48832.2020.9123111

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63540

SUBRAMANIAN V, Pankajakshan A, Benetos E, Xu N, McDonald S, Sandler M ( 2020 ) . A Study on the Transferability of Adversarial Attacks in Sound Event Classification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 301 - 305 .

10.1109/ICASSP40776.2020.9054445

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63241

Wei W, Zhu H, Benetos E ( 2020 ) . A-CRNN: a domain adaptation model for sound event detection . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 276 - 280 .

10.1109/ICASSP40776.2020.9054248

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63518

Martinez Ramirez M, Benetos E, Reiss J ( 2020 ) . Modeling plate and spring reverberation using a DSP-informed deep neural network . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 241 - 245 .

10.1109/ICASSP40776.2020.9053093

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62846

Wang C, Lostanlen V, Benetos E ( 2020 ) . Playing Technique Recognition by Joint Time–Frequency Scattering . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) ( Barcelona, Spain ) from: 04/05/2020 to: 08/05/2020 , 881 - 885 .

10.1109/ICASSP40776.2020.9053474

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63588

Ycart A, Liu L, Benetos E ( 2020 ) . Musical Features for Automatic Music Transcription Evaluation .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70213

Ycart A, Benetos E ( 2020 ) . Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 28 , ( 1 ) 1328 - 1341 .

10.1109/TASLP.2020.2987130

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63818

Chettri B, Kinnunen T ( 2020 ) . Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification . Computer Speech and Language vol. 63 , Article 101092 ,

10.1016/j.csl.2020.101092

https://qmro.qmul.ac.uk/xmlui/handle/123456789/63242

Martinez Ramirez M, Benetos E, Reiss J ( 2020 ) . Deep Learning for Black-Box Modeling of Audio Effects . Applied Sciences vol. 10 , ( 2 ) Article 638 ,

10.3390/app10020638

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62479

Liu L, Benetos E ( 2019 ) . Automatic Music Accompaniment with a Chroma-based Music Data Representation . Conference: DMRN+14: Digital Music Research Network One-day Workshop

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62518

Ycart A, Stoller D ( 2019 ) . A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 470 - 477 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59184

Wang C, Benetos E, Lostanlen V ( 2019 ) . Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals . Conference: International Society for Music Information Retrieval Conference ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 809 - 815 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59179

Holzapfel A ( 2019 ) . Automatic music transcription and ethnomusicology: a user study . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 678 - 684 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59182

Ycart A, McLeod A, Benetos E ( 2019 ) . Blending acoustic and language model predictions for automatic music transcription . Conference: 20th conference of the International Society for Music Information Retrieval (ISMIR) ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 , 454 - 461 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59183

Wang C, Benetos E ( 2019 ) . CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis . Conference: International Society for Music Information Retrieval Conference Late-Breaking Demo Session ( Delft, The Netherlands ) from: 04/11/2019 to: 08/11/2019 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59775

Singh S, Pankajakshan A ( 2019 ) . Audio tagging using a linear noise modelling layer . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 234 - 238 .

10.33682/zyc0-jw35

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59660

Pankajakshan A, Benetos E ( 2019 ) . Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 174 - 178 .

10.33682/sm6r-8p49

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59659

SUBRAMANIAN V, Benetos E, Sandler M ( 2019 ) . Robustness of Adversarial Attacks in Sound Event Classification . http://dcase.community/workshop2019/ . Conference: 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) ( New York, USA ) from: 25/10/2019 to: 26/10/2019 , 239 - 243 .

10.33682/sp9n-qk06

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59658

Bear H, Heittola T, Mesaros A, Virtanen T ( 2019 ) . City classification from multiple real-world sound scenes . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 11 - 15 .

10.1109/WASPAA.2019.8937271

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59069

Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S ( 2019 ) . Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 40 - 44 .

10.1109/WASPAA.2019.8937079

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59067

Pankajakshan A, Bear H ( 2019 ) . Polyphonic sound event and sound activity detection: a multi-task approach . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics ( New Paltz, NY, USA ) from: 20/10/2019 to: 23/10/2019 , 318 - 322 .

10.1109/WASPAA.2019.8937193

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59068

Chettri B, Stoller D, Morfi V, Martinez Ramirez M ( 2019 ) . Ensemble Models for Spoofing Detection in Automatic Speaker Verification . Conference: 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ( Graz, Austria ) from: 15/07/2019 to: 19/09/2019 , 1018 - 1022 .

10.21437/Interspeech.2019-2505

https://qmro.qmul.ac.uk/xmlui/handle/123456789/58459

Bear H, Nolasco I ( 2019 ) . Towards joint sound scene and polyphonic sound event recognition . Conference: 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ( Graz, Austria ) from: 15/09/2019 to: 19/09/2019 , 4594 - 4598 .

10.21437/Interspeech.2019-2169

https://qmro.qmul.ac.uk/xmlui/handle/123456789/58478

Martinez Ramirez M, Benetos E, Reiss J ( 2019 ) . A general-purpose deep learning approach to model time-varying audio effects . Conference: International Conference on Digital Audio Effects (DAFx-19) ( Birmingham, UK ) from: 02/09/2019 to: 06/09/2019 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/58376

Zhou Q, Feng Z ( 2019 ) . Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF . Sensors vol. 19 , ( 14 ) Article 3206 ,

10.3390/s19143206

https://qmro.qmul.ac.uk/xmlui/handle/123456789/58616

Subramanian V, Benetos E, Xu N, McDonald S, Sandler MB ( 2019 ) . Adversarial Attacks in Sound Event Classification .

Covas E ( 2019 ) . Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting . Chaos vol. 29 , ( 6 ) Article 063111 ,

10.1063/1.5095060

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57841

Ragano A, BENETOS E ( 2019 ) . Adapting the Quality of Experience Framework for Audio Archive Evaluation . https://www.qomex2019.de/ . Conference: 11th International Conference on Quality of Multimedia Experience ( Berlin, Germany ) from: 05/06/2019 to: 07/06/2019 ,

10.1109/QoMEX.2019.8743302

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57323

WANG C, BENETOS E, MENG X ( 2019 ) . HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute . Proceedings of Sound and Music Computing Conference . Conference: Sound and Music Computing Conference ( Malaga, Spain ) from: 28/05/2019 to: 31/05/2019 , 545 - 550 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57029

Lins F, Johann M, BENETOS E ( 2019 ) . Automatic Transcription of Diatonic Harmonica Recordings . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,

10.1109/ICASSP.2019.8682334

https://qmro.qmul.ac.uk/xmlui/handle/123456789/56489

Phaye SSR, BENETOS E, Wang Y ( 2019 ) . SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,

10.1109/ICASSP.2019.8683288

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55777

MISHRA S, STOLLER D, BENETOS E, STURM B, DIXON S ( 2019 ) . GAN-based Generation and Automatic Selection of Explanations for Neural Networks . https://sites.google.com/view/safeml-iclr2019 . Conference: SafeML ICLR 2019 Workshop ( New Orleans, USA ) from: 06/05/2019 to: 06/05/2019 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57216

Nolasco I, Terenzi A, Cecchi S, Orcioni S ( 2019 ) . Audio-based identification of beehive states . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Brighton, UK ) from: 12/05/2019 to: 17/05/2019 ,

10.1109/ICASSP.2019.8682981

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55776

BENETOS E, DIXON S, Duan Z, EWERT S ( 2019 ) . Automatic Music Transcription: An Overview . IEEE Signal Processing Magazine vol. 36 , ( 1 ) 20 - 30 .

10.1109/MSP.2018.2869928

https://qmro.qmul.ac.uk/xmlui/handle/123456789/54987

CHETTRI B, MISHRA S, STURM B, BENETOS E ( 2018 ) . Analysing the predictions of a CNN-based replay spoofing detection system . http://www.slt2018.org/ . Conference: 2018 IEEE Workshop on Spoken Language Technology ( Athens, Greece ) from: 18/12/2018 to: 21/12/2018 , 92 - 97 .

10.1109/SLT.2018.8639666

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55093

BEAR H ( 2018 ) . An extensible cluster-graph taxonomy for open set sound scene analysis . http://dcase.community/workshop2018/ . Conference: Workshop on Detection and Classification of Acoustic Scenes and Events ( Surrey, UK ) from: 19/11/2018 to: 20/11/2018 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/45944

Nolasco I, BENETOS E ( 2018 ) . To bee or not to bee: Investigating machine learning approaches for beehive sound recognition . http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Nolasco_131.pdf . Conference: 2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018) ( Surrey, UK ) from: 19/11/2018 to: 20/11/2018 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/53444

YCART A ( 2018 ) . A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations . Conference: 19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session ( Paris ) from: 23/09/2018 to: 27/09/2018 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/45985

WANG C, BENETOS E, MENG X ( 2018 ) . Towards HMM-based glissando detection for recordings of Chinese bamboo flute . http://ismir2018.ircam.fr/pages/events-lbd.html . Conference: International Society for Music Information Retrieval Conference Late-Breaking Demos Session ( Paris, France ) from: 23/09/2018 to: 27/09/2018 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/45825

CHETTRI B, STURM BLT, BENETOS E ( 2018 ) . Analysing replay spoofing countermeasure performance under varied conditions . Conference: IEEE International Workshop on Machine Learning for Signal Processing ( Aalborg, Denmark ) from: 17/09/2018 to: 20/09/2018 ,

10.1109/MLSP.2018.8516968

https://qmro.qmul.ac.uk/xmlui/handle/123456789/49725

Ali H, Tran SN, d'Avila Garcez AS ( 2018 ) . Speaker recognition with hybrid features from a deep belief network . Neural Computing and Applications vol. 29 , ( 6 ) 13 - 19 .

10.1007/s00521-016-2501-7

https://qmro.qmul.ac.uk/xmlui/handle/123456789/15698

Chettri B, Mishra S, Sturm BL ( 2018 ) . A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62299

YCART A ( 2018 ) . Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Calgary, Canada ) from: 15/04/2018 to: 20/04/2018 , 386 - 390 .

10.1109/ICASSP.2018.8462128

https://qmro.qmul.ac.uk/xmlui/handle/123456789/34703

Nakamura E, BENETOS E, Yoshii K, DIXON S ( 2018 ) . Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization . Conference: IEEE International Conference on Acoustics, Speech and Signal Processing ( Calgary, Canada ) from: 15/04/2018 to: 20/04/2018 , 101 - 105 .

10.1109/ICASSP.2018.8461914

https://qmro.qmul.ac.uk/xmlui/handle/123456789/40583

Valero-Mas JJ, BENETOS E, Iñesta JM ( 2018 ) . A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription . Journal of New Music Research vol. 47 , ( 3 ) 249 - 263 .

10.1080/09298215.2018.1451546

https://qmro.qmul.ac.uk/xmlui/handle/123456789/36298

Mesaros A, Heittola T, Benetos E, Foster P, Lagrange M ( 2018 ) . Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 26 , ( 2 ) 379 - 393 .

10.1109/TASLP.2017.2778423

https://qmro.qmul.ac.uk/xmlui/handle/123456789/28982

PANTELI M, BENETOS E, DIXON S ( 2018 ) . A review of manual and computational approaches for the study of world music corpora . Journal of New Music Research vol. 47 , ( 2 ) 176 - 189 .

10.1080/09298215.2017.1418896

https://qmro.qmul.ac.uk/xmlui/handle/123456789/31440

BENETOS E, STOWELL D, PLUMBLEY M, Virtanen T, PLUMBLEY M, Ellis D ( 2018 ) . Approaches to complex sound scene analysis . Computational Analysis of Sound Scenes and Events , Edition. 1 , Springer International Publishing

( 2018 ) . Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018 . ISMIR .

PANTELI M, BENETOS E, DIXON S ( 2017 ) . A computational study on outliers in world music . PLoS ONE vol. 12 , ( 12 ) Article e0189399 , 1 - 28 .

10.1371/journal.pone.0189399

https://qmro.qmul.ac.uk/xmlui/handle/123456789/31245

McLeod A, Steedman M, BENETOS E ( 2017 ) . Automatic Transcription of Polyphonic Vocal Music . Applied Sciences vol. 7 , ( 12 ) Article 1285 ,

10.3390/app7121285

https://qmro.qmul.ac.uk/xmlui/handle/123456789/30824

Ycart A, Benetos E ( 2017 ) . A study on LSTM networks for polyphonic music sequence modelling . Conference: 18th International Society for Music Information Retrieval Conference (ISMIR 2017) ( Suzhou, China ) from: 23/10/2017 to: 27/10/2017 , 421 - 427 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/24946

Schramm R, McLeod A, Benetos E ( 2017 ) . Multi-pitch detection and voice assignment for a cappella recordings of multiple singers . Conference: 18th International Society for Music Information Retrieval Conference (ISMIR 2017) ( Suzhou, China ) from: 23/10/2017 to: 27/10/2017 , 552 - 559 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/25292

Lafay G, Lagrange M ( 2017 ) . Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results . http://www.waspaa.com/ . Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017) ( New Paltz, NY, USA ) from: 18/10/2017 to: 15/10/2017 , 11 - 15 .

10.1109/WASPAA.2017.8169985

https://qmro.qmul.ac.uk/xmlui/handle/123456789/25293

YCART A, BENETOS E ( 2017 ) . Neural Music Language Models: investigating the training process . Conference: International Conference of Students of Systematic Musicology

https://qmro.qmul.ac.uk/xmlui/handle/123456789/36559

Valero-Mas JJ, Benetos E ( 2017 ) . Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,

10.17743/aesconf.2017.978-1-942220-15-2

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22497

Schramm R ( 2017 ) . Automatic Transcription of a Cappella Recordings from Multiple Singers . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,

10.17743/aesconf.2017.978-1-942220-15-2

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22496

Benetos E ( 2017 ) . Polyphonic note and instrument tracking using linear dynamical systems . Conference: 2017 AES International Conference on Semantic Audio ( Erlangen, Germany ) from: 22/06/2017 to: 24/06/2017 ,

10.17743/aesconf.2017.978-1-942220-15-2

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22284

Stowell D, Benetos E, Gill LF ( 2017 ) . On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts . IEEE/ACM Trans. Audio, Speech & Language Processing vol. 25 , ( 6 ) 1193 - 1206 .

10.1109/TASLP.2017.2690565

https://qmro.qmul.ac.uk/xmlui/handle/123456789/18483

Stowell D, Benetos E, Gill LF ( 2017 ) . On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 25 , ( 6 ) 1193 - 1206 .

10.1109/TASLP.2017.2690565

https://qmro.qmul.ac.uk/xmlui/handle/123456789/18483

Benetos E, Lafay G, Plumbley MD ( 2017 ) . Polyphonic Sound Event Tracking using Linear Dynamical Systems . IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 25 , ( 6 ) 1266 - 1277 .

10.1109/TASLP.2017.2690576

https://qmro.qmul.ac.uk/xmlui/handle/123456789/19368

Russell AJ, Benetos E ( 2017 ) . On the Memory Properties of Recurrent Neural Models . Conference: International Joint Conference on Neural Networks (IJCNN 2017) ( Anchorage, Alaska, USA ) from: 19/05/2017 to: 14/05/2017 , 2596 - 2603 .

10.1109/IJCNN.2017.7966173

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22173

Abdallah S, Benetos E, Gold N, Hargreaves S ( 2017 ) . The Digital Music Lab: A Big Data Infrastructure for Digital Musicology . ACM Journal on Computing and Cultural Heritage vol. 10 , ( 1 )

10.1145/2983918

https://qmro.qmul.ac.uk/xmlui/handle/123456789/15701

BENETOS E ( 2016 ) . Automatic Transcription of Vocal Quartets . DMRN+11: Digital Music Research Network Workshop Proceedings 2016 . Conference: DMRN+11: Digital Music Research Network One-day Workshop 2016 ( Centre for Digital Music, Queen Mary University of London ) from: 20/12/2016 to: 20/12/2016 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/19399

YCART A, Benetos E ( 2016 ) . Towards a Music Language Model for Audio Analysis . DMRN+11: Digital Music Research Network Workshop Proceedings 2016 . Conference: DMRN+11: Digital Music Research Network One-day Workshop 2016 ( Centre for Digital Music, Queen Mary University of London ) from: 20/12/2016 to: 20/12/2016 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/19394

Valero-Mas JJ, Benetos E ( 2016 ) . Classification-based Note Tracking for Automatic Music Transcription . https://sites.google.com/site/musicmachinelearning16/proceedings . Conference: 9th International Workshop on Machine Learning and Music ( Riva del Garda, Italy ) from: 23/09/2016 to: 23/09/2016 , 61 - 65 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/14957

Abdallah S, Gold N, Hargreaves S, Weyde T, Wolff D ( 2016 ) . Digital Music Lab: A Framework for Analysing Big Music Data . Conference: 24th European Signal Processing Conference ( Budapest, Hungary ) from: 29/08/2016 to: 02/09/2016 , 1118 - 1122 .

10.1109/EUSIPCO.2016.7760422

https://qmro.qmul.ac.uk/xmlui/handle/123456789/13424

Cheng T, Mauch M, Benetos E, Dixon S ( 2016 ) . An attack/decay model for piano transcription . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 584 - 590 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/12637

Panteli M, Benetos E, Dixon S ( 2016 ) . Learning a feature space for similarity in world music . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 538 - 544 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/12616

Holzapfel A, Benetos E ( 2016 ) . The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes . Conference: 17th International Society for Music Information Retrieval Conference ( New York, USA ) from: 07/08/2016 to: 11/08/2016 , 531 - 537 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/12636

Lafay G, Lagrange M, Rossignol M, Benetos E ( 2016 ) . A morphological model for simulating acoustic scenes and its application to sound event detection . IEEE/ACM Transactions on Audio, Speech, and Language Processing vol. 24 , ( 10 ) 1854 - 1864 .

10.1109/TASLP.2016.2587218

https://qmro.qmul.ac.uk/xmlui/handle/123456789/17621

Panteli M, Benetos E, Dixon S ( 2016 ) . Automatic detection of outliers in world music collections . Conference: Fourth International Conference on Analytical Approaches to World Music (AAWM 2016) ( New York, USA ) from: 11/06/2016 to: 08/06/2016 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/11743

Benetos E, Lafay G, Lagrange M ( 2016 ) . Detection of Overlapping Acoustic Events Using a Temporally-Constrained Probabilistic Model . Conference: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)6450 - 6454 .

10.1109/icassp.2016.7472919

https://qmro.qmul.ac.uk/xmlui/handle/123456789/11120

Sigtia S, Benetos E, Dixon S ( 2016 ) . An End-to-End Neural Network for Polyphonic Piano Music Transcription . IEEE/ACM Transactions on Audio, Speech, and Language Processing vol. 24 , ( 5 ) 927 - 939 .

10.1109/TASLP.2016.2533858

https://qmro.qmul.ac.uk/xmlui/handle/123456789/17623

Benetos E ( 2015 ) . An efficient temporally-constrained probabilistic model for multiple-instrument music transcription . http://ismir2015.uma.es/docs/ISMIR2015_Proceedings.pdf . Conference: 16th International Society for Music Information Retrieval Conference (ISMIR) ( Malaga, Spain ) from: 26/10/2015 to: 30/10/2015 , 701 - 707 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9852

BENETOS E, Holzapfel A ( 2015 ) . Automatic transcription of Turkish microtonal music . Journal of the Acoustical Society of America vol. 138 , ( 4 ) 2118 - 2130 .

10.1121/1.4930187

https://qmro.qmul.ac.uk/xmlui/handle/123456789/11122

Stowell D, Giannoulis D, Benetos E, Lagrange M, Plumbley MD ( 2015 ) . Detection and Classification of Acoustic Scenes and Events . IEEE Transactions on Multimedia vol. 17 , ( 10 ) 1733 - 1746 .

10.1109/TMM.2015.2428998

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9390

Rossignol M, Lagrange M, Lafay G ( 2015 ) . Alternate level clustering for drum transcription . Conference: 23rd European Signal Processing Conference (EUSIPCO) ( Nice, France ) from: 04/09/2015 to: 31/08/2015 , 2068 - 2072 .

10.1109/EUSIPCO.2015.7362739

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9853

Abdallah S, Alencar-Brayner A, BENETOS E, Cottrell S, Dykes J, Gold N, Kachkaev A, Tidhar D ( 2015 ) . Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection . http://fma2015.sciencesconf.org/conference/fma2015/FMA2015_OfficialProceedings.pdf . Conference: 5th International Workshop on Folk Music Analysis ( Paris, France ) from: 10/06/2015 to: 12/06/2015 , 10 - 12 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9391

Sigtia S, Benetos E, Boulanger-Lewandowski N, Weyde T, Garcez ASDA, Dixon S ( 2015 ) . A Hybrid Recurrent Neural Network for Music Transcription . IEEE International Conference on Acoustics Speech and Signal Processing . Conference: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ( Brisbane, Australia ) from: 19/04/2015 to: 24/04/2015 , 2061 - 2065 .

10.1109/ICASSP.2015.7178333

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9420

Benetos E, Badeau R, Weyde T ( 2014 ) . Template Adaptation for Improving Automatic Music Transcription . http://www.terasoft.com.tw/conf/ismir2014//proceedings%5CISMIR2014_Proceedings.pdf . Conference: 15th International Society for Music Information Retrieval Conference (ISMIR) ( Taipei, Taiwan ) from: 27/10/2014 to: 31/10/2014 , 175 - 180 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9414

Tidhar D, Dixon S, Benetos E, Weyde T ( 2014 ) . The temperament police . Early Music vol. 42 , ( 4 ) 579 - 590 .

10.1093/em/cau101

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9419

Weyde T, Cottrell S, Dykes J, Benetos E, Wolff D, Tidhar D, Gold N, Abdallah S et al. ( 2014 ) . Big Data for Musicology . Conference: 1st International Digital Libraries for Musicology workshop ( London, UK ) from: 12/09/2014 to: 12/09/2014 ,

10.1145/2660168.2660187

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9417

Wolff D, Tidhar D, Benetos E, Dumon E, Cherla S, Page K, Fields B ( 2014 ) . Incremental dataset definition for large scale musicological research . Conference: 1st International Digital Libraries for Musicology workshop ( London, UK ) from: 12/09/2014 to: 12/09/2014 ,

10.1145/2660168.2660176

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9416

Tran S, Benetos E, d Avila Garcez A ( 2014 ) . Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition . Conference: 2014 International Joint Conference on Neural Networks (IJCNN) ( Beijing, China ) from: 06/07/2014 to: 11/07/2014 , 2123 - 2129 .

10.1109/IJCNN.2014.6889945

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9411

Benetos E, Holzapfel A, Holzapfel A ( 2014 ) . Incorporating pitch class profiles for improving automatic transcription of Turkish makam music . Proceedings of the Fourth International Workshop on Folk Music Analysis (FM . Conference: 4th International Workshop on Folk Music Analysis ( Istanbul, Turkey ) from: 12/06/2014 to: 13/06/2014 , 15 - 20 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9392

Giannoulis D, Benetos E, Klapuri A ( 2014 ) . Improving instrument recognition in polyphonic music through system integration . Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing ( Florence, Italy ) from: 04/05/2014 to: 09/05/2014 , 5259 - 5263 .

10.1109/ICASSP.2014.6854599

https://qmro.qmul.ac.uk/xmlui/handle/123456789/5979

Benetos E, Weyde T ( 2014 ) . Improving automatic music transcription through key detection . http://www.aes.org/conferences/53/technical_programme.cfm . Conference: AES 53rd International Conference on Semantic Audio ( London, UK ) from: 27/01/2014 to: 29/01/2014 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9412

Benetos E, Ewert S, Weyde T ( 2014 ) . Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music . Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) . 3131 - 3135 .

10.1109/ICASSP.2014.6854172

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9413

Sigtia S, Benetos E, Cherla S, Weyde T, Garcez A, Dixon S ( 2014 ) . RNN-based Music Language Models for Improving Automatic Music Transcription . 15th International Society for Music Information Retrieval Conference . 53 - 58 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9415

BARTHET M, Benetos E, Cottrell S, Dixon S, Dykes J, Gold N, Mahey M, Plumbley MD et al. ( 2014 ) . The DML Research Project: Digital Music Lab - Analysing Big Music Data . Presented at: Workshop on "Big Data: Challenges and Applications", Imperial College, London ,

Benetos E, Holzapfel A ( 2013 ) . Automatic transcription of Turkish makam music . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 355 - 360 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9421

Benetos E, Weyde T ( 2013 ) . Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 269 - 274 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9432

de Valk R, Weyde T, Britto AS, Gouyon F, Dixon S ( 2013 ) . A machine learning approach to voice separation in lute tablature . Conference: 14th International Society for Music Information Retrieval Conference ( Curitiba, PR, Brazil ) from: 04/11/2013 to: 08/11/2013 , 555 - 560 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9436

Giannoulis D, Benetos E, Stowell D, Rossignol M, Lagrange M, Plumbley MD ( 2013 ) . DETECTION AND CLASSIFICATION OF ACOUSTIC SCENES AND EVENTS: AN IEEE AASP CHALLENGE . Conference: 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics1 - 4 .

10.1109/waspaa.2013.6701819

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9439

Giannoulis D, Stowell D, Benetos E, Rossignol M, Lagrange M, Plumbley MD ( 2013 ) . A database and challenge for acoustic scene classification and event detection . Conference: 21st European Signal Processing Conference ( Marrakech, Morocco )

Benetos E, Cherla S ( 2013 ) . An efficient shift-invariant model for polyphonic music transcription . Conference: 6th International Workshop on Machine Learning and Music ( Prague, Czech Republic )

https://qmro.qmul.ac.uk/xmlui/handle/123456789/9438

Benetos E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A ( 2013 ) . Automatic music transcription: challenges and future directions . Journal of Intelligent Information Systems vol. 41 , ( 3 ) 407 - 434 .

10.1007/s10844-013-0258-3

https://qmro.qmul.ac.uk/xmlui/handle/123456789/7938

Serra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gomez E, Gouyon F et al. ( 2013 ) . Roadmap for Music Information ReSearch . The MIReS Consortium

Benetos E, Dixon S ( 2013 ) . Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model . The Journal of the Acoustical Society of America vol. 133 , ( 3 ) 1727 - 1741 .

10.1121/1.4790351

Benetos E, Dixon S ( 2012 ) . A Shift-Invariant Latent Variable Model for Automatic Music Transcription . Computer Music Journal vol. 36 , ( 4 ) 81 - 94 .

10.1162/comj_a_00146

BENETOS E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A ( 2012 ) . Automatic Music Transcription: Breaking the Glass Ceiling . Conference: 13th International Society for Music Information Retrieval Conference (ISMIR 2012) ( Porto, Portugal ) from: 08/10/2012 to: 12/10/2012 , 379 - 384 .

Zijlstra A, Mancini M, Lindemann U, Chiari L ( 2012 ) . Sit-stand and stand-sit transitions in older adults and patients with Parkinson’s disease: event detection based on motion sensors versus force plates . Journal of NeuroEngineering and Rehabilitation vol. 9 , ( 1 )

10.1186/1743-0003-9-75

Benetos E, Lagrange M, Dixon S ( 2012 ) . Characterisation of acoustic scenes using a temporally-constrained shift-invariant model . 15th International Conference on Digital Audio Effects, DAFx 2012 Proceedings .

Benetos E, Klapuri A, Dixon S ( 2012 ) . Score-informed transcription for automatic piano tutoring . Conference: 20th European Signal Processing Conference ( Bucharest, Romania ) 2153 - 2157 .

Benetos E, Dixon S ( 2012 ) . Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection . Lecture Notes in Computer Science . vol. 7191 , 364 - 371 .

10.1007/978-3-642-28551-6_45

Benetos E, Dixon S ( 2011 ) . A TEMPORALLY-CONSTRAINED CONVOLUTIVE PROBABILISTIC MODEL FOR PITCH DETECTION . Conference: 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)133 - 136 .

10.1109/aspaa.2011.6082270

Benetos E, Dixon S ( 2011 ) . Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription . IEEE Journal of Selected Topics in Signal Processing vol. 5 , ( 6 ) 1111 - 1123 .

10.1109/jstsp.2011.2162394

Mearns L, Benetos E, Dixon S ( 2011 ) . Automatically detecting key modulations in J.S. Bach chorale recordings . 8th Sound and Music Computing Conference . 25 - 32 .

Benetos E, Dixon S ( 2011 ) . Multiple-instrument polyphonic music transcription using a convolutive probabilistic model . Conference: 8th Sound and Music Computing Conference ( Padova, Italy ) from: 06/07/2011 to: 09/07/2011 , 19 - 24 .

Benetos E, Dixon S ( 2011 ) . Polyphonic Music Transcription Using Note Onset and Offset Detection . Conference: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)37 - 40 .

10.1109/icassp.2011.5946322

Dixon S, Tidhar D, Benetos E ( 2011 ) . The temperament police: The truth, the ground truth, and nothing but the truth . Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011 . Conference: 12th International Society for Music Information Retrieval Conference ( Miami, Florida, USA ) from: 24/10/2011 to: 28/10/2011 , 281 - 286 .

Anglade A, Benetos E, Mauch M, Dixon S ( 2010 ) . Improving Music Genre Classification Using Automatically Induced Harmony Rules . Journal of New Music Research vol. 39 , ( 4 ) 349 - 361 .

10.1080/09298215.2010.525654

Benetos E, Dixon S ( 2010 ) . Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution . ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition . 13 - 18 .

Benetos E, Stylianou Y ( 2010 ) . Auditory Spectrum-Based Pitched Instrument Onset Detection . IEEE Transactions on Audio Speech and Language Processing vol. 18 , ( 8 ) 1968 - 1977 .

10.1109/tasl.2010.2040785

Benetos E, Kotropoulos C ( 2010 ) . Non-Negative Tensor Factorization Applied to Music Genre Classification . IEEE Transactions on Audio Speech and Language Processing vol. 18 , ( 8 ) 1955 - 1967 .

10.1109/tasl.2010.2040784

Benetos E, Holzapfel A ( 2009 ) . Pitched instrument onset detection based on auditory spectra . Proceedings of the 10th International Society for Music Information Retrieval Conference, ISMIR 2009 . 105 - 110 .

Benetos E, Kotropoulos C ( 2008 ) . A tensor-based approach for automatic music genre classification . 16th European Signal Processing Conference .

Spachos D, Zlantintsi A, Moschou V, Antonopoulos P, Benetos E, Kotti M, Tzimouli K, Kotropoulos C et al. ( 2008 ) . MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection . 6th Language Resources and Evaluation Conference . 16 - 19 .

BENETOS E, Siatras S, Kotropoulos C, Nikolaidis N ( 2008 ) . Movie analysis with emphasis to dialogue and action scene detection . Multimodal Processing and Interaction , vol. 33 , Springer

Panagakis I, Benetos E, Kotropoulos C, Bello JP, Chew E, Turnbull D ( 2008 ) . Music Genre Classification: A Multilinear Approach . ISMIR . 583 - 588 .

Kotti M, Benetos E ( 2007 ) . A neural network approach to audio-assisted movie dialogue detection . Neurocomputing vol. 71 , ( 1-3 ) 157 - 166 .

10.1016/j.neucom.2007.08.006

Moschou V, Benetos E, Kotropoulos C ( 2007 ) . Systematic comparison of BIC-based speaker segmentation systems . Conference: 2007 IEEE 9th Workshop on Multimedia Signal Processing66 - 69 .

10.1109/mmsp.2007.4412819

Kotti M, Benetos E ( 2007 ) . Neural network-based movie dialogue detection . 10th International Conference on Engineering Applications of Neural Networks .

Benetos E, Kotti M, Kotropoulos C ( 2007 ) . Large scale musical instrument identification . 4th Sound and Music Computing Conference . 283 - 286 .

Benetos E, Kotropoulos C ( 2006 ) . Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification . 2006 IEEE International Conference on Multimedia and Expo . Conference: 2006 IEEE International Conference on Multimedia and Expo2105 - 2108 .

10.1109/icme.2006.262650

Kotti M, Martins LGPM, Cardoso JS ( 2006 ) . Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches . 2006 IEEE International Conference on Multimedia and Expo . Conference: 2006 IEEE International Conference on Multimedia and Expo1101 - 1104 .

10.1109/icme.2006.262727

Kotti M, Benetos E ( 2006 ) . Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme . 2005 IEEE International Symposium on Circuits and Systems (ISCAS) . Conference: 2006 IEEE International Symposium on Circuits and Systems4 - pp. .

10.1109/iscas.2006.1692970

Benetos E, Kotropoulos C ( 2006 ) . Musical instrument classification using non-negative matrix factorization algorithms . 2005 IEEE International Symposium on Circuits and Systems (ISCAS) . Conference: 2006 IEEE International Symposium on Circuits and Systems4 - pp. .

10.1109/iscas.2006.1692967

Benetos E, Kotti M ( 2006 ) . Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings . vol. 5 ,

Benetos E, Kotropoulos C, Lidy T ( 2006 ) . Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification . European Signal Processing Conference .

Benetos E, Kotti M, Kotropoulos C, Burred JJ, Eisenberg G, Sikora T ( 2005 ) . Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification . 2nd Workshop On Immersive Communication And Broadcast Systems .

Liang J, Benetos E, Phan H . Adapting Language-Audio Models as Few-Shot Audio Learners . Conference: INTERSPEECH 2023

10.21437/Interspeech.2023-1082

https://qmro.qmul.ac.uk/xmlui/handle/123456789/88692

Savage PE, Ampiah-Bonney A, Arabadjiev A, Arhine A, Ariza JF, Bamford JS, Barbosa BS, Beck A-K et al. . Does synchronised singing enhance social bonding more than speaking does? A global experimental Stage 1 Registered Report .

10.31234/osf.io/pv3m9

. From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems . Conference: 35th IEEE International Workshop on Machine Learning for Signal Processing

10.1109/MLSP62443.2025.11204254

https://qmro.qmul.ac.uk/xmlui/handle/123456789/108302

de Fleurian R, Clemente A, Benetos E, Pearce MT . Melodic expectation as an elicitor of music-evoked chills .

10.1101/2024.10.02.616280

Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al. . MuPT: A Generative Symbolic Music Pretrained Transformer . Conference: The Thirteenth International Conference on Learning Representations ( Singapore ) from: 23/04/2025 to: 28/04/2025 ,

10.48550/arxiv.2404.06393

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106408

Tablas De Paula P, Marttila D, Díaz R, Román I, Benetos E, Reiss JD . Sound Matching with a Differentiable Karplus-Strong Algorithm . Conference: The 29th International Conference on Digital Audio Effects ( Cambridge, MA, USA ) from: 01/09/2026 to: 04/09/2026 ,

Global main menu

Areas of study

Study at Queen Mary

Experience Queen Mary

Research and Innovation

Research by faculties and centres

Collaborations and partnerships

Publications: DR Emmanouil Benetos