Publications: Dr Emmanouil Benetos
(
2026
)
.
Domain-invariant representation learning of bird sounds
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Barcelona, Spain
)
from:
04/05/2026
to:
08/05/2026
,
(
2026
)
.
Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Barcelona, Spain
)
from:
04/05/2026
to:
08/05/2026
,
(
2026
)
.
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs
.
Conference:
14th International Conference on Learning Representations (ICLR)
(
Rio de Janeiro, Brazil
)
from:
23/04/2026
to:
27/04/2026
,
Mitcheltree C, Lostanlen V, Benetos E, Lagrange M
(
2026
)
.
SCRAPL: Scattering Transform with Random Paths for Machine Learning
.
Conference:
14th International Conference on Learning Representations (ICLR)
(
Rio de Janeiro, Brazil
)
from:
23/04/2026
to:
27/04/2026
,
(
2026
)
.
YuE: Scaling Open Foundation Models for Long-Form Music Generation
.
Conference:
14th International Conference on Learning Representations (ICLR)
(
Rio de Janeiro, Brazil
)
from:
23/04/2026
to:
27/04/2026
,
(
2026
)
.
Computational hermeneutics: evaluating generative AI as a cultural technology
.
Frontiers in Artificial Intelligence
vol.
9
,
Article
1753041
,
Tang X, Lei X, Zhu C, Chen S, Yuan R, Li Y, Oh C, Zhang G et al.
(
2025
)
.
AutoMV: An Automatic Multi-Agent System for Music Video Generation
.
Ma Z, Ma Y, Zhu Y, Yang C, Chao Y-W, Xu R, Chen W, Chen Y et al.
(
2025
)
.
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
.
Conference:
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)
from:
02/12/2025
to:
07/12/2025
,
Li Y, Ma Y, Ma Y, Yuan R, Zhu K, Guo H, Liang Y, Liu J et al.
(
2025
)
.
OmniBench: Towards The Future of Universal Omni-Language Models
.
Conference:
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)
from:
02/12/2025
to:
07/12/2025
,
Kim H, Benetos E, Serra X
(
2025
)
.
Velocity2DMs: A contextual modeling approach to dynamics marking prediction in piano performance
.
IEEE Signal Processing Letters
vol.
32
,
Chang SK, Dixon S, Benetos E
(
2025
)
.
RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
.
Conference:
2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2025)
(
Granlibakken Tahoe, Tahoe City, CA
)
from:
12/10/2025
to:
15/10/2025
,
Ma Y, Li S, Yu J, Benetos E, Maezawa A
(
2025
)
.
CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following
.
Conference:
26th International Society for Music Information Retrieval Conference (ISMIR)
(
Daejeon, Korea
)
from:
21/09/2025
to:
25/09/2025
,
Sarkar S, Moomjian V, Woods B, Benetos E, Sandler M
(
2025
)
.
Perceptual errors in music source separation: looking beyond SDR averages
.
Conference:
26th International Society for Music Information Retrieval Conference (ISMIR)
(
Daejeon, Korea
)
from:
21/09/2025
to:
25/09/2025
,
Bhattacharjee A, Meresman Higgs I, Sandler M, Benetos E
(
2025
)
.
Refining music sample identification with a self-supervised graph neural network
.
Conference:
26th International Society for Music Information Retrieval Conference (ISMIR 2025)
(
Daejeon, Korea
)
from:
21/09/2025
to:
25/09/2025
,
Papaioannou C, Benetos E, Potamianos A
(
2025
)
.
Universal Music Representations? Evaluating Foundation Models on World Music Corpora
.
Conference:
26th International Society for Music Information Retrieval Conference (ISMIR)
(
Daejeon, Korea
)
from:
21/09/2025
to:
25/09/2025
,
Zhang H, Liang J, Phan QH, Wang W, Benetos E
(
2025
)
.
From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems
.
Conference:
IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2025)
(
Istanbul, Turkey
)
from:
31/08/2025
to:
03/09/2025
,
Huang J, Sousa F, Demirel E, Benetos E, Gadelha I
(
2025
)
.
Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss
.
Conference:
Interspeech 2025
(
Rotterdam, The Netherlands
)
from:
17/08/2025
to:
21/08/2025
,
Plachouras C, Guinot J, Fazekas G, Quinton E, Benetos E, Pauwels J
(
2025
)
.
Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
.
Conference:
International Joint Conference on Neural Networks (IJCNN)
(
Rome, Italy
)
from:
30/06/2025
to:
05/07/2025
,
Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al.
(
2025
)
.
MuPT: A Generative Symbolic Music Pretrained Transformer
.
https://openreview.net/forum?id=iAK9oHp4Zz
.
Conference:
International Conference on Learning Representations (ICLR)
(
Singapore
)
from:
24/04/2025
to:
28/04/2025
,
Peeters G, Rafii Z, Fuentes M, Duan Z, Benetos E, Nam J, Mitsufuji Y
(
2025
)
.
Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges
.
Conference:
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
1
-
5
.
De Almeida Nolasco IS, Stowell D, Benetos E
(
2025
)
.
Acoustic identification of individual animals based on hierarchical contrastive learning
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Hyderabad, India
)
from:
06/04/2025
to:
11/04/2025
,
Singh S, Bhattacharjee A, Benetos E
(
2025
)
.
GraFPrint: A GNN-Based Approach for Audio Identification
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Hyderabad, India
)
from:
06/04/2025
to:
11/04/2025
,
Singh S, Benetos E, Phan H, Stowell D
(
2025
)
.
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Hyderabad, India
)
from:
06/04/2025
to:
11/04/2025
,
Plachouras C, Benetos E, Pauwels J
(
2025
)
.
Learning Music Audio Representations With Limited Data
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Hyderabad, India
)
from:
06/04/2025
to:
11/04/2025
,
Huang J
(
2025
)
.
Singing to speech conversion with generative flow
.
EURASIP Journal on Audio, Speech, and Music Processing
vol.
2025
,
Article
12
,
Liang J, Liu X, Wang W, Plumbley M, Phan H, Benetos E
(
2025
)
.
Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
.
IEEE Transactions on Audio, Speech and Language Processing
vol.
33
,
949
-
961
.
Papaioannou C, Benetos E
(
2025
)
.
LC-Protonets: Multi-label Few-shot learning for world music audio tagging
.
IEEE Open Journal of Signal Processing
vol.
6
,
138
-
146
.
Elisha S, McDowell A, Beguerisse-Díaz M
(
2024
)
.
Classification of spontaneous and scripted speech for multilingual audio
.
Conference:
IEEE Spoken Language Technology Workshop 2024
(
Macao, China
)
from:
02/12/2024
to:
05/12/2024
,
489
-
495
.
Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Xue W
(
2024
)
.
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
.
Conference:
25th International Society for Music Information Retrieval Conference (ISMIR)
(
San Francisco, CA, USA
)
from:
10/11/2024
to:
14/11/2024
,
Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J et al.
(
2024
)
.
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
.
Conference:
25th International Society for Music Information Retrieval Conference (ISMIR)
(
San Francisco, CA, USA
)
from:
10/11/2024
to:
14/11/2024
,
Weck B, Manco I, Benetos E, Quinton E
(
2024
)
.
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
.
Conference:
25th International Society for Music Information Retrieval Conference (ISMIR)
(
San Francisco, CA, USA
)
from:
10/11/2024
to:
14/11/2024
,
Steinmetz C, Singh S, Comunità M, Ibnyahya I, Yuan S, Benetos E, Reiss J
(
2024
)
.
ST-ITO: Controlling audio effects for style transfer with inference-time optimization
.
Conference:
25th International Society for Music Information Retrieval Conference (ISMIR)
(
San Francisco, CA, USA
)
from:
10/11/2024
to:
14/11/2024
,
Chang SK, Benetos E, Kirchhoff H, Dixon S
(
2024
)
.
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
.
Conference:
IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
(
London, UK
)
from:
22/09/2024
to:
25/09/2024
,
Torrisi A, De Almeida Nolasco IS, Versace E, Benetos E
(
2024
)
.
Exploratory analysis of early-life chick calls
.
Conference:
4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR)
(
Kos, Greece
)
from:
06/09/2024
to:
06/09/2024
,
Ma Y, Øland A, Ragni A, Del Sette BM, Saitis C, Donahue C, Lin C, Plachouras C et al.
(
2024
)
.
Foundation Models for Music: A Survey
.
Liang J, Nolasco I, Ghani B, Phan H, Benetos E, Stowell D
(
2024
)
.
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection
.
Conference:
32nd European Signal Processing Conference (EUSIPCO 2024)
(
Lyon, France
)
from:
26/08/2024
to:
30/08/2024
,
Huang J, Benetos E
(
2024
)
.
Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
.
Conference:
32nd European Signal Processing Conference (EUSIPCO)
(
Lyon, France
)
from:
26/08/2024
to:
30/08/2024
,
Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y et al.
(
2024
)
.
ChatMusician: Understanding and Generating Music Intrinsically with LLM
.
Conference:
62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
(
Bangkok, Thailand
)
from:
11/08/2024
to:
16/08/2024
,
Xompero A, Bontonou M, Arbona J-M, Benetos E
(
2024
)
.
Explaining models relating objects and privacy
.
Proceedings of CVPR 2024 Workshops
.
Conference:
3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024
(
Seattle Convention Center, Seattle WA, USA
)
from:
18/06/2024
to:
18/06/2024
,
Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W, Benetos E
(
2024
)
.
MusiLingo: bridging music and text with pre-trained language models for music captioning and query response
.
Conference:
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)
(
Mexico City, Mexico
)
from:
16/06/2024
to:
21/06/2024
,
3643
-
3655
.
Ozaki Y, Tierney A, Pfordresher PQ, McBride JM, Benetos E, Proutskova P, Chiba G, Liu F et al.
(
2024
)
.
Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report
.
Science Advances
vol.
10
,
(
20
)
Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD et al.
(
2024
)
.
WavCraft: audio editing and generation with large language models
.
Conference:
ICLR 2024 Workshop on LLM Agents
(
Vienna, Austria
)
from:
11/05/2024
to:
11/05/2024
,
Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C et al.
(
2024
)
.
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
.
Conference:
International Conference on Learning Representations (ICLR)
(
Vienna, Austria
)
from:
07/05/2024
to:
11/05/2024
,
Postolache E, Mariani G, Cosmo L
(
2024
)
.
Generalized multi-source inference for text conditioned music diffusion models
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Seoul, Korea
)
from:
14/04/2024
to:
19/04/2024
,
6980
-
6984
.
Liang J, Phan QH, Benetos E
(
2024
)
.
Learning from taxonomy: multi-label few-shot classification for everyday sound recognition
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Seoul, Korea
)
from:
14/04/2024
to:
19/04/2024
,
771
-
775
.
Li D, Ma Y, Wei W, Kong Q, Wu Y, Che M, Xia F, Benetos E et al.
(
2024
)
.
MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
(
Seoul, Korea
)
from:
14/04/2024
to:
19/04/2024
,
521
-
525
.
Edwards D, Dixon S, Benetos E, Maezawa A, Kusaka Y
(
2024
)
.
A Data-Driven Analysis of Robust Automatic Piano Transcription
.
IEEE Signal Processing Letters
vol.
31
,
681
-
685
.
Singh S, Steinmetz C, Benetos E, Phan QH, Stowell D
(
2024
)
.
ATGNN: audio tagging graph neural network
.
IEEE Signal Processing Letters
vol.
31
,
825
-
829
.
Deb O, Torr P
(
2023
)
.
Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines
.
Conference:
NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization
(
New Orleans, USA
)
from:
16/12/2023
to:
16/12/2023
,
Manco I, Weck B, Doh S, Won M, Bogdanov D, Wu Y, Tovstogan P, Benetos E et al.
(
2023
)
.
The Song Describer dataset: a corpus of audio captions for music-and-language evaluation
.
Conference:
NeurIPS Machine Learning for Audio Workshop
(
New Orleans, USA
)
from:
16/12/2023
to:
16/12/2023
,
Yuan R, Ma Y, Li Y, Zhang G, Chen X, Yin H, Zhuo L, Liu Y et al.
(
2023
)
.
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
.
Conference:
37th Conference on Neural Information Processing Systems (NeurIPS)
from:
10/12/2023
to:
16/12/2023
,
Ragano A, Benetos E
(
2023
)
.
Learning Music Representations with wav2vec 2.0
.
Conference:
31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)
(
Letterkenny, Ireland
)
from:
07/12/2023
to:
07/12/2023
,
Papaioannou C, Benetos E, Potamianos A
(
2023
)
.
From West to East: Who can understand the music of the others better?
.
Conference:
24th International Society for Music Information Retrieval Conference (ISMIR)
(
Milan, Italy
)
from:
05/11/2023
to:
09/11/2023
,
Zhuo L, Yuan R, Pan J, Ma Y, Li Y, Zhang G, Liu S, Dannenberg R et al.
(
2023
)
.
LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT
.
Conference:
24th International Society for Music Information Retrieval Conference (ISMIR)
(
Milan, Italy
)
from:
05/11/2023
to:
09/11/2023
,
Sarkar S, Thorpe L, Benetos E, Sandler M
(
2023
)
.
Leveraging Synthetic Data for Improving Chamber Ensemble Separation
.
Conference:
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
,
1
-
5
.
Vahidi C, Singh S, Benetos E, Phan H, Stowell D, Fazekas G, Lagrange M
(
2023
)
.
Perceptual Musical Similarity Metric Learning with Graph Neural Networks
.
Conference:
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
,
1
-
5
.
Edwards D, Dixon S, Benetos E
(
2023
)
.
PiJAMA: Piano Jazz with Automatic MIDI Annotations
.
Transactions of the International Society for Music Information Retrieval
vol.
6
,
(
1
)
89
-
102
.
Liang J, Liu X, Liu H, Phan H, Benetos E, Plumbley M, Wang W
(
2023
)
.
Adapting Language-Audio Models as Few-Shot Audio Learners
.
Conference:
24th Annual Conference of the International Speech Communication Association (INTERSPEECH)
(
Dublin, Ireland
)
from:
20/08/2023
to:
24/08/2023
,
Ma Y, Yuan R, Li Y, Zhang G, Chen X, Yin H, Lin C, Benetos E et al.
(
2023
)
.
On the Effectiveness of Speech Self-supervised Learning for Music
.
Ragano A, Benetos E, Chinen M, Becerra H, Chandan Karadagur Ananda R
(
2023
)
.
A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality
.
Conference:
Irish Signals & Systems Conference 2023
(
Dublin, Ireland
)
from:
13/06/2023
to:
14/06/2023
,
Ragano A, Benetos E
(
2023
)
.
Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning
.
Conference:
2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
from:
04/06/2023
to:
10/06/2023
,
1
-
5
.
Li Y, Cao W, Xie W
(
2023
)
.
Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes
.
IEEE Transactions on Multimedia
vol.
26
,
1346
-
1360
.
Wang C, Benetos E, Wang S, Versace E
(
2022
)
.
Joint Scattering for Automatic Chick Call Recognition
.
2022 30th European Signal Processing Conference (EUSIPCO)
.
Conference:
2022 30th European Signal Processing Conference (EUSIPCO)
,
195
-
199
.
Li Y, Yuan R, Zhang G, Ma Y, Lin C, Chen X, Ragni A, Yin H et al.
(
2022
)
.
Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning
.
Conference:
DMRN+17: Digital Music Research Network One-day Workshop 2022
(
London, UK
)
from:
20/12/2022
to:
20/12/2022
,
Liu L, Kong Q, Morfi G-V, Benetos E
(
2022
)
.
Performance MIDI-to-score conversion by neural beat tracking
.
Conference:
23rd International Society for Music Information Retrieval Conference (ISMIR)
(
Bengaluru, India
)
from:
04/12/2022
to:
08/12/2022
,
Sarkar S, Benetos E, Sandler M
(
2022
)
.
EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation
.
Conference:
23rd International Society for Music Information Retrieval Conference (ISMIR)
(
Bengaluru, India
)
from:
04/12/2022
to:
08/12/2022
,
Manco I, Benetos E, Fazekas G
(
2022
)
.
Contrastive audio-language learning for music
.
https://ismir2022.ismir.net/
.
Conference:
23rd International Society for Music Information Retrieval Conference (ISMIR)
(
Bengaluru, India
)
from:
04/12/2022
to:
08/12/2022
,
Mai KT, Davies T
(
2022
)
.
Explaining the decisions of anomalous sound detectors
.
https://dcase.community/workshop2022/
.
Conference:
7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)
(
Nancy, France
)
from:
03/11/2022
to:
04/11/2022
,
Liang J, Phan QH, Benetos E
(
2022
)
.
Leveraging label hierarchies for few-shot everyday sound recognition
.
Conference:
7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)
(
Nancy, France
)
from:
03/11/2022
to:
04/11/2022
,
Ozaki Y, Kuroyanagi J, McBride J, Proutskova P, Tierney A, Benetos E
(
2022
)
.
Similarities and differences in a cross-linguistic sample of song and speech recordings
.
Conference:
Joint Conference on Language Evolution
(
Kanazawa, Japan
)
from:
05/09/2022
to:
08/09/2022
,
Singh S, Benetos E, Phan QH
(
2022
)
.
Hypernetworks for sound event detection: a proof-of-concept
.
Conference:
30th European Signal Processing Conference (EUSIPCO 2022)
(
Belgrade, Serbia
)
from:
29/08/2022
to:
03/09/2022
,
429
-
433
.
Daikoku H, Ding S, Benetos E, Wood ALC, Shimizono T, Sanne US
(
2022
)
.
Agreement among human and automated estimates of similarity in a global music sample
.
Conference:
10th International Workshop on Folk Music Analysis (FMA 2022)
(
Sheffield, UK
)
from:
14/06/2022
to:
17/06/2022
,
Ou L, Guo Z, Benetos E, Han J, Wang Y
(
2022
)
.
Exploring Transformer’s Potential on Automatic Piano Transcription
.
Conference:
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
776
-
780
.
Huang J, Benetos E, Ewert S
(
2022
)
.
Improving Lyrics Alignment through Joint Pitch Detection
.
Conference:
2022 IEEE International Conference on Acoustics, Speech and Signal Processing
(
Singapore
)
from:
22/05/2022
to:
27/05/2022
,
451
-
455
.
Manco I, Benetos E, Quinton E
(
2022
)
.
Learning music audio representations via weak language supervision
.
Conference:
2022 IEEE International Conference on Acoustics, Speech and Signal Processing
(
Singapore
)
from:
22/05/2022
to:
27/05/2022
,
456
-
460
.
Ragano A, Benetos E, Hines A
(
2022
)
.
Automatic Quality Assessment of Digitized and Restored Sound Archives
.
Journal of the Audio Engineering Society
vol.
70
,
(
4
)
252
-
270
.
Wang C, Benetos E, Lostanlen V
(
2022
)
.
Adaptive Scattering Transforms for Playing Technique Recognition
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
30
,
1407
-
1421
.
Benetos E, Ragano A, Sgroi D, Tuckwell A
(
2022
)
.
Measuring national mood with music: using machine learning to construct a measure of national valence from audio data
.
Behavior Research Methods
vol.
54
,
(
6
)
3085
-
3092
.
Terenzi A, Nolasco I, Benetos E
(
2021
)
.
Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
30
,
112
-
122
.
Bodo RPP, Benetos E
(
2021
)
.
A framework for music similarity and cover song identification
.
Conference:
15th International Symposium on Computer Music Multidisciplinary Research (CMMR)
(
Tokyo, Japan
)
from:
15/11/2021
to:
19/11/2021
,
205
-
214
.
Liu L, Morfi V, Benetos E
(
2021
)
.
ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription
.
Conference:
Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference
Ozaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, Tierney AT, Proutskova P, Fukatsu H et al.
(
2021
)
.
Agreement among human and automated transcriptions of global songs
.
Conference:
22nd International Society for Music Information Retrieval Conference (ISMIR)
from:
09/11/2021
to:
12/11/2021
,
500
-
508
.
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S
(
2021
)
.
Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes
.
Conference:
22nd International Society for Music Information Retrieval Conference (ISMIR)
from:
09/11/2021
to:
12/11/2021
,
389
-
395
.
O'Hanlon K, Benetos E, Dixon S
(
2021
)
.
Detecting Cover Songs with Pitch Class Key-Invariant Networks
.
Conference:
2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)
,
1
-
6
.
Holzapfel A, Benetos E, Killick A, Widdess R
(
2021
)
.
Humanities and engineering perspectives on music transcription
.
Digital Scholarship in the Humanities
vol.
37
,
(
3
)
747
-
764
.
Bear HL, Morfi V, Benetos E
(
2021
)
.
An Evaluation of Data Augmentation Methods for Sound Scene Geotagging
.
Conference:
Interspeech 2021
,
581
-
585
.
Sarkar S, Benetos E, Sandler M
(
2021
)
.
Vocal Harmony Separation using Time-domain Neural Networks
.
Conference:
22nd Annual Conference of the International Speech Communication Association (INTERSPEECH)
(
Brno, Czech Republic
)
from:
30/08/2021
to:
03/09/2021
,
3515
-
3519
.
Zhao Y, Wang C, Fazekas G, Benetos E, Sandler M
(
2021
)
.
Violinist identification based on vibrato features
.
Conference:
2021 29th European Signal Processing Conference (EUSIPCO)
,
381
-
385
.
Manco I, Benetos E, Quinton E
(
2021
)
.
MusCaps: generating captions for music audio
.
Conference:
International Joint Conference on Neural Networks (IJCNN)
from:
18/07/2021
to:
22/07/2021
,
Cheuk KW, Luo Y-J, Benetos E, Herremans D
(
2021
)
.
Revisiting the onsets and frames model with additive attention
.
Conference:
International Joint Conference on Neural Networks (IJCNN)
from:
18/07/2021
to:
22/07/2021
,
(
2021
)
.
From Audio to Music Notation
.
Handbook of Artificial Intelligence for Music
,
Springer Nature
Ragano A, Benetos E, Hines A
(
2021
)
.
More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
.
Conference:
2021 13th International Conference on Quality of Multimedia Experience (QoMEX)
,
103
-
108
.
Liu L, Morfi G-V, Benetos E
(
2021
)
.
Joint multi-pitch detection and score transcription for polyphonic piano music
.
Conference:
IEEE International Conference on Acoustics, Speech and Signal Processing
(
Toronto, Canada
)
from:
06/06/2021
to:
11/06/2021
,
Singh S, Bear H, Benetos E
(
2021
)
.
Prototypical Networks for Domain Adaptation in Acoustic Scene Classification
.
Conference:
IEEE International Conference on Acoustics, Speech and Signal Processing
(
Toronto, Canada
)
from:
06/06/2021
to:
11/06/2021
,
Subramanian V, Gururani S, Benetos E, Sandler M
(
2021
)
.
Anomalous behaviour in loss-gradient based interpretability methods
.
Conference:
RobustML workshop paper at ICLR 2021
Cheuk KW, Benetos E, Luo Y, Herremans D
(
2021
)
.
The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy
.
Conference:
25th International Conference on Pattern Recognition (ICPR2020)
(
Milan, Italy
)
from:
10/01/2021
to:
15/01/2021
,
9091
-
9098
.
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S, Ohlsson P
(
2020
)
.
Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation
.
IEEE Signal Processing Letters
vol.
28
,
81
-
85
.
Liu L, Morfi G-V, Benetos E
(
2020
)
.
Joint Piano-roll and Score Transcription for Polyphonic Piano Music
.
Conference:
DMRN+15: Digital Music Research Network One-day Workshop
(
London, UK
)
from:
15/12/2020
to:
15/12/2020
,
Chettri B, Benetos E, Sturm BLT
(
2020
)
.
Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
28
,
3018
-
3028
.
Chettri B, Kinnunen T
(
2020
)
.
Subband modeling for spoofing detection in automatic speaker verification
.
http://www.odyssey2020.org/
.
Conference:
Odyssey 2020: The Speaker and Language Recognition Workshop
(
Tokyo, Japan
)
from:
01/11/2020
to:
05/11/2020
,
341
-
348
.
Ragano A, Benetos E
(
2020
)
.
Development of a Speech Quality Database Under Uncontrolled Conditions
.
Conference:
21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
(
Shanghai, China
)
from:
25/10/2020
to:
29/10/2020
,
Pankajakshan A, Bear H, Benetos E
(
2020
)
.
Memory Controlled Sequential Self Attention for Sound Recognition
.
Conference:
21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
(
Shanghai, China
)
from:
25/10/2020
to:
29/10/2020
,
Mishra S, Benetos E, Sturm B, Dixon S
(
2020
)
.
Reliable Local Explanations for Machine Listening
.
Conference:
International Joint Conference on Neural Networks (IJCNN)
(
Glasgow, UK
)
from:
19/07/2020
to:
24/07/2020
,
Ycart A, Liu L, Benetos E, Pearce M
(
2020
)
.
Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription
.
Transactions of the International Society for Music Information Retrieval
vol.
3
,
(
1
)
68
-
81
.
Ragano A, Benetos E
(
2020
)
.
Audio impairment recognition using a correlation-based feature representation
.
http://qomex2020.ie/
.
Conference:
12th International Conference on Quality of Multimedia Experience (QoMEX)
(
Athlone, Ireland
)
from:
26/05/2020
to:
28/05/2020
,
Subramanian V, Pankajakshan A, Benetos E, Xu N, McDonald S, Sandler M
(
2020
)
.
A Study on the Transferability of Adversarial Attacks in Sound Event Classification
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
(
Barcelona, Spain
)
from:
04/05/2020
to:
08/05/2020
,
301
-
305
.
Wei W, Zhu H, Benetos E
(
2020
)
.
A-CRNN: a domain adaptation model for sound event detection
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
(
Barcelona, Spain
)
from:
04/05/2020
to:
08/05/2020
,
276
-
280
.
Martinez Ramirez M, Benetos E, Reiss J
(
2020
)
.
Modeling plate and spring reverberation using a DSP-informed deep neural network
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
(
Barcelona, Spain
)
from:
04/05/2020
to:
08/05/2020
,
241
-
245
.
Wang C, Lostanlen V, Benetos E
(
2020
)
.
Playing Technique Recognition by Joint Time–Frequency Scattering
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
(
Barcelona, Spain
)
from:
04/05/2020
to:
08/05/2020
,
881
-
885
.
Ycart A, Liu L, Benetos E
(
2020
)
.
Musical Features for Automatic Music Transcription Evaluation
.
Ycart A, Benetos E
(
2020
)
.
Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
28
,
(
1
)
1328
-
1341
.
Chettri B, Kinnunen T
(
2020
)
.
Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification
.
Computer Speech and Language
vol.
63
,
Article
101092
,
Martinez Ramirez M, Benetos E, Reiss J
(
2020
)
.
Deep Learning for Black-Box Modeling of Audio Effects
.
Applied Sciences
vol.
10
,
(
2
)
Article
638
,
Liu L, Benetos E
(
2019
)
.
Automatic Music Accompaniment with a Chroma-based Music Data Representation
.
Conference:
DMRN+14: Digital Music Research Network One-day Workshop
Ycart A, Stoller D
(
2019
)
.
A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction
.
Conference:
20th conference of the International Society for Music Information Retrieval (ISMIR)
(
Delft, The Netherlands
)
from:
04/11/2019
to:
08/11/2019
,
470
-
477
.
Wang C, Benetos E, Lostanlen V
(
2019
)
.
Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals
.
Conference:
International Society for Music Information Retrieval Conference
(
Delft, The Netherlands
)
from:
04/11/2019
to:
08/11/2019
,
809
-
815
.
Holzapfel A
(
2019
)
.
Automatic music transcription and ethnomusicology: a user study
.
Conference:
20th conference of the International Society for Music Information Retrieval (ISMIR)
(
Delft, The Netherlands
)
from:
04/11/2019
to:
08/11/2019
,
678
-
684
.
Ycart A, McLeod A, Benetos E
(
2019
)
.
Blending acoustic and language model predictions for automatic music transcription
.
Conference:
20th conference of the International Society for Music Information Retrieval (ISMIR)
(
Delft, The Netherlands
)
from:
04/11/2019
to:
08/11/2019
,
454
-
461
.
Wang C, Benetos E
(
2019
)
.
CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis
.
Conference:
International Society for Music Information Retrieval Conference Late-Breaking Demo Session
(
Delft, The Netherlands
)
from:
04/11/2019
to:
08/11/2019
,
Singh S, Pankajakshan A
(
2019
)
.
Audio tagging using a linear noise modelling layer
.
http://dcase.community/workshop2019/
.
Conference:
4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019)
(
New York, USA
)
from:
25/10/2019
to:
26/10/2019
,
234
-
238
.
Pankajakshan A, Benetos E
(
2019
)
.
Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling
.
http://dcase.community/workshop2019/
.
Conference:
4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019)
(
New York, USA
)
from:
25/10/2019
to:
26/10/2019
,
174
-
178
.
Subramanian V, Benetos E, Sandler M
(
2019
)
.
Robustness of Adversarial Attacks in Sound Event Classification
.
http://dcase.community/workshop2019/
.
Conference:
4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019)
(
New York, USA
)
from:
25/10/2019
to:
26/10/2019
,
239
-
243
.
Bear H, Heittola T, Mesaros A, Virtanen T
(
2019
)
.
City classification from multiple real-world sound scenes
.
http://www.waspaa.com/
.
Conference:
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
(
New Paltz, NY, USA
)
from:
20/10/2019
to:
23/10/2019
,
11
-
15
.
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S
(
2019
)
.
Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation
.
http://www.waspaa.com/
.
Conference:
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
(
New Paltz, NY, USA
)
from:
20/10/2019
to:
23/10/2019
,
40
-
44
.
Pankajakshan A, Bear H
(
2019
)
.
Polyphonic sound event and sound activity detection: a multi-task approach
.
http://www.waspaa.com/
.
Conference:
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
(
New Paltz, NY, USA
)
from:
20/10/2019
to:
23/10/2019
,
318
-
322
.
Chettri B, Stoller D, Morfi V, Martinez Ramirez M
(
2019
)
.
Ensemble Models for Spoofing Detection in Automatic Speaker Verification
.
Conference:
20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
(
Graz, Austria
)
from:
15/09/2019
to:
19/09/2019
,
1018
-
1022
.
Bear H, Nolasco I
(
2019
)
.
Towards joint sound scene and polyphonic sound event recognition
.
Conference:
20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
(
Graz, Austria
)
from:
15/09/2019
to:
19/09/2019
,
4594
-
4598
.
Martinez Ramirez M, Benetos E, Reiss J
(
2019
)
.
A general-purpose deep learning approach to model time-varying audio effects
.
Conference:
International Conference on Digital Audio Effects (DAFx-19)
(
Birmingham, UK
)
from:
02/09/2019
to:
06/09/2019
,
Zhou Q, Feng Z
(
2019
)
.
Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF
.
Sensors
vol.
19
,
(
14
)
Article
3206
,
Subramanian V, Benetos E, Xu N, McDonald S, Sandler MB
(
2019
)
.
Adversarial Attacks in Sound Event Classification
.
Covas E
(
2019
)
.
Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting
.
Chaos
vol.
29
,
(
6
)
Article
063111
,
Ragano A, Benetos E
(
2019
)
.
Adapting the Quality of Experience Framework for Audio Archive Evaluation
.
https://www.qomex2019.de/
.
Conference:
11th International Conference on Quality of Multimedia Experience
(
Berlin, Germany
)
from:
05/06/2019
to:
07/06/2019
,
Wang C, Benetos E, Meng X
(
2019
)
.
HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute
.
Proceedings of Sound and Music Computing Conference
.
Conference:
Sound and Music Computing Conference
(
Malaga, Spain
)
from:
28/05/2019
to:
31/05/2019
,
545
-
550
.
Lins F, Johann M, Benetos E
(
2019
)
.
Automatic Transcription of Diatonic Harmonica Recordings
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing
(
Brighton, UK
)
from:
12/05/2019
to:
17/05/2019
,
Phaye SSR, Benetos E, Wang Y
(
2019
)
.
SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing
(
Brighton, UK
)
from:
12/05/2019
to:
17/05/2019
,
Mishra S, Stoller D, Benetos E, Sturm B, Dixon S
(
2019
)
.
GAN-based Generation and Automatic Selection of Explanations for Neural Networks
.
https://sites.google.com/view/safeml-iclr2019
.
Conference:
SafeML ICLR 2019 Workshop
(
New Orleans, USA
)
from:
06/05/2019
to:
06/05/2019
,
Nolasco I, Terenzi A, Cecchi S, Orcioni S
(
2019
)
.
Audio-based identification of beehive states
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing
(
Brighton, UK
)
from:
12/05/2019
to:
17/05/2019
,
Benetos E, Dixon S, Duan Z, Ewert S
(
2019
)
.
Automatic Music Transcription: An Overview
.
IEEE Signal Processing Magazine
vol.
36
,
(
1
)
20
-
30
.
Chettri B, Mishra S, Sturm B, Benetos E
(
2018
)
.
Analysing the predictions of a CNN-based replay spoofing detection system
.
http://www.slt2018.org/
.
Conference:
2018 IEEE Workshop on Spoken Language Technology
(
Athens, Greece
)
from:
18/12/2018
to:
21/12/2018
,
92
-
97
.
Bear H
(
2018
)
.
An extensible cluster-graph taxonomy for open set sound scene analysis
.
http://dcase.community/workshop2018/
.
Conference:
Workshop on Detection and Classification of Acoustic Scenes and Events
(
Surrey, UK
)
from:
19/11/2018
to:
20/11/2018
,
Nolasco I, Benetos E
(
2018
)
.
To bee or not to bee: Investigating machine learning approaches for beehive sound recognition
.
http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Nolasco_131.pdf
.
Conference:
2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018)
(
Surrey, UK
)
from:
19/11/2018
to:
20/11/2018
,
Ycart A
(
2018
)
.
A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations
.
Conference:
19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session
(
Paris, France
)
from:
23/09/2018
to:
27/09/2018
,
Wang C, Benetos E, Meng X
(
2018
)
.
Towards HMM-based glissando detection for recordings of Chinese bamboo flute
.
http://ismir2018.ircam.fr/pages/events-lbd.html
.
Conference:
International Society for Music Information Retrieval Conference Late-Breaking Demos Session
(
Paris, France
)
from:
23/09/2018
to:
27/09/2018
,
Chettri B, Sturm BLT, Benetos E
(
2018
)
.
Analysing replay spoofing countermeasure performance under varied conditions
.
Conference:
IEEE International Workshop on Machine Learning for Signal Processing
(
Aalborg, Denmark
)
from:
17/09/2018
to:
20/09/2018
,
Ali H, Tran SN, d'Avila Garcez AS
(
2018
)
.
Speaker recognition with hybrid features from a deep belief network
.
Neural Computing and Applications
vol.
29
,
(
6
)
13
-
19
.
Chettri B, Mishra S, Sturm BL
(
2018
)
.
A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing
.
Ycart A
(
2018
)
.
Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks
.
Conference:
IEEE International Conference on Acoustics, Speech and Signal Processing
(
Calgary, Canada
)
from:
15/04/2018
to:
20/04/2018
,
386
-
390
.
Nakamura E, Benetos E, Yoshii K, Dixon S
(
2018
)
.
Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization
.
Conference:
IEEE International Conference on Acoustics, Speech and Signal Processing
(
Calgary, Canada
)
from:
15/04/2018
to:
20/04/2018
,
101
-
105
.
Valero-Mas JJ, Benetos E, Iñesta JM
(
2018
)
.
A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription
.
Journal of New Music Research
vol.
47
,
(
3
)
249
-
263
.
Mesaros A, Heittola T, Benetos E, Foster P, Lagrange M
(
2018
)
.
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
26
,
(
2
)
379
-
393
.
Panteli M, Benetos E, Dixon S
(
2018
)
.
A review of manual and computational approaches for the study of world music corpora
.
Journal of New Music Research
vol.
47
,
(
2
)
176
-
189
.
Benetos E, Stowell D, Plumbley M, Virtanen T, Plumbley M, Ellis D
(
2018
)
.
Approaches to complex sound scene analysis
.
Computational Analysis of Sound Scenes and Events
,
Edition.
1
,
Springer International Publishing
(
2018
)
.
Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018
.
ISMIR
.
Panteli M, Benetos E, Dixon S
(
2017
)
.
A computational study on outliers in world music
.
PLoS ONE
vol.
12
,
(
12
)
Article
e0189399
,
1
-
28
.
McLeod A, Steedman M, Benetos E
(
2017
)
.
Automatic Transcription of Polyphonic Vocal Music
.
Applied Sciences
vol.
7
,
(
12
)
Article
1285
,
Ycart A, Benetos E
(
2017
)
.
A study on LSTM networks for polyphonic music sequence modelling
.
Conference:
18th International Society for Music Information Retrieval Conference (ISMIR 2017)
(
Suzhou, China
)
from:
23/10/2017
to:
27/10/2017
,
421
-
427
.
Schramm R, McLeod A, Benetos E
(
2017
)
.
Multi-pitch detection and voice assignment for a cappella recordings of multiple singers
.
Conference:
18th International Society for Music Information Retrieval Conference (ISMIR 2017)
(
Suzhou, China
)
from:
23/10/2017
to:
27/10/2017
,
552
-
559
.
Lafay G, Lagrange M
(
2017
)
.
Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results
.
http://www.waspaa.com/
.
Conference:
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017)
(
New Paltz, NY, USA
)
from:
15/10/2017
to:
18/10/2017
,
11
-
15
.
Ycart A, Benetos E
(
2017
)
.
Neural Music Language Models: investigating the training process
.
Conference:
International Conference of Students of Systematic Musicology
Valero-Mas JJ, Benetos E
(
2017
)
.
Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription
.
Conference:
2017 AES International Conference on Semantic Audio
(
Erlangen, Germany
)
from:
22/06/2017
to:
24/06/2017
,
Schramm R
(
2017
)
.
Automatic Transcription of a Cappella Recordings from Multiple Singers
.
Conference:
2017 AES International Conference on Semantic Audio
(
Erlangen, Germany
)
from:
22/06/2017
to:
24/06/2017
,
Benetos E
(
2017
)
.
Polyphonic note and instrument tracking using linear dynamical systems
.
Conference:
2017 AES International Conference on Semantic Audio
(
Erlangen, Germany
)
from:
22/06/2017
to:
24/06/2017
,
Stowell D, Benetos E, Gill LF
(
2017
)
.
On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
25
,
(
6
)
1193
-
1206
.
Benetos E, Lafay G, Plumbley MD
(
2017
)
.
Polyphonic Sound Event Tracking using Linear Dynamical Systems
.
IEEE/ACM Transactions on Audio, Speech and Language Processing
vol.
25
,
(
6
)
1266
-
1277
.
Russell AJ, Benetos E
(
2017
)
.
On the Memory Properties of Recurrent Neural Models
.
Conference:
International Joint Conference on Neural Networks (IJCNN 2017)
(
Anchorage, Alaska, USA
)
from:
14/05/2017
to:
19/05/2017
,
2596
-
2603
.
Abdallah S, Benetos E, Gold N, Hargreaves S
(
2017
)
.
The Digital Music Lab: A Big Data Infrastructure for Digital Musicology
.
ACM Journal on Computing and Cultural Heritage
vol.
10
,
(
1
)
Benetos E
(
2016
)
.
Automatic Transcription of Vocal Quartets
.
DMRN+11: Digital Music Research Network Workshop Proceedings 2016
.
Conference:
DMRN+11: Digital Music Research Network One-day Workshop 2016
(
Centre for Digital Music, Queen Mary University of London
)
from:
20/12/2016
to:
20/12/2016
,
Ycart A, Benetos E
(
2016
)
.
Towards a Music Language Model for Audio Analysis
.
DMRN+11: Digital Music Research Network Workshop Proceedings 2016
.
Conference:
DMRN+11: Digital Music Research Network One-day Workshop 2016
(
Centre for Digital Music, Queen Mary University of London
)
from:
20/12/2016
to:
20/12/2016
,
Valero-Mas JJ, Benetos E
(
2016
)
.
Classification-based Note Tracking for Automatic Music Transcription
.
https://sites.google.com/site/musicmachinelearning16/proceedings
.
Conference:
9th International Workshop on Machine Learning and Music
(
Riva del Garda, Italy
)
from:
23/09/2016
to:
23/09/2016
,
61
-
65
.
Abdallah S, Gold N, Hargreaves S, Weyde T, Wolff D
(
2016
)
.
Digital Music Lab: A Framework for Analysing Big Music Data
.
Conference:
24th European Signal Processing Conference
(
Budapest, Hungary
)
from:
29/08/2016
to:
02/09/2016
,
1118
-
1122
.
Cheng T, Mauch M, Benetos E, Dixon S
(
2016
)
.
An attack/decay model for piano transcription
.
Conference:
17th International Society for Music Information Retrieval Conference
(
New York, USA
)
from:
07/08/2016
to:
11/08/2016
,
584
-
590
.
Panteli M, Benetos E, Dixon S
(
2016
)
.
Learning a feature space for similarity in world music
.
Conference:
17th International Society for Music Information Retrieval Conference
(
New York, USA
)
from:
07/08/2016
to:
11/08/2016
,
538
-
544
.
Holzapfel A, Benetos E
(
2016
)
.
The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes
.
Conference:
17th International Society for Music Information Retrieval Conference
(
New York, USA
)
from:
07/08/2016
to:
11/08/2016
,
531
-
537
.
Lafay G, Lagrange M, Rossignol M, Benetos E
(
2016
)
.
A morphological model for simulating acoustic scenes and its application to sound event detection
.
IEEE/ACM Transactions on Audio, Speech, and Language Processing
vol.
24
,
(
10
)
1854
-
1864
.
Panteli M, Benetos E, Dixon S
(
2016
)
.
Automatic detection of outliers in world music collections
.
Conference:
Fourth International Conference on Analytical Approaches to World Music (AAWM 2016)
(
New York, USA
)
from:
08/06/2016
to:
11/06/2016
,
Benetos E, Lafay G, Lagrange M, Plumbley MD
(
2016
)
.
Detection of Overlapping Acoustic Events Using a Temporally-Constrained Probabilistic Model
.
Conference:
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
6450
-
6454
.
Sigtia S, Benetos E, Dixon S
(
2016
)
.
An End-to-End Neural Network for Polyphonic Piano Music Transcription
.
IEEE/ACM Transactions on Audio, Speech, and Language Processing
vol.
24
,
(
5
)
927
-
939
.
Benetos E
(
2015
)
.
An efficient temporally-constrained probabilistic model for multiple-instrument music transcription
.
http://ismir2015.uma.es/docs/ISMIR2015_Proceedings.pdf
.
Conference:
16th International Society for Music Information Retrieval Conference (ISMIR)
(
Malaga, Spain
)
from:
26/10/2015
to:
30/10/2015
,
701
-
707
.
Benetos E, Holzapfel A
(
2015
)
.
Automatic transcription of Turkish microtonal music
.
Journal of the Acoustical Society of America
vol.
138
,
(
4
)
2118
-
2130
.
Stowell D, Giannoulis D, Benetos E, Lagrange M, Plumbley MD
(
2015
)
.
Detection and Classification of Acoustic Scenes and Events
.
IEEE Transactions on Multimedia
vol.
17
,
(
10
)
1733
-
1746
.
Rossignol M, Lagrange M, Lafay G
(
2015
)
.
Alternate level clustering for drum transcription
.
Conference:
23rd European Signal Processing Conference (EUSIPCO)
(
Nice, France
)
from:
31/08/2015
to:
04/09/2015
,
2068
-
2072
.
Abdallah S, Alencar-Brayner A, Benetos E, Cottrell S, Dykes J, Gold N, Kachkaev A, Tidhar D
(
2015
)
.
Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection
.
http://fma2015.sciencesconf.org/conference/fma2015/FMA2015_OfficialProceedings.pdf
.
Conference:
5th International Workshop on Folk Music Analysis
(
Paris, France
)
from:
10/06/2015
to:
12/06/2015
,
10
-
12
.
Sigtia S, Benetos E, Boulanger-Lewandowski N, Weyde T, Garcez ASDA, Dixon S
(
2015
)
.
A Hybrid Recurrent Neural Network for Music Transcription
.
IEEE International Conference on Acoustics Speech and Signal Processing
.
Conference:
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
(
Brisbane, Australia
)
from:
19/04/2015
to:
24/04/2015
,
2061
-
2065
.
Benetos E, Badeau R, Weyde T
(
2014
)
.
Template Adaptation for Improving Automatic Music Transcription
.
http://www.terasoft.com.tw/conf/ismir2014//proceedings%5CISMIR2014_Proceedings.pdf
.
Conference:
15th International Society for Music Information Retrieval Conference (ISMIR)
(
Taipei, Taiwan
)
from:
27/10/2014
to:
31/10/2014
,
175
-
180
.
Tidhar D, Dixon S, Benetos E, Weyde T
(
2014
)
.
The temperament police
.
Early Music
vol.
42
,
(
4
)
579
-
590
.
Weyde T, Cottrell S, Dykes J, Benetos E, Wolff D, Tidhar D, Gold N, Abdallah S et al.
(
2014
)
.
Big Data for Musicology
.
Conference:
1st International Digital Libraries for Musicology workshop
(
London, UK
)
from:
12/09/2014
to:
12/09/2014
,
Wolff D, Tidhar D, Benetos E, Dumon E, Cherla S, Page K, Fields B
(
2014
)
.
Incremental dataset definition for large scale musicological research
.
Conference:
1st International Digital Libraries for Musicology workshop
(
London, UK
)
from:
12/09/2014
to:
12/09/2014
,
Tran S, Benetos E, d'Avila Garcez A
(
2014
)
.
Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition
.
Conference:
2014 International Joint Conference on Neural Networks (IJCNN)
(
Beijing, China
)
from:
06/07/2014
to:
11/07/2014
,
2123
-
2129
.
Benetos E, Holzapfel A
(
2014
)
.
Incorporating pitch class profiles for improving automatic transcription of Turkish makam music
.
Proceedings of the Fourth International Workshop on Folk Music Analysis (FM
.
Conference:
4th International Workshop on Folk Music Analysis
(
Istanbul, Turkey
)
from:
12/06/2014
to:
13/06/2014
,
15
-
20
.
Giannoulis D, Benetos E, Klapuri A
(
2014
)
.
Improving instrument recognition in polyphonic music through system integration
.
Conference:
IEEE International Conference on Acoustics, Speech, and Signal Processing
(
Florence, Italy
)
from:
04/05/2014
to:
09/05/2014
,
5259
-
5263
.
Benetos E, Weyde T
(
2014
)
.
Improving automatic music transcription through key detection
.
http://www.aes.org/conferences/53/technical_programme.cfm
.
Conference:
AES 53rd International Conference on Semantic Audio
(
London, UK
)
from:
27/01/2014
to:
29/01/2014
,
Benetos E, Ewert S, Weyde T
(
2014
)
.
Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music
.
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
.
3131
-
3135
.
Sigtia S, Benetos E, Cherla S, Weyde T, Garcez A, Dixon S
(
2014
)
.
RNN-based Music Language Models for Improving Automatic Music Transcription
.
15th International Society for Music Information Retrieval Conference
.
53
-
58
.
Barthet M, Benetos E, Cottrell S, Dixon S, Dykes J, Gold N, Mahey M, Plumbley MD et al.
(
2014
)
.
The DML Research Project: Digital Music Lab - Analysing Big Music Data
.
Presented at:
Workshop on "Big Data: Challenges and Applications", Imperial College, London
,
Benetos E, Holzapfel A
(
2013
)
.
Automatic transcription of Turkish makam music
.
Conference:
14th International Society for Music Information Retrieval Conference
(
Curitiba, PR, Brazil
)
from:
04/11/2013
to:
08/11/2013
,
355
-
360
.
Benetos E, Weyde T
(
2013
)
.
Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription
.
Conference:
14th International Society for Music Information Retrieval Conference
(
Curitiba, PR, Brazil
)
from:
04/11/2013
to:
08/11/2013
,
269
-
274
.
de Valk R, Weyde T, Britto AS, Gouyon F, Dixon S
(
2013
)
.
A machine learning approach to voice separation in lute tablature
.
Conference:
14th International Society for Music Information Retrieval Conference
(
Curitiba, PR, Brazil
)
from:
04/11/2013
to:
08/11/2013
,
555
-
560
.
Giannoulis D, Benetos E, Stowell D, Rossignol M, Lagrange M, Plumbley MD
(
2013
)
.
Detection and Classification of Acoustic Scenes and Events: An IEEE AASP Challenge
.
Conference:
2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
,
1
-
4
.
Giannoulis D, Stowell D, Benetos E, Rossignol M, Lagrange M, Plumbley MD
(
2013
)
.
A database and challenge for acoustic scene classification and event detection
.
Conference:
21st European Signal Processing Conference
(
Marrakech, Morocco
)
Benetos E, Cherla S
(
2013
)
.
An efficient shift-invariant model for polyphonic music transcription
.
Conference:
6th International Workshop on Machine Learning and Music
(
Prague, Czech Republic
)
Benetos E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A
(
2013
)
.
Automatic music transcription: challenges and future directions
.
Journal of Intelligent Information Systems
vol.
41
,
(
3
)
407
-
434
.
Serra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gomez E, Gouyon F et al.
(
2013
)
.
Roadmap for Music Information ReSearch
.
The MIReS Consortium
Benetos E, Dixon S
(
2013
)
.
Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model
.
The Journal of the Acoustical Society of America
vol.
133
,
(
3
)
1727
-
1741
.
Benetos E, Dixon S
(
2012
)
.
A Shift-Invariant Latent Variable Model for Automatic Music Transcription
.
Computer Music Journal
vol.
36
,
(
4
)
81
-
94
.
Benetos E, Dixon S, Giannoulis D, Kirchhoff H, Klapuri A
(
2012
)
.
Automatic Music Transcription: Breaking the Glass Ceiling
.
Conference:
13th International Society for Music Information Retrieval Conference (ISMIR 2012)
(
Porto, Portugal
)
from:
08/10/2012
to:
12/10/2012
,
379
-
384
.
Zijlstra A, Mancini M, Lindemann U, Chiari L, Zijlstra W
(
2012
)
.
Sit-stand and stand-sit transitions in older adults and patients with Parkinson’s disease: event detection based on motion sensors versus force plates
.
Journal of NeuroEngineering and Rehabilitation
vol.
9
,
(
1
)
Benetos E, Lagrange M, Dixon S
(
2012
)
.
Characterisation of acoustic scenes using a temporally-constrained shift-invariant model
.
15th International Conference on Digital Audio Effects, DAFx 2012 Proceedings
.
Benetos E, Klapuri A, Dixon S
(
2012
)
.
Score-informed transcription for automatic piano tutoring
.
Conference:
20th European Signal Processing Conference
(
Bucharest, Romania
)
2153
-
2157
.
Benetos E, Dixon S
(
2012
)
.
Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection
.
Lecture Notes in Computer Science
.
vol.
7191
,
364
-
371
.
Benetos E, Dixon S
(
2011
)
.
A Temporally-Constrained Convolutive Probabilistic Model for Pitch Detection
.
Conference:
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
,
133
-
136
.
Benetos E, Dixon S
(
2011
)
.
Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription
.
IEEE Journal of Selected Topics in Signal Processing
vol.
5
,
(
6
)
1111
-
1123
.
Mearns L, Benetos E, Dixon S
(
2011
)
.
Automatically detecting key modulations in J.S. Bach chorale recordings
.
8th Sound and Music Computing Conference
.
25
-
32
.
Benetos E, Dixon S
(
2011
)
.
Multiple-instrument polyphonic music transcription using a convolutive probabilistic model
.
Conference:
8th Sound and Music Computing Conference
(
Padova, Italy
)
from:
06/07/2011
to:
09/07/2011
,
19
-
24
.
Benetos E, Dixon S
(
2011
)
.
Polyphonic Music Transcription Using Note Onset and Offset Detection
.
Conference:
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
37
-
40
.
Dixon S, Tidhar D, Benetos E
(
2011
)
.
The temperament police: The truth, the ground truth, and nothing but the truth
.
Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
.
Conference:
12th International Society for Music Information Retrieval Conference
(
Miami, Florida, USA
)
from:
24/10/2011
to:
28/10/2011
,
281
-
286
.
Anglade A, Benetos E, Mauch M, Dixon S
(
2010
)
.
Improving Music Genre Classification Using Automatically Induced Harmony Rules
.
Journal of New Music Research
vol.
39
,
(
4
)
349
-
361
.
Benetos E, Dixon S
(
2010
)
.
Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution
.
ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition
.
13
-
18
.
Benetos E, Stylianou Y
(
2010
)
.
Auditory Spectrum-Based Pitched Instrument Onset Detection
.
IEEE Transactions on Audio Speech and Language Processing
vol.
18
,
(
8
)
1968
-
1977
.
Benetos E, Kotropoulos C
(
2010
)
.
Non-Negative Tensor Factorization Applied to Music Genre Classification
.
IEEE Transactions on Audio Speech and Language Processing
vol.
18
,
(
8
)
1955
-
1967
.
Benetos E, Holzapfel A
(
2009
)
.
Pitched instrument onset detection based on auditory spectra
.
Proceedings of the 10th International Society for Music Information Retrieval Conference, ISMIR 2009
.
105
-
110
.
Benetos E, Kotropoulos C
(
2008
)
.
A tensor-based approach for automatic music genre classification
.
16th European Signal Processing Conference
.
Spachos D, Zlantintsi A, Moschou V, Antonopoulos P, Benetos E, Kotti M, Tzimouli K, Kotropoulos C et al.
(
2008
)
.
MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection
.
6th Language Resources and Evaluation Conference
.
16
-
19
.
Benetos E, Siatras S, Kotropoulos C, Nikolaidis N
(
2008
)
.
Movie analysis with emphasis to dialogue and action scene detection
.
Multimodal Processing and Interaction
,
vol.
33
,
Springer
Panagakis I, Benetos E, Kotropoulos C, Bello JP, Chew E, Turnbull D
(
2008
)
.
Music Genre Classification: A Multilinear Approach
.
ISMIR
.
583
-
588
.
Kotti M, Benetos E, Kotropoulos C, Pitas I
(
2007
)
.
A neural network approach to audio-assisted movie dialogue detection
.
Neurocomputing
vol.
71
,
(
1-3
)
157
-
166
.
Moschou V, Kotti M, Benetos E, Kotropoulos C
(
2007
)
.
Systematic comparison of BIC-based speaker segmentation systems
.
Conference:
2007 IEEE 9th Workshop on Multimedia Signal Processing
,
66
-
69
.
Kotti M, Benetos E
(
2007
)
.
Neural network-based movie dialogue detection
.
10th International Conference on Engineering Applications of Neural Networks
.
Benetos E, Kotti M, Kotropoulos C
(
2007
)
.
Large scale musical instrument identification
.
4th Sound and Music Computing Conference
.
283
-
286
.
Benetos E, Kotti M, Kotropoulos C
(
2006
)
.
Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification
.
2006 IEEE International Conference on Multimedia and Expo
.
Conference:
2006 IEEE International Conference on Multimedia and Expo
,
2105
-
2108
.
Kotti M, Martins LGPM, Benetos E, Cardoso JS, Kotropoulos C
(
2006
)
.
Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches
.
2006 IEEE International Conference on Multimedia and Expo
.
Conference:
2006 IEEE International Conference on Multimedia and Expo
,
1101
-
1104
.
Kotti M, Benetos E, Kotropoulos C
(
2006
)
.
Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme
.
2005 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Conference:
2006 IEEE International Symposium on Circuits and Systems
,
4 pp.
.
Benetos E, Kotti M, Kotropoulos C
(
2006
)
.
Musical instrument classification using non-negative matrix factorization algorithms
.
2005 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Conference:
2006 IEEE International Symposium on Circuits and Systems
,
4 pp.
.
Benetos E, Kotti M
(
2006
)
.
Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection
.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
.
vol.
5
,
Benetos E, Kotropoulos C, Lidy T
(
2006
)
.
Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification
.
European Signal Processing Conference
.
Benetos E, Kotti M, Kotropoulos C, Burred JJ, Eisenberg G, Sikora T
(
2005
)
.
Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification
.
2nd Workshop On Immersive Communication And Broadcast Systems
.
Liang J, Benetos E, Phan H
.
Adapting Language-Audio Models as Few-Shot Audio Learners
.
Conference:
INTERSPEECH 2023
Savage PE, Ampiah-Bonney A, Arabadjiev A, Arhine A, Ariza JF, Bamford JS, Barbosa BS, Beck A-K et al.
.
Does synchronised singing enhance social bonding more than speaking does? A global experimental Stage 1 Registered Report
.
.
From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems
.
Conference:
35th IEEE International Workshop on Machine Learning for Signal Processing
de Fleurian R, Clemente A, Benetos E, Pearce MT
.
Melodic expectation as an elicitor of music-evoked chills
.
Qu X, Bai Y, Ma Y, Zhou Z, Lo KM, Liu J, Yuan R, Min L et al.
.
MuPT: A Generative Symbolic Music Pretrained Transformer
.
Conference:
The Thirteenth International Conference on Learning Representations
(
Singapore
)
from:
23/04/2025
to:
28/04/2025
,