Publications: Iran Roman Guzman

Carone B, Roman I, Ripolles P (2026). LLMs can read music, but struggle to hear it. An evaluation of core music perception tasks. Proceedings of Machine Learning Research

https://qmro.qmul.ac.uk/xmlui/handle/123456789/115172

Wang L, Roman Guzman I, Xambo Sedo A (2025). Towards Real-Time, Stable Mapping from Multimodal Sensing to Interpretable Timbre Axes. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK), 16/12/2025

Cheston H, Roman Guzman I, Stepien A, Azcarreta J, Roman AS, Chen C, Bilen Ç. “AudibleLight (RC): A Controllable, End-to-End API for Soundscape Synthesis Across Ray-Traced & Real-World Measured Acoustics”. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK), 16/12/2025. https://www.qmul.ac.uk/dmrn/dmrn20/

https://qmro.qmul.ac.uk/xmlui/handle/123456789/120235

Carone B, Roman Guzman I, Ripollés P (2025). Evaluating Multimodal Large Language Models on Core Music Perception Tasks. Conference: 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: AI for Music

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113265

Roman A, Roman I, Bello J (2025). Latent Acoustic Mapping for Direction of Arrival Estimation: A Self-Supervised Approach. Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

https://qmro.qmul.ac.uk/xmlui/handle/123456789/109977

Urrego-Gómez I, Colton S, Roman I (2025). Vibe Sorcery: Integrating Emotion Recognition with Generative Music for Playlist Curation. Conference: International Society for Music Information Retrieval: 1st Workshop on Large Language Models for Music & Audio (LLM4MA)

Bozilovic Z, Roman I (2025). Decoding Melodic Acoustic Features from Neural Data. Conference: AES International Conference on Artificial Intelligence and Machine Learning for Audio

https://qmro.qmul.ac.uk/xmlui/handle/123456789/109975

Chang A, Li Y, Roman IR, Poeppel D. Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds. Conference: Interspeech 2025, 216-220.

10.21437/interspeech.2025-1021

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107515

Zimokha M, Jamone L, Roman I (2025). Combining Recurrent & Bayesian Models for Action Anticipation with Multiple Cues. Conference: Cognitive Computational Neuroscience

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107514

Wang A, Roman I (2025). Toward Affective Empathy in AI: Encoding Internal Representations of “Artificial Pain”. Conference: Cognitive Computational Neuroscience

https://qmro.qmul.ac.uk/xmlui/handle/123456789/107513

McGowan E, Rulff J, Castelo S, Wu G, Chen S, Roman IR, Dias FF, Qian J (2025). Design and Implementation of the Transparent, Interpretable, and Multimodal (TIM) AR Personal Assistant. IEEE Computer Graphics and Applications vol. 45, (1) 28-42.

10.1109/mcg.2025.3549696

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106601

Pedroza H, Abreu W, Corey RM, Roman IR (2025). Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware. Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.

10.1109/icassp49660.2025.10887996

https://qmro.qmul.ac.uk/xmlui/handle/123456789/103641

Harding EE, Kim JC, Demos AP, Roman IR (2025). Musical neurodynamics. Nature Reviews Neuroscience vol. 26, (5) 293-307.

10.1038/s41583-025-00915-4

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106598

Peng X, Chen K, Roman I (2025). Perceptually-Guided Acoustic "Foveation". Conference: 2025 IEEE Conference Virtual Reality and 3D User Interfaces (VR), 450-460.

10.1109/vr59515.2025.00069

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106603

Castelo S, Rulff J, Solunke P, McGowan E, Wu G, Roman I, Lopez R, Sun Q et al. (2024). HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems. IEEE Transactions on Visualization and Computer Graphics vol. 31, (1) 119-129.

10.1109/tvcg.2024.3456388

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102542

Pedroza H, Abreu W, Corey R (2024). Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling. Conference: 27th International Conference on Digital Audio Effects (DAFx24)

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102534

Roman AS, Roman IR (2024). Robust DoA Estimation from Deep Acoustic Imaging. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1321-1325.

10.1109/icassp48485.2024.10447883

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102548

Roman IR, Ick C, Roman AS, McFee B (2024). Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1221-1225.

10.1109/icassp48485.2024.10446118

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102554

Castelo S, Rulff J, McGowan E, Steers B, Wu G, Chen S, Roman I, Brewer E et al. (2023). : Visualization of AI-Assisted Task Guidance in AR. IEEE Transactions on Visualization and Computer Graphics vol. 30, (1) 1313-1323.

10.1109/tvcg.2023.3327396

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102535

Kushwaha SS, Roman IR, Fuentes M (2023). Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 1-5.

10.1109/waspaa58266.2023.10248194

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102555

Faronbi D, Roman I (2023). Exploring Approaches to Multi-Task Automatic Synthesizer Programming. Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.

10.1109/icassp49357.2023.10095540

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102538

Roman IR, Roman AS, Kim JC, Large EW (2023). Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization. PLOS Computational Biology vol. 19, (6)

10.1371/journal.pcbi.1011154

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102536

Roman Guzman I (2023). F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study. Conference: 20th Sound and Music Computing Conference, SMC 2023

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102558

Large EW, Roman I, Kim JC, Cannon J, Trainor LJ (2023). Dynamic models for musical rhythm perception and coordination. Frontiers in Computational Neuroscience vol. 17

10.3389/fncom.2023.1151895

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102540

Liang BS, Liang AS, Roman I, Weiss T, Sun Q (2022). Reconstructing room scales with a single sound for augmented reality displays. Journal of Information Display vol. 24, (1) 1-12.

10.1080/15980316.2022.2145377

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102552

Roman Guzman I (2022). Analyzing the effect of equal-angle spatial discretization on sound event localization and detection. Conference: Detection and Classification of Acoustic Scenes and Events 2022

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102559

Roman Guzman I, Bello J (2021). micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets. Conference: Detection and Classification of Acoustic Scenes and Events 2021

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102560

Walls K, Roman I, Van Ert K, Harper C, Adu-Gilmore L. Analyzing Pitch Content In Traditional Ghanaian Seperewa Songs. Conference: 1st Latin American Music Information Retrieval (LAMIR) workshop

10.5281/zenodo.14908040

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102545

Carone B, Roman IR, Ripolles P, Roman Guzman I. The MUSE Benchmark: Probing Music Perception and Auditory Relational Reasoning in Audio LLMs. Conference: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) from: 04/05/2026 to: 08/05/2026

https://qmro.qmul.ac.uk/xmlui/handle/123456789/119052
