Research

Publications: Dr Iran Roman Guzman

Carone B, Roman I, Ripolles P (2026). LLMs can read music, but struggle to hear it. An evaluation of core music perception tasks. Proceedings of Machine Learning Research
Wang L, Roman Guzman I, Xambo Sedo A (2025). Towards Real-Time, Stable Mapping from Multimodal Sensing to Interpretable Timbre Axes. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK), 16/12/2025
Cheston H, Roman Guzman I, Stepien A, Azcarreta J, Roman AS, Chen C, Bilen Ç. AudibleLight (RC): A Controllable, End-to-End API for Soundscape Synthesis Across Ray-Traced & Real-World Measured Acoustics. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK), 16/12/2025. https://www.qmul.ac.uk/dmrn/dmrn20/
Carone B, Roman Guzman I, Ripollés P (2025). Evaluating Multimodal Large Language Models on Core Music Perception Tasks. Conference: 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: AI for Music
Roman A, Roman I, Bello J (2025). Latent Acoustic Mapping for Direction of Arrival Estimation: A Self-Supervised Approach. Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Urrego-Gómez I, Colton S, Roman I (2025). Vibe Sorcery: Integrating Emotion Recognition with Generative Music for Playlist Curation. Conference: International Society for Music Information Retrieval: 1st Workshop on Large Language Models for Music & Audio (LLM4MA)
Bozilovic Z, Roman I (2025). Decoding Melodic Acoustic Features from Neural Data. Conference: AES International Conference on Artificial Intelligence and Machine Learning for Audio
Chang A, Li Y, Roman IR, Poeppel D. Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds. Conference: Interspeech 2025, 216-220.
Zimokha M, Jamone L, Roman I (2025). Combining Recurrent & Bayesian Models for Action Anticipation with Multiple Cues. Conference: Cognitive Computational Neuroscience
Wang A, Roman I (2025). Toward Affective Empathy in AI: Encoding Internal Representations of “Artificial Pain”. Conference: Cognitive Computational Neuroscience
McGowan E, Rulff J, Castelo S, Wu G, Chen S, Roman IR, Dias FF, Qian J (2025). Design and Implementation of the Transparent, Interpretable, and Multimodal (TIM) AR Personal Assistant. IEEE Computer Graphics and Applications vol. 45(1), 28-42.
Pedroza H, Abreu W, Corey RM, Roman IR (2025). Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware. Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.
Harding EE, Kim JC, Demos AP, Roman IR (2025). Musical neurodynamics. Nature Reviews Neuroscience vol. 26(5), 293-307.
Peng X, Chen K, Roman I (2025). Perceptually-Guided Acoustic "Foveation". Conference: 2025 IEEE Conference Virtual Reality and 3D User Interfaces (VR), 450-460.
Castelo S, Rulff J, Solunke P, McGowan E, Wu G, Roman I, Lopez R, Sun Q et al. (2024). HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems. IEEE Transactions on Visualization and Computer Graphics vol. 31(1), 119-129.
Pedroza H, Abreu W, Corey R (2024). Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling. Conference: 27th International Conference on Digital Audio Effects (DAFx24)
Roman AS, Roman IR (2024). Robust DoA Estimation from Deep Acoustic Imaging. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1321-1325.
Roman IR, Ick C, Roman AS, McFee B (2024). Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1221-1225.
Castelo S, Rulff J, McGowan E, Steers B, Wu G, Chen S, Roman I, Brewer E et al. (2023). : Visualization of AI-Assisted Task Guidance in AR. IEEE Transactions on Visualization and Computer Graphics vol. 30(1), 1313-1323.
Kushwaha SS, Roman IR, Fuentes M (2023). Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 1-5.
Faronbi D, Roman I (2023). Exploring Approaches to Multi-Task Automatic Synthesizer Programming. Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.
Roman IR, Roman AS, Kim JC, Large EW (2023). Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization. PLOS Computational Biology vol. 19(6)
Roman Guzman I (2023). F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study. Conference: 20th Sound and Music Computing Conference, SMC 2023
Large EW, Roman I, Kim JC, Cannon J, Trainor LJ (2023). Dynamic models for musical rhythm perception and coordination. Frontiers in Computational Neuroscience vol. 17
Liang BS, Liang AS, Roman I, Weiss T, Sun Q (2022). Reconstructing room scales with a single sound for augmented reality displays. Journal of Information Display vol. 24(1), 1-12.
Roman Guzman I (2022). Analyzing the effect of equal-angle spatial discretization on sound event localization and detection. Conference: Detection and Classification of Acoustic Scenes and Events 2022
Roman Guzman I, Bello J (2021). micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets. Conference: Detection and Classification of Acoustic Scenes and Events 2021
Walls K, Roman I, Van Ert K, Harper C, Adu-Gilmore L. Analyzing Pitch Content In Traditional Ghanaian Seperewa Songs. Conference: 1st Latin American Music Information Retrieval (LAMIR) workshop
Carone B, Roman IR, Ripolles P, Roman Guzman I. The MUSE Benchmark: Probing Music Perception and Auditory Relational Reasoning in Audio LLMs. Conference: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04/05/2026 to 08/05/2026