Publications: Dr Iran Roman Guzman
Carone B, Roman I, Ripollés P (2026). LLMs can read music, but struggle to hear it. An evaluation of core music perception tasks. Proceedings of Machine Learning Research.
Wang L, Roman Guzman I, Xambo Sedo A (2025). Towards Real-Time, Stable Mapping from Multimodal Sensing to Interpretable Timbre Axes. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK) from: 16/12/2025 to: 16/12/2025.
Cheston H, Roman Guzman I, Stepien A, Azcarreta J, Roman AS, Chen C, Bilen Ç. AudibleLight (RC): A Controllable, End-to-End API for Soundscape Synthesis Across Ray-Traced & Real-World Measured Acoustics. https://www.qmul.ac.uk/dmrn/dmrn20/. Conference: DMRN+20 Digital Music Research Network One-day Workshop 2025 (King’s College London, Bush House, London, UK) from: 16/12/2025 to: 16/12/2025.
Carone B, Roman Guzman I, Ripollés P (2025). Evaluating Multimodal Large Language Models on Core Music Perception Tasks. Conference: 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: AI for Music.
Roman A, Roman I, Bello J (2025). Latent Acoustic Mapping for Direction of Arrival Estimation: A Self-Supervised Approach. Conference: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
Urrego-Gómez I, Colton S, Roman I (2025). Vibe Sorcery: Integrating Emotion Recognition with Generative Music for Playlist Curation. Conference: International Society for Music Information Retrieval: 1st Workshop on Large Language Models for Music & Audio (LLM4MA).
Bozilovic Z, Roman I (2025). Decoding Melodic Acoustic Features from Neural Data. Conference: AES International Conference on Artificial Intelligence and Machine Learning for Audio.
Chang A, Li Y, Roman IR, Poeppel D. Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds. Conference: Interspeech 2025, 216-220.
Zimokha M, Jamone L, Roman I (2025). Combining Recurrent & Bayesian Models for Action Anticipation with Multiple Cues. Conference: Cognitive Computational Neuroscience.
Wang A, Roman I (2025). Toward Affective Empathy in AI: Encoding Internal Representations of “Artificial Pain”. Conference: Cognitive Computational Neuroscience.
McGowan E, Rulff J, Castelo S, Wu G, Chen S, Roman IR, Dias FF, Qian J (2025). Design and Implementation of the Transparent, Interpretable, and Multimodal (TIM) AR Personal Assistant. IEEE Computer Graphics and Applications vol. 45(1), 28-42.
Pedroza H, Abreu W, Corey RM, Roman IR (2025). Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware. Conference: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.
Harding EE, Kim JC, Demos AP, Roman IR (2025). Musical neurodynamics. Nature Reviews Neuroscience vol. 26(5), 293-307.
Peng X, Chen K, Roman I (2025). Perceptually-Guided Acoustic "Foveation". Conference: 2025 IEEE Conference Virtual Reality and 3D User Interfaces (VR), 450-460.
Castelo S, Rulff J, Solunke P, McGowan E, Wu G, Roman I, Lopez R, Sun Q et al. (2024). HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems. IEEE Transactions on Visualization and Computer Graphics vol. 31(1), 119-129.
Pedroza H, Abreu W, Corey R (2024). Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling. Conference: 27th International Conference on Digital Audio Effects (DAFx24).
Roman AS, Roman IR (2024). Robust DoA Estimation from Deep Acoustic Imaging. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1321-1325.
Roman IR, Ick C, Roman AS, McFee B (2024). Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms. Conference: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1221-1225.
Castelo S, Rulff J, McGowan E, Steers B, Wu G, Chen S, Roman I, Brewer E et al. (2023). : Visualization of AI-Assisted Task Guidance in AR. IEEE Transactions on Visualization and Computer Graphics vol. 30(1), 1313-1323.
Kushwaha SS, Roman IR, Fuentes M (2023). Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. Conference: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 1-5.
Faronbi D, Roman I (2023). Exploring Approaches to Multi-Task Automatic Synthesizer Programming. Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5.
Roman IR, Roman AS, Kim JC, Large EW (2023). Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization. PLOS Computational Biology vol. 19(6).
Roman Guzman I (2023). F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study. Conference: 20th Sound and Music Computing Conference, SMC 2023.
Large EW, Roman I, Kim JC, Cannon J, Trainor LJ (2023). Dynamic models for musical rhythm perception and coordination. Frontiers in Computational Neuroscience vol. 17.
Liang BS, Liang AS, Roman I, Weiss T, Sun Q (2022). Reconstructing room scales with a single sound for augmented reality displays. Journal of Information Display vol. 24(1), 1-12.
Roman Guzman I (2022). Analyzing the effect of equal-angle spatial discretization on sound event localization and detection. Conference: Detection and Classification of Acoustic Scenes and Events 2022.
Roman Guzman I, Bello J (2021). micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets. Conference: Detection and Classification of Acoustic Scenes and Events 2021.
Walls K, Roman I, Van Ert K, Harper C, Adu-Gilmore L. Analyzing Pitch Content In Traditional Ghanaian Seperewa Songs. Conference: 1st Latin American Music Information Retrieval (LAMIR) workshop.
Carone B, Roman IR, Ripollés P, Roman Guzman I. The MUSE Benchmark: Probing Music Perception and Auditory Relational Reasoning in Audio LLMs. Conference: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) from: 04/05/2026 to: 08/05/2026.