Publications: Georgios Tzimiropoulos

Bulat A, Ouali Y, Tzimiropoulos G ( 2025 ) . Compress & Cache: Vision token compression for efficient generation and retrieval . Conference: NeurIPS 2025. The Thirty-Ninth Annual Conference on Neural Information Processing Systems.

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113711

Hadji I, Noroozi M, Escorcia V, Zaganidis A, Martinez B, Tzimiropoulos G ( 2025 ) . Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 12789 - 12798 .

10.1109/cvpr52734.2025.01193

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106406

Yang H, Bulat A, Hadji I, Pham HX, Zhu X, Tzimiropoulos G, Martinez B ( 2025 ) . FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 2459 - 2468 .

10.1109/cvpr52734.2025.00235

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106404

Ouali Y, Bulat A, Xenos A, Zaganidis A, Metaxas IM, Martinez B, Tzimiropoulos G ( 2025 ) . VladVA: Discriminative Fine-tuning of LVLMs . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 4101 - 4111 .

10.1109/cvpr52734.2025.00388

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106405

Ntinou I, Xenos A, Ouali Y, Bulat A, Tzimiropoulos G ( 2025 ) . Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions . Conference: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing14057 - 14073 .

10.18653/v1/2025.emnlp-main.709

https://qmro.qmul.ac.uk/xmlui/handle/123456789/113709

Ntinou I, Sanchez E ( 2024 ) . Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD . Conference: 2024 IEEE International Conference on Image Processing (ICIP) vol. 00 , 458 - 464 .

10.1109/icip51287.2024.10647706

Tan F, Lee R, Dudziak Ł, Hu SX, Bhattacharya S, Hospedales T, Tzimiropoulos G, Martinez B ( 2024 ) . MobileQuant: Mobile-friendly Quantization for On-device Language Models .

10.48550/arxiv.2408.13933

Maniadis Metaxas I, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing . Conference: European Conference on Computer Vision 2024 from: 29/09/2024 to: 04/10/2024 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/99679

Metaxas IM, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing .

10.48550/arxiv.2407.11168

Patras I, Song S, Sun Z, Tzimiropoulos G ( 2024 ) . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition . Conference: Advances in Neural Information Processing Systems 3735612 - 35638 .

10.52202/079017-1123

https://qmro.qmul.ac.uk/xmlui/handle/123456789/100861

Bulat A, Ouali Y, Guerrero R, Martinez B ( 2024 ) . Efficient Vision-Language pre-training via domain-specific learning for human activities . Conference: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing7978 - 8000 .

10.18653/v1/2024.emnlp-main.454

https://qmro.qmul.ac.uk/xmlui/handle/123456789/100862

Bulat A, Ouali Y, Tzimiropoulos G ( 2024 ) . QBB: Quantization with Binary Bases for LLMs . Conference: Advances in Neural Information Processing Systems 373209 - 3228 .

10.52202/079017-0105

https://qmro.qmul.ac.uk/xmlui/handle/123456789/100860

Bulat A, Tzimiropoulos G ( 2023 ) . Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V &L Models . International Journal of Computer Vision vol. 132 , ( 4 ) 1108 - 1125 .

10.1007/s11263-023-01904-9

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92248

Derakhshani MM, Sanchez E, Bulat A, Da Costa VGT, Martinez B ( 2023 ) . Bayesian Prompt Learning for Image-Language Model Generalization . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 15191 - 15200 .

10.1109/iccv51070.2023.01398

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91869

Ouali Y, Bulat A, Matinez B ( 2023 ) . Black Box Few-Shot Adaptation for Vision-Language models . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 15488 - 15500 .

10.1109/iccv51070.2023.01424

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91844

Bulat A, Guerrero R, Tzimiropoulos G ( 2023 ) . FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 11759 - 11768 .

10.1109/iccv51070.2023.01083

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91846

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 7115 - 7125 .

10.1109/iccv51070.2023.00657

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91848

Bulat A, Martinez B ( 2023 ) . ReGen: A good Generative zero-shot video classifier should be Rewarded . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 13477 - 13487 .

10.1109/iccv51070.2023.01244

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91847

Bulat A ( 2023 ) . LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 23232 - 23241 .

10.1109/cvpr52729.2023.02225

https://qmro.qmul.ac.uk/xmlui/handle/123456789/87745

Mallis D, Sanchez E, Bell M ( 2023 ) . From Keypoints to Object Landmarks via Self-Training Correspondence: A Novel Approach to Unsupervised Landmark Discovery . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 45 , ( 7 ) 8390 - 8404 .

10.1109/tpami.2023.3234212

https://qmro.qmul.ac.uk/xmlui/handle/123456789/85042

Bounareli S, Tzimiropoulos G ( 2022 ) . Finding Directions in GAN’s Latent Space for Neural Face Reenactment . Conference: British Machine Vision Conference

https://qmro.qmul.ac.uk/xmlui/handle/123456789/83794

Sun Z ( 2022 ) . Part-based Face Recognition with Vision Transformers . Conference: British Machine Vision Conference

https://qmro.qmul.ac.uk/xmlui/handle/123456789/83793

Pan J, Bulat A, Tan F, Zhu X, Li H ( 2022 ) . EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers . Lecture Notes in Computer Science . vol. 13671 , 294 - 311 .

10.1007/978-3-031-20083-0_18

https://qmro.qmul.ac.uk/xmlui/handle/123456789/82107

Bulat A, Cheng S, Yang J, Garbett A ( 2022 ) . Pre-training Strategies and Datasets for Facial Representation Learning . Lecture Notes in Computer Science . vol. 13673 , 107 - 125 .

10.1007/978-3-031-19778-9_7

https://qmro.qmul.ac.uk/xmlui/handle/123456789/82108

Bulat A, Perez-Rua J-M, Tzimiropoulos G ( 2021 ) . Space-time Mixing Attention for Video Transformer . Conference: Thirty-fifth Conference on Neural Information Processing Systems

https://qmro.qmul.ac.uk/xmlui/handle/123456789/75587

Bulat A ( 2021 ) . Bit-Mixer: Mixed-precision networks with runtime bit-width selection . Conference: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 5168 - 5177 .

10.1109/iccv48922.2021.00514

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74426

Tzelepis C, Tzimiropoulos G, Patras I ( 2021 ) . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space . Conference: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 6373 - 6382 .

10.1109/iccv48922.2021.00633

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74209

Sanchez E, Valstar M, Tzimiropoulos G ( 2021 ) . Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition . Conference: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 9070 - 9080 .

10.1109/cvpr46437.2021.00896

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72300

Bulat A, Tzimiropoulos G ( 2021 ) . High-Capacity Expert Binary Networks . Conference: International Conference on Learning Representations (ICLR)

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70426

Yang J, Martinez B, Bulat A ( 2021 ) . Knowledge distillation via softmax regression representation learning . Conference: International Conference on Learning Representations (ICLR)

https://qmro.qmul.ac.uk/xmlui/handle/123456789/70425

Song S, Sanchez E, Tzimiropoulos G, Shen L, Valstar M ( 2021 ) . Self-supervised Learning of Person-specific Facial Dynamics for Automatic Personality Recognition . IEEE Transactions on Affective Computing

10.1109/TAFFC.2021.3064601

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72297

Ntinou IN, Sanchez E, Bulat A, Tzimiropoulos G ( 2021 ) . A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation . IEEE Transactions on Affective Computing

10.1109/TAFFC.2021.3061605

https://qmro.qmul.ac.uk/xmlui/handle/123456789/72299

Dimitrios M, Enrique S ( 2020 ) . Unsupervised Learning of Object Landmarks via Self-Training Correspondence . Conference: Advances in Neural Information Processing Systems (NeurIPS) from: 12/12/2020 to: 06/12/2020 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69167

Bulat A, Martinez B, Tzimiropoulos G ( 2020 ) . BATS: Binary ArchitecTure Search . Lecture Notes in Computer Science . vol. 12368 , 309 - 325 .

10.1007/978-3-030-58592-1_19

https://qmro.qmul.ac.uk/xmlui/handle/123456789/67666

Khan MH, Khan S, Shahabuddin M, Khan FS ( 2020 ) . AnimaWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces . Conference: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 6937 - 6946 .

10.1109/cvpr42600.2020.00697

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64003

Yang J, Bulat A ( 2020 ) . FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition . Proceedings of the AAAI Conference on Artificial Intelligence vol. 34 , ( 07 ) 12621 - 12628 .

10.1609/aaai.v34i07.6953

Kossaifi J, Bulat A, Tzimiropoulos G, Pantic M ( 2019 ) . T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor . Conference: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 7814 - 7823 .

10.1109/cvpr.2019.00801

Bulat A, Tzimiropoulos G ( 2018 ) . Hierarchical Binary CNNs for Landmark Localization with Limited Resources . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 42 , ( 2 ) 343 - 356 .

10.1109/tpami.2018.2866051

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64924

Bulat A, Tzimiropoulos G ( 2017 ) . How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) . Conference: 2017 IEEE International Conference on Computer Vision (ICCV)1021 - 1030 .

10.1109/iccv.2017.116

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64928

Jackson AS, Bulat A, Tzimiropoulos G ( 2017 ) . Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression . Conference: 2017 IEEE International Conference on Computer Vision (ICCV)1031 - 1039 .

10.1109/iccv.2017.117

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64927

Sanchez-Lozano E, Tzimiropoulos G, Martinez B, De la Torre F ( 2017 ) . A Functional Regression Approach to Facial Landmark Tracking . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 40 , ( 9 ) 2037 - 2050 .

10.1109/tpami.2017.2745568

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64925

Bulat A ( 2016 ) . Human Pose Estimation via Convolutional Part Heatmap Regression . Lecture Notes in Computer Science . vol. 9911 , 717 - 732 .

10.1007/978-3-319-46478-7_44

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64926

Bulat A ( 2016 ) . Convolutional aggregation of local evidence for large pose face alignment . Conference: Procedings of the British Machine Vision Conference 201686.1 - 86.12 .

10.5244/c.30.86

Tzimiropoulos G, Pantic M ( 2014 ) . Gauss-Newton Deformable Part Models for Face Alignment in-the-Wild . Conference: 2014 IEEE Conference on Computer Vision and Pattern Recognition1851 - 1858 .

10.1109/cvpr.2014.239

https://qmro.qmul.ac.uk/xmlui/handle/123456789/64929

Xenos A, Stafylakis T, Patras I, Tzimiropoulos G . A Simple Baseline for Knowledge-Based Visual Question Answering . Conference: Empirical Methods in Natural Language Processing from: 06/12/2023 to: 10/12/2023 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92288

Feng C, Zhi Z, Huang Z, Ge J, Xiao L, Sebe N, Tzimiropoulos G, Patras I . Deconstructing the Failure of Ideal Noise Correction: A Three-Pillar Diagnosis . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026

https://qmro.qmul.ac.uk/xmlui/handle/123456789/127695

Chen I-H, Hadji I, Sanchez E, Bulat A, Kuo S-Y, Timofte R, Tzimiropoulos G, Martinez B . Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026

https://qmro.qmul.ac.uk/xmlui/handle/123456789/127693

Bulat A, Maniadis I, Baldrati A, Ouali Y, Tzimiropoulos G . VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026

https://qmro.qmul.ac.uk/xmlui/handle/123456789/127694

Global main menu

Areas of study

Study at Queen Mary

Experience Queen Mary

Research and Innovation

Research by faculties and centres

Collaborations and partnerships

Publications: DR Georgios Tzimiropoulos