Publications: Dr Georgios Tzimiropoulos

Bulat A, Ouali Y, Tzimiropoulos G ( 2025 ) . Compress & Cache: Vision token compression for efficient generation and retrieval . Conference: NeurIPS 2025. The Thirty-Ninth Annual Conference on Neural Information Processing Systems.
( 2025 ) . Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 12789 - 12798 .
( 2025 ) . FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 2459 - 2468 .
( 2025 ) . VladVA: Discriminative Fine-tuning of LVLMs . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 4101 - 4111 .
( 2025 ) . Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions . Conference: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing , 14057 - 14073 .
Ntinou I, Sanchez E ( 2024 ) . Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD . Conference: 2024 IEEE International Conference on Image Processing (ICIP) vol. 00 , 458 - 464 .
Tan F, Lee R, Dudziak Ł, Hu SX, Bhattacharya S, Hospedales T, Tzimiropoulos G, Martinez B ( 2024 ) . MobileQuant: Mobile-friendly Quantization for On-device Language Models .
Maniadis Metaxas I, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing . Conference: European Conference on Computer Vision 2024 from: 29/09/2024 to: 04/10/2024 ,
Metaxas IM, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing .
( 2024 ) . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition . Conference: Advances in Neural Information Processing Systems vol. 37 , 35612 - 35638 .
Bulat A, Ouali Y, Guerrero R, Martinez B, Tzimiropoulos G ( 2024 ) . Efficient Vision-Language pre-training via domain-specific learning for human activities . Conference: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , 7978 - 8000 .
( 2024 ) . QBB: Quantization with Binary Bases for LLMs . Conference: Advances in Neural Information Processing Systems vol. 37 , 3209 - 3228 .
Bulat A, Tzimiropoulos G ( 2023 ) . Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V&L Models . International Journal of Computer Vision vol. 132 , ( 4 ) 1108 - 1125 .
Derakhshani MM, Sanchez E, Bulat A, Da Costa VGT, Snoek CGM, Tzimiropoulos G, Martinez B ( 2023 ) . Bayesian Prompt Learning for Image-Language Model Generalization . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 15191 - 15200 .
Ouali Y, Bulat A, Martinez B, Tzimiropoulos G ( 2023 ) . Black Box Few-Shot Adaptation for Vision-Language models . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 15488 - 15500 .
Bulat A, Guerrero R, Martinez B, Tzimiropoulos G ( 2023 ) . FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 11759 - 11768 .
( 2023 ) . HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 7115 - 7125 .
Bulat A, Sanchez E, Martinez B, Tzimiropoulos G ( 2023 ) . ReGen: A good Generative zero-shot video classifier should be Rewarded . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 13477 - 13487 .
Bulat A, Tzimiropoulos G ( 2023 ) . LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 23232 - 23241 .
Mallis D, Sanchez E, Bell M, Tzimiropoulos G ( 2023 ) . From Keypoints to Object Landmarks via Self-Training Correspondence: A Novel Approach to Unsupervised Landmark Discovery . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 45 , ( 7 ) 8390 - 8404 .
Bounareli S, Tzimiropoulos G ( 2022 ) . Finding Directions in GAN’s Latent Space for Neural Face Reenactment . Conference: British Machine Vision Conference
Sun Z ( 2022 ) . Part-based Face Recognition with Vision Transformers . Conference: British Machine Vision Conference
Pan J, Bulat A, Tan F, Zhu X, Dudziak L, Li H, Tzimiropoulos G, Martinez B ( 2022 ) . EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers . Lecture Notes in Computer Science . vol. 13671 , 294 - 311 .
Bulat A, Cheng S, Yang J, Garbett A, Sanchez E, Tzimiropoulos G ( 2022 ) . Pre-training Strategies and Datasets for Facial Representation Learning . Lecture Notes in Computer Science . vol. 13673 , 107 - 125 .
Bulat A, Perez-Rua J-M, Tzimiropoulos G ( 2021 ) . Space-time Mixing Attention for Video Transformer . Conference: Thirty-fifth Conference on Neural Information Processing Systems
Bulat A, Tzimiropoulos G ( 2021 ) . Bit-Mixer: Mixed-precision networks with runtime bit-width selection . Conference: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 5168 - 5177 .
( 2021 ) . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space . Conference: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 6373 - 6382 .
Sanchez E, Tellamekala MK, Valstar M, Tzimiropoulos G ( 2021 ) . Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition . Conference: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 9070 - 9080 .
Bulat A, Tzimiropoulos G ( 2021 ) . High-Capacity Expert Binary Networks . Conference: International Conference on Learning Representations (ICLR)
Yang J, Martinez B, Bulat A ( 2021 ) . Knowledge distillation via softmax regression representation learning . Conference: International Conference on Learning Representations (ICLR)
Song S, Sanchez E, Tzimiropoulos G, Shen L, Valstar M ( 2021 ) . Self-supervised Learning of Person-specific Facial Dynamics for Automatic Personality Recognition . IEEE Transactions on Affective Computing
Ntinou IN, Sanchez E, Bulat A, Tzimiropoulos G ( 2021 ) . A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation . IEEE Transactions on Affective Computing
Mallis D, Sanchez E ( 2020 ) . Unsupervised Learning of Object Landmarks via Self-Training Correspondence . Conference: Advances in Neural Information Processing Systems (NeurIPS) from: 06/12/2020 to: 12/12/2020 ,
Bulat A, Martinez B, Tzimiropoulos G ( 2020 ) . BATS: Binary ArchitecTure Search . Lecture Notes in Computer Science . vol. 12368 , 309 - 325 .
Khan MH, McDonagh J, Khan S, Shahabuddin M, Arora A, Khan FS, Shao L, Tzimiropoulos G ( 2020 ) . AnimaWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces . Conference: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 6937 - 6946 .
Yang J, Bulat A ( 2020 ) . FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition . Proceedings of the AAAI Conference on Artificial Intelligence vol. 34 , ( 07 ) 12621 - 12628 .
Kossaifi J, Bulat A, Tzimiropoulos G, Pantic M ( 2019 ) . T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor . Conference: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 7814 - 7823 .
Bulat A, Tzimiropoulos G ( 2018 ) . Hierarchical Binary CNNs for Landmark Localization with Limited Resources . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 42 , ( 2 ) 343 - 356 .
Bulat A, Tzimiropoulos G ( 2017 ) . How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) . Conference: 2017 IEEE International Conference on Computer Vision (ICCV) , 1021 - 1030 .
Jackson AS, Bulat A, Argyriou V, Tzimiropoulos G ( 2017 ) . Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression . Conference: 2017 IEEE International Conference on Computer Vision (ICCV) , 1031 - 1039 .
Sanchez-Lozano E, Tzimiropoulos G, Martinez B, De la Torre F, Valstar M ( 2017 ) . A Functional Regression Approach to Facial Landmark Tracking . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 40 , ( 9 ) 2037 - 2050 .
Bulat A, Tzimiropoulos G ( 2016 ) . Human Pose Estimation via Convolutional Part Heatmap Regression . Lecture Notes in Computer Science . vol. 9911 , 717 - 732 .
Bulat A ( 2016 ) . Convolutional aggregation of local evidence for large pose face alignment . Conference: Proceedings of the British Machine Vision Conference 2016 , 86.1 - 86.12 .
Tzimiropoulos G, Pantic M ( 2014 ) . Gauss-Newton Deformable Part Models for Face Alignment in-the-Wild . Conference: 2014 IEEE Conference on Computer Vision and Pattern Recognition , 1851 - 1858 .
Xenos A, Stafylakis T, Patras I, Tzimiropoulos G . A Simple Baseline for Knowledge-Based Visual Question Answering . Conference: Empirical Methods in Natural Language Processing from: 06/12/2023 to: 10/12/2023 ,