
Publications: Dr Georgios Tzimiropoulos

Bulat A, Ouali Y, Tzimiropoulos G ( 2025 ) . Compress & Cache: Vision token compression for efficient generation and retrieval . Conference: NeurIPS 2025. The Thirty-Ninth Annual Conference on Neural Information Processing Systems.
Ntinou I, Sanchez E ( 2024 ) . Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD . Conference: 2024 IEEE International Conference on Image Processing (ICIP) vol. 00 , 458 - 464 .
Maniadis Metaxas I, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing . Conference: European Conference on Computer Vision 2024 from: 29/09/2024 to: 04/10/2024 .
Tan F, Lee R, Dudziak Ł, Hu SX, Bhattacharya S, Hospedales T, Tzimiropoulos G, Martinez B ( 2024 ) . MobileQuant: Mobile-friendly Quantization for On-device Language Models .
Metaxas IM, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing .
Bulat A ( 2023 ) . Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V&L Models . International Journal of Computer Vision
Derakhshani MM, Sanchez E, Bulat A, Turrisi da Costa VG, Martinez B ( 2023 ) . Bayesian Prompt Learning for Image-Language Model Generalization . Conference: International Conference on Computer Vision
Ouali Y, Bulat A, Martinez B ( 2023 ) . Black Box Few-Shot Adaptation for Vision-Language models . Conference: International Conference on Computer Vision
Bulat A, Guerrero R, Martinez B ( 2023 ) . FS-DETR: Few-shot detection transformer with prompting and without re-training . Conference: International Conference on Computer Vision
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . HyperReenact: one-shot reenactment via jointly learning to refine and retarget faces . Conference: International Conference on Computer Vision
Bulat A, Sanchez E, Martinez B ( 2023 ) . ReGen: A good Generative zero-shot video classifier should be Rewarded . Conference: International Conference on Computer Vision
Mallis D, Sanchez E, Bell M ( 2023 ) . From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery . IEEE Transactions on Pattern Analysis and Machine Intelligence
Bounareli S, Tzimiropoulos G ( 2022 ) . Finding Directions in GAN’s Latent Space for Neural Face Reenactment . Conference: British Machine Vision Conference
Sun Z ( 2022 ) . Part-based Face Recognition with Vision Transformers . Conference: British Machine Vision Conference
Pan J, Bulat A, Tan F, Zhu X, Dudziak L, Li H, Tzimiropoulos G ( 2022 ) . EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers . Conference: European Conference on Computer Vision
Bulat A, Cheng S, Yang J, Sanchez E ( 2022 ) . Pre-training strategies and datasets for facial representation learning . Conference: European Conference on Computer Vision
Bulat A, Perez-Rua J-M, Tzimiropoulos G ( 2021 ) . Space-time Mixing Attention for Video Transformer . Conference: Thirty-fifth Conference on Neural Information Processing Systems
Bulat A, Tzimiropoulos G ( 2021 ) . Bit-Mixer: Mixed-precision networks with runtime bit-width selection . Conference: International Conference on Computer Vision (ICCV) from: 11/10/2021 to: 17/10/2021 .
Sanchez E, Tellamekala MK, Tzimiropoulos G ( 2021 ) . Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition . Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition from: 19/06/2021 to: 25/06/2021 .
Bulat A, Tzimiropoulos G ( 2021 ) . High-Capacity Expert Binary Networks . Conference: International Conference on Learning Representations (ICLR)
Yang J, Martinez B, Bulat A ( 2021 ) . Knowledge distillation via softmax regression representation learning . Conference: International Conference on Learning Representations (ICLR)
Song S, Sanchez E, Tzimiropoulos G, Shen L, Valstar M ( 2021 ) . Self-supervised Learning of Person-specific Facial Dynamics for Automatic Personality Recognition . IEEE Transactions on Affective Computing
Ntinou IN, Sanchez E, Bulat A, Tzimiropoulos G ( 2021 ) . A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation . IEEE Transactions on Affective Computing
Mallis D, Sanchez E ( 2020 ) . Unsupervised Learning of Object Landmarks via Self-Training Correspondence . Conference: Advances in Neural Information Processing Systems (NeurIPS) from: 06/12/2020 to: 12/12/2020 .
Bulat A, Martinez B ( 2020 ) . BATS: Binary ArchitecTure Search . Conference: European Conference on Computer Vision (ECCV) from: 23/08/2020 to: 28/08/2020 .
Yang J, Bulat A ( 2020 ) . FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition . Proceedings of the AAAI Conference on Artificial Intelligence vol. 34 , ( 07 ) 12621 - 12628 .
Kossaifi J, Bulat A, Tzimiropoulos G, Pantic M ( 2019 ) . T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor . Conference: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 7814 - 7823 .
Bulat A, Tzimiropoulos G ( 2018 ) . Hierarchical Binary CNNs for Landmark Localization with Limited Resources . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 42 , ( 2 ) 343 - 356 .
Jackson AS, Argyriou V, Tzimiropoulos G ( 2017 ) . Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression . Proceedings of the IEEE International Conference on Computer Vision . Conference: IEEE International Conference on Computer Vision vol. 2017-October , 1031 - 1039 .
Bulat A, Tzimiropoulos G ( 2017 ) . How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) . Conference: 2017 IEEE International Conference on Computer Vision (ICCV) , 1021 - 1030 .
Sanchez-Lozano E, Tzimiropoulos G, Martinez B, Torre FDL ( 2017 ) . A Functional Regression Approach to Facial Landmark Tracking . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 40 , ( 9 ) 2037 - 2050 .
Bulat A ( 2016 ) . Human pose estimation via convolutional part heatmap regression . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) . Conference: https://link.springer.com/conference/eccv vol. 9911 LNCS , 717 - 732 .
Bulat A ( 2016 ) . Convolutional aggregation of local evidence for large pose face alignment . Conference: Proceedings of the British Machine Vision Conference 2016 , 86.1 - 86.12 .
Tzimiropoulos G, Pantic M ( 2014 ) . Gauss-Newton deformable part models for face alignment in-the-wild . Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition . Conference: IEEE Computer Society Conference on Computer Vision and Pattern Recognition , 1851 - 1858 .
Xenos A, Stafylakis T, Patras I, Tzimiropoulos G . A Simple Baseline for Knowledge-Based Visual Question Answering . Conference: Empirical Methods in Natural Language Processing from: 06/12/2023 to: 10/12/2023 .
Khan MH, McDonagh J, Khan S, Shahabuddin M, Arora A, Khan FS, Shao L, Tzimiropoulos G . AnimalWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces . Conference: 2020 Conference on Computer Vision and Pattern Recognition
Sun Z, Song S, Patras I, Tzimiropoulos G . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition . Conference: Neural Information Processing Systems (NeurIPS 2024).
Hadji I, Noroozi M, Escorzia V, Zaganidis A, Martinez B, Tzimiropoulos G . Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025
Bulat A, Ouali Y, Guerrero R, Martinez B, Tzimiropoulos G . Efficient Vision-Language pre-training via domain-specific learning for human activities . Conference: Empirical Methods in Natural Language Processing
Yang H, Bulat A, Hadji I, Pham HX, Zhu X, Tzimiropoulos G, Martinez B . FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025
Bulat A, Tzimiropoulos G . LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models . Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition
Bulat A, Ouali Y, Tzimiropoulos G . QBB: Quantization with Binary Bases for LLMs . Conference: Neural Information Processing Systems
Ouali Y, Bulat A, Xenos A, Maniadis Metaxas I, Martinez B, Tzimiropoulos G . VLADVA: Discriminative Fine-tuning of LVLMs . Conference: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025
Ntinou I, Xenos A, Ouali Y, Bulat A, Tzimiropoulos G . Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene . Conference: 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
Tzelepis C, Tzimiropoulos G, Patras I . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space . Conference: International Conference on Computer Vision from: 11/10/2021 to: 17/10/2021 .