Publications: Ioannis Patras

Sahili ZA, Fetanat M, Fetanat M, Patras I, Purver M ( 2026 ) . FairJudge: Abstention-Aware Multimodal Judges for Fairness and Alignment Evaluation in Text-to-Image Models .

10.48550/arxiv.2510.22827

Chen Y, Wong WK, Li J, Patras I, Zheng X ( 2026 ) . Beyond Localization: A Comprehensive Diagnosis of Perspective-Conditioned Spatial Reasoning in MLLMs from Omnidirectional Images .

10.48550/arxiv.2605.12413

Ge J, Zhang X, Cao J, Liu B, Deuser F, Liu C, Wenkang G, Li S et al. ( 2026 ) . ViewSAM: Learning View-aware Cross-modal Semantics for Weakly Supervised Cross-view Referring Multi-Object Tracking .

10.48550/arxiv.2605.02638

Ge J, Cao J, Li X, Zhu X, Liu C, Liu B, Feng C, Patras I ( 2026 ) . Debate-Enhanced Pseudo Labeling and Frequency-Aware Progressive Debiasing for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations .

10.48550/arxiv.2512.20260

Oldfield J, Torr P, Patras I, Bibi A, Barez F ( 2026 ) . Beyond Linear Probes: Dynamic Safety Monitoring for Language Models .

10.48550/arxiv.2509.26238

Zhang Z, Li C, Liu X, Shen C, Liu Z, Patras I ( 2026 ) . Confidence Should Be Calibrated More Than One Turn Deep .

10.48550/arxiv.2604.05397

Zhang Z, Liu Z, Patras I ( 2026 ) . GrACE: A Generative Approach to Better Confidence Elicitation and Efficient Test-Time Scaling in Large Language Models .

10.48550/arxiv.2509.09438

Batziou E, Ioannidis K, Patras I, Vrochidis S, Kompatsiaris I ( 2026 ) . HDD-Unet: A Unet-based architecture for low-light image enhancement . Image and Vision Computing vol. 167 ,

10.1016/j.imavis.2025.105889

Oldfield J, Im S, Li S, Nicolaou MA, Patras I, Chrysos GG ( 2026 ) . Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders .

10.48550/arxiv.2505.21364

https://qmro.qmul.ac.uk/xmlui/handle/123456789/124993

Papadopoulos S, Patsiouras E, Ioannidis K, Vrochidis S, Kompatsiaris I, Patras I ( 2026 ) . Unsupervised Object Localization driven by self-supervised foundation models: A comprehensive review . Image and Vision Computing vol. 165 ,

10.1016/j.imavis.2025.105807

Sahili ZA, Patras I, Purver M ( 2025 ) . Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models .

10.48550/arxiv.2505.14160

https://qmro.qmul.ac.uk/xmlui/handle/123456789/115053

Ge J, Zhang X, Cao J, Zhu X, Liu W, Gao Q, Cao B, Wang K et al. ( 2025 ) . Gen4Track: A Tuning-free Data Augmentation Framework via Self-correcting Diffusion Model for Vision-Language Tracking . 3037 - 3046 .

10.1145/3746027.3754956

Feng C, Sebe N, Tzimiropoulos G, Rodrigues MRD, Patras I ( 2025 ) . Unveiling Open-set Noise: Theoretical Insights into Label Noise . Conference: Proceedings of the 33rd ACM International Conference on Multimedia3290 - 3299 .

10.1145/3746027.3755040

Gao Z, Song J, Zhang Z, Deng J, Patras I ( 2025 ) . Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation . Conference: 2025 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 19195 - 19205 .

10.1109/iccv51701.2025.01784

https://qmro.qmul.ac.uk/xmlui/handle/123456789/109964

Ricci S, Biondi N, Pernici F, Patras I, Del Bimbo A ( 2025 ) . $\boldsymbolλ$-Orthogonality Regularization for Compatible Representation Learning .

10.48550/arxiv.2509.16664

Sahili ZA, Patras I, Purver M ( 2025 ) . FairCoT: Enhancing Fairness in Text-to-Image Generation via Chain of Thought Reasoning with Multimodal Large Language Models .

10.48550/arxiv.2406.09070

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98455

Xenos A, Foteinopoulou NM, Ntinou I, Patras I, Tzimiropoulos G ( 2025 ) . VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning .

10.48550/arxiv.2404.07078

Xenos A, Foteinopoulou NM, Ntinou I, Patras I, Tzimiropoulos G ( 2025 ) . VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning . vol. 00 , 1 - 10 .

10.1109/ijcnn64981.2025.11227260

Goulas A, Mezaris V, Patras I ( 2025 ) . VidCtx: Context-aware Video Question Answering with Image Models . Conference: 2025 IEEE International Conference on Multimedia and Expo (ICME) vol. 00 , 1 - 6 .

10.1109/icme59968.2025.11210080

https://qmro.qmul.ac.uk/xmlui/handle/123456789/115031

Sahili ZA, Patras I, Purver M ( 2025 ) . Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges .

10.48550/arxiv.2407.16804

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98463

Zhao Z, Liu Z, Cao Y, Gong S, Patras I ( 2025 ) . AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 28748 - 28758 .

10.1109/cvpr52734.2025.02677

Diko A, Wang T, Swaileh W, Sun S, Patras I ( 2025 ) . ReWind: Understanding Long Videos with Instructed Learnable Memory . vol. 00 , 13734 - 13743 .

10.1109/cvpr52734.2025.01282

https://qmro.qmul.ac.uk/xmlui/handle/123456789/109961

Cao Y, Zhao Z, Patras I, Gong S ( 2025 ) . Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 7707 - 7716 .

10.1109/cvpr52734.2025.00722

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106353

Galanopoulos D, Goulas A, Leventakis A, Patras I, Mezaris V ( 2025 ) . An LLM Framework for Long-Form Video Retrieval and Audio-Visual Question Answering Using Qwen2/2.5 . Conference: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) vol. 00 , 3730 - 3739 .

10.1109/cvprw67362.2025.00358

https://qmro.qmul.ac.uk/xmlui/handle/123456789/115032

Ntrougkas MV, Mezaris V, Patras I ( 2025 ) . P-TAME: Explain Any Image Classifier with Trained Perturbations .

10.48550/arxiv.2501.17813

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2025 ) . DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment . Conference: 2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition (FG) vol. 00 , 1 - 11 .

10.1109/fg61629.2025.11099159

Cioni D, Tzelepis C, Seidenari L, Patras I ( 2025 ) . Are CLIP Features All You Need for Universal Synthetic Image Origin Attribution? . Lecture Notes in Computer Science vol. 15643 , 363 - 382 .

10.1007/978-3-031-92648-8_22

Meng D, Tzelepis C, Patras I, Tzimiropoulos G ( 2025 ) . MM2Latent: Text-to-Facial Image Generation and Editing in GANs with Multimodal Assistance . Lecture Notes in Computer Science vol. 15631 , 88 - 106 .

10.1007/978-3-031-91838-4_6

Kollias D, Psaroudakis A, Arsenos A, Theofilou P, Shao C, Hu G, Patras I ( 2025 ) . MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation . Computer Vision – ECCV 2024 Workshops , vol. 15637 , Springer Nature

Ntrougkas MV, Mezaris V, Patras I ( 2025 ) . P-TAME: Explain Any Image Classifier With Trained Perturbations . IEEE Open Journal of Signal Processing vol. 6 , 536 - 545 .

10.1109/ojsp.2025.3568756

https://qmro.qmul.ac.uk/xmlui/handle/123456789/114732

Foteinopoulou NM, Patras I ( 2025 ) . Machine learning approaches for fine-grained symptom estimation in schizophrenia: A comprehensive review . Artificial Intelligence in Medicine vol. 165 ,

10.1016/j.artmed.2025.103129

Goulas A, Mezaris V, Patras I ( 2025 ) . VidCtx: Context-aware Video Question Answering with Image Models .

10.48550/arxiv.2412.17415

Diko A, Wang T, Swaileh W, Sun S, Patras I ( 2025 ) . ReWind: Understanding Long Videos with Instructed Learnable Memory .

10.48550/arxiv.2411.15556

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2025 ) . DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment .

10.48550/arxiv.2403.17217

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97842

Cao Y, Zhao Z, Patras I, Gong S ( 2025 ) . Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts .

10.48550/arxiv.2503.16218

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106173

Zhao Z, Liu Z, Cao Y, Gong S, Patras I ( 2025 ) . AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data .

10.48550/arxiv.2503.05665

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106171

Zhao Z, Cao Y, Gong S, Patras I ( 2025 ) . Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer . Conference: 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) vol. 00 , 815 - 824 .

10.1109/wacv61041.2025.00089

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106511

Papadopoulos S, Ioannidis K, Vrochidis S, Kompatsiaris I, Patras I ( 2025 ) . Vision-Language Pretraining for Variable-Shot Image Classification . MultiMedia Modeling , vol. 15523 , Springer Nature

Al Sahili Z, Patras I, Purver M ( 2025 ) . Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models . Conference: Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics331 - 352 .

10.18653/v1/2025.ijcnlp-long.20

https://qmro.qmul.ac.uk/xmlui/handle/123456789/126151

Al Sahili Z, Patras I, Purver M ( 2025 ) . Data Matters Most: Auditing Social Bias in Contrastive Vision–Language Models . Transactions on Machine Learning Research vol. 2025-October ,

10.48550/arxiv.2501.13223

https://qmro.qmul.ac.uk/xmlui/handle/123456789/114012

Al Sahili Z, Patras I, Purver M ( 2025 ) . FairCoT: Enhancing Fairness in Text-to-Image Generation via Chain of Thought Reasoning with Multimodal Large Language Models . Conference: Findings of the Association for Computational Linguistics: EMNLP 2025792 - 816 .

10.18653/v1/2025.findings-emnlp.42

https://qmro.qmul.ac.uk/xmlui/handle/123456789/114552

Zhang Z, Liu Z, Patras I ( 2025 ) . Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization . Proceedings International Conference on Computational Linguistics Coling . 10924 - 10939 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/105339

Ionescu B, Patras I, Müller H, Del Bimbo A ( 2024 ) . Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation . ACM Transactions on Multimedia Computing Communications and Applications vol. 21 , ( 1 ) 1 - 7 .

10.1145/3703593

Alwazzan O, Gallagher-Syed A, Millner TO, Brandner S, Patras I, Marino S, Slabaugh G ( 2024 ) . Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology .

10.48550/arxiv.2411.17418

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106184

Zhao Z, Cao Y, Gong S, Patras I ( 2024 ) . Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer .

10.48550/arxiv.2405.19100

https://qmro.qmul.ac.uk/xmlui/handle/123456789/102459

Zhao Z, Patras I ( 2024 ) . Prompting Visual-Language Models for Dynamic Facial Expression Recognition .

10.48550/arxiv.2308.13382

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91038

Maniadis Metaxas I, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing . Lecture Notes in Computer Science vol. 15090 , 436 - 454 .

10.1007/978-3-031-73411-3_25

Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I ( 2024 ) . Bilinear Models of Parts and Appearances in Generative Adversarial Networks . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 46 , ( 12 ) 8568 - 8579 .

10.1109/tpami.2024.3415506

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97860

Sun Z, Song S, Patras I, Tzimiropoulos G ( 2024 ) . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition .

10.48550/arxiv.2409.18876

Feng C, Tzimiropoulos G, Patras I ( 2024 ) . CLIPCleaner: Cleaning Noisy Labels with CLIP . Conference: Proceedings of the 32nd ACM International Conference on Multimedia876 - 885 .

10.1145/3664647.3680664

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98922

Oldfield J, Georgopoulos M, Chrysos G, Tzelepis C, Panagakis Y, Nicolaou MA, Deng J, Patras I ( 2024 ) . Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization . Vancouver, CA , 38th Conference on Neural Information Processing Systems (NeurIPS)

Publisher URL

https://qmro.qmul.ac.uk/xmlui/handle/123456789/100964

Maniadis Metaxas I, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing . Conference: European Conference on Computer Vision 2024 from: 29/09/2024 to: 04/10/2024 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/99679

Kollias D, Shao C, Kaloidas O, Patras I ( 2024 ) . Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit .

10.48550/arxiv.2409.17717

Meng D, Tzelepis C, Patras I, Tzimiropoulos G ( 2024 ) . MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance .

10.48550/arxiv.2409.11010

Feng C, Tzimiropoulos G, Patras I ( 2024 ) . CLIPCleaner: Cleaning Noisy Labels with CLIP .

10.48550/arxiv.2408.10012

Kollias D, Psaroudakis A, Arsenos A, Theofilou P, Shao C, Hu G, Patras I ( 2024 ) . MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation .

10.48550/arxiv.2303.00180

Cioni D, Tzelepis C, Seidenari L, Patras I ( 2024 ) . Are CLIP features all you need for Universal Synthetic Image Origin Attribution? .

10.48550/arxiv.2408.09153

Zhang Z, Liu Z, Patras I ( 2024 ) . Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization .

10.48550/arxiv.2408.04983

https://qmro.qmul.ac.uk/xmlui/handle/123456789/104580

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2024 ) . One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space . Int. J. Comput. Vis. vol. 132 , Article 8 , 3324 - 3354 .

Metaxas IM, Tzimiropoulos G, Patras I ( 2024 ) . Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing .

10.48550/arxiv.2407.11168

Feng C, Tzimiropoulos G, Patras I ( 2024 ) . NoiseBox: Toward More Efficient and Effective Learning With Noisy Labels . IEEE Transactions on Circuits and Systems for Video Technology vol. 34 , ( 11 ) 11914 - 11928 .

10.1109/tcsvt.2024.3426994

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97934

Metaxas IM, Bulat A, Patras I, Martinez B, Tzimiropoulos G ( 2024 ) . Aligned Unsupervised Pretraining of Object Detectors with Self-training .

10.48550/arxiv.2307.15697

Apostolidis E, Balaouras G, Patras I, Mezaris V ( 2024 ) . Explainable Video Summarization for Advancing Media Content Production . Encyclopedia of Information Science and Technology, Sixth Edition , IGI Global

Sun Z, Feng C, Patras I, Tzimiropoulos G ( 2024 ) . LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition . Conference: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 1639 - 1649 .

10.1109/cvpr52733.2024.00162

https://qmro.qmul.ac.uk/xmlui/handle/123456789/95407

Gao Z, Patras I ( 2024 ) . Self-Supervised Facial Representation Learning with Facial Region Awareness . Conference: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 2081 - 2092 .

10.1109/cvpr52733.2024.00203

https://qmro.qmul.ac.uk/xmlui/handle/123456789/95406

Foteinopoulou NM, Patras I ( 2024 ) . EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition . vol. 00 , 1 - 10 .

10.1109/fg59268.2024.10581982

Alwazzan O, Patras I, Slabaugh G ( 2024 ) . FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification . vol. 00 , 1 - 5 .

10.1109/isbi56570.2024.10635901

https://qmro.qmul.ac.uk/xmlui/handle/123456789/99506

Singh AK, Patras I ( 2024 ) . FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion .

10.48550/arxiv.2404.18591

Zoumpourlis G, Patras I ( 2024 ) . Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training . Conference: 12th International Winter Conference on Brain-Computer Interface (BCI) from: 26/02/2024 to: 28/02/2024 ,

10.1109/BCI60775.2024.10480476

https://qmro.qmul.ac.uk/xmlui/handle/123456789/94696

Foteinopoulou NM, Patras I ( 2024 ) . EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition .

10.48550/arxiv.2310.16640

https://qmro.qmul.ac.uk/xmlui/handle/123456789/95405

Sun Z, Feng C, Patras I, Tzimiropoulos G ( 2024 ) . LAFS: Landmark-based Facial Self-supervised Learning for Face Recognition .

10.48550/arxiv.2403.08161

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2024 ) . One-Shot Neural Face Reenactment via Finding Directions in GAN’s Latent Space . International Journal of Computer Vision vol. 132 , ( 8 ) 3324 - 3354 .

10.1007/s11263-024-02018-6

Alwazzan O, Khan A, Patras I, Slabaugh G ( 2024 ) . MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading .

10.48550/arxiv.2403.06349

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106187

Alwazzan O, Patras I, Slabaugh G ( 2024 ) . FOAA: Flattened Outer Arithmetic Attention For Multimodal Tumor Classification .

10.48550/arxiv.2403.06339

https://qmro.qmul.ac.uk/xmlui/handle/123456789/106189

Gao Z, Patras I ( 2024 ) . Self-Supervised Facial Representation Learning with Facial Region Awareness .

10.48550/arxiv.2403.02138

Zoumpourlis G, Patras I ( 2024 ) . Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training . vol. 00 , 1 - 8 .

10.1109/bci60775.2024.10480476

Zoumpourlis G, Patras I ( 2024 ) . Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training .

10.48550/arxiv.2211.11460

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2024 ) . One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space .

10.48550/arxiv.2402.03553

D’Incà M, Tzelepis C, Patras I, Sebe N ( 2024 ) . Improving Fairness using Vision-Language Driven Image Augmentation . vol. 00 , 4683 - 4692 .

10.1109/wacv57701.2024.00463

Gao Z, Feng C, Patras I ( 2024 ) . Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features . Conference: 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) vol. 00 , 1762 - 1772 .

10.1109/wacv57701.2024.00179

Kollias D, Shao C, Kaloidas O, Patras I ( 2024 ) . Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit . CoRR vol. abs/2409.17717 ,

Feng C, Tzimiropoulos G, Patras I, Cai J, Kankanhalli MS, Prabhakaran B, Boll S, Subramanian R et al. ( 2024 ) . CLIPCleaner: Cleaning Noisy Labels with CLIP . ACM Multimedia . 876 - 885 .

Patras I, Song S, Sun Z, Tzimiropoulos G ( 2024 ) . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition . Conference: Advances in Neural Information Processing Systems 3735612 - 35638 .

10.52202/079017-1123

https://qmro.qmul.ac.uk/xmlui/handle/123456789/100861

Alwazzan O, Patras I, Slabaugh GG ( 2024 ) . FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification . ISBI . 1 - 5 .

Singh AK, Patras I ( 2024 ) . FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion . CoRR vol. abs/2404.18591 ,

D'Incà M, Tzelepis C, Patras I, Sebe N ( 2024 ) . Improving Fairness using Vision-Language Driven Image Augmentation . WACV . 4683 - 4692 .

Sun Z, Feng C, Patras I, Tzimiropoulos G ( 2024 ) . LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition . CVPR . 1639 - 1649 .

Alwazzan O, Khan A, Patras I, Slabaugh GG ( 2024 ) . MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading . CoRR vol. abs/2403.06349 ,

Chrysos G, Deng J, Georgopoulos M, Nicolaou M, Oldfield J, Panagakis Y, Patras I, Tzelepis C ( 2024 ) . Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization . 53022 - 53063 .

10.52202/079017-1680

Sahili ZA, Patras I, Purver M ( 2024 ) . Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/99478

Alwazzan O, Gallagher-Syed A, Millner T, Patras I, Marino S, Slabaugh GG ( 2024 ) . Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology . CoRR vol. abs/2411.17418 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/109962

Gao Z, Patras I ( 2024 ) . Self-Supervised Facial Representation Learning with Facial Region Awareness . CVPR . 2081 - 2092 .

Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I ( 2023 ) . Parts of Speech-Grounded Subspaces in Vision-Language Models .

10.48550/arxiv.2305.14053

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91800

D'Incà M, Tzelepis C, Patras I, Sebe N ( 2023 ) . Improving Fairness using Vision-Language Driven Image Augmentation .

10.48550/arxiv.2311.01573

Apostolidis E, Mezaris V, Patras I ( 2023 ) . A Study on the Use of Attention for Explaining Video Summarization . Conference: Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos41 - 49 .

10.1145/3607540.3617138

Kankanhalli MS, Patras I, Liu J, Wong Y, Komamizu T ( 2023 ) . NarSUM 2023 Chairs Welcome . Narsum 2023 Proceedings of the 2nd Workshop on User Centric Narrative Summarization of Long Videos Co Located with mm 2023

Kankanhalli MS, Patras I, Liu J, Wong Y, Komamizu T, Yamazaki S, Stephen K, Kansal K ( 2023 ) . NarSUM '23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos . Conference: Proceedings of the 31st ACM International Conference on Multimedia9731 - 9733 .

10.1145/3581783.3610946

Foteinopoulou NM, Patras I ( 2023 ) . Machine Learning Approaches for Fine-Grained Symptom Estimation in Schizophrenia: A Comprehensive Review .

10.48550/arxiv.2310.16677

Xenos A, Stafylakis T, Patras I, Tzimiropoulos G ( 2023 ) . A Simple Baseline for Knowledge-Based Visual Question Answering .

10.48550/arxiv.2310.13570

Apostolidis E, Balaouras G, Mezaris V, Patras I ( 2023 ) . Selecting A Diverse Set Of Aesthetically-Pleasing and Representative Video Thumbnails Using Reinforcement Learning . Conference: 2023 IEEE International Conference on Image Processing (ICIP) vol. 00 , 2460 - 2464 .

10.1109/icip49359.2023.10222743

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces . Conference: 2023 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 7115 - 7125 .

10.1109/iccv51070.2023.00657

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91848

Gao Z, Feng C, Patras I ( 2023 ) . Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features .

10.48550/arxiv.2308.13392

https://qmro.qmul.ac.uk/xmlui/handle/123456789/90358

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces .

10.48550/arxiv.2307.10797

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91780

Barattin S, Tzelepis C, Patras I, Sebe N ( 2023 ) . Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 8001 - 8010 .

10.1109/cvpr52729.2023.00773

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91799

Metaxas IM, Tzimiropoulos G, Patras I ( 2023 ) . DivClust: Controlling Diversity in Deep Clustering . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 3418 - 3428 .

10.1109/cvpr52729.2023.00333

https://qmro.qmul.ac.uk/xmlui/handle/123456789/87819

Feng C, Patras I ( 2023 ) . MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) vol. 00 , 19913 - 19922 .

10.1109/cvpr52729.2023.01907

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91782

Kordopatis-Zilos G, Tolias G, Tzelepis C, Kompatsiaris I, Patras I, Papadopoulos S ( 2023 ) . Self-Supervised Video Similarity Learning . Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) vol. 00 , 4756 - 4766 .

10.1109/cvprw59228.2023.00504

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91779

Kordopatis-Zilos G, Tolias G, Tzelepis C, Kompatsiaris I, Patras I, Papadopoulos S ( 2023 ) . Self-Supervised Video Similarity Learning .

10.48550/arxiv.2304.03378

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91801

Patras I ( 2023 ) . Controllable image generation and manipulation . Conference: Proceedings of the 2nd ACM International Workshop on Multimedia AI against Disinformation1 - 1 .

10.1145/3592572.3596476

Alwazzan O, Khan A, Patras I, Slabaugh G ( 2023 ) . MOAB: Multi-Modal Outer Arithmetic Block for Fusion of Histopathological Images and Genetic Data for Brain Tumor Grading . Conference: 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI) vol. 00 , 1 - 5 .

10.1109/isbi53787.2023.10230698

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92245

Metaxas IM, Tzimiropoulos G, Patras I ( 2023 ) . DivClust: Controlling Diversity in Deep Clustering .

10.48550/arxiv.2304.01042

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91781

Batziou E, Ioannidis K, Patras I, Vrochidis S, Kompatsiaris I ( 2023 ) . Low-Light Image Enhancement Based on U-Net and Haar Wavelet Pooling . Lecture Notes in Computer Science . vol. 13834 , 510 - 522 .

10.1007/978-3-031-27818-1_42

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98310

Feng C, Patras I ( 2023 ) . MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset .

10.48550/arxiv.2303.12756

https://qmro.qmul.ac.uk/xmlui/handle/123456789/86878

Barattin S, Tzelepis C, Patras I, Sebe N ( 2023 ) . Attribute-preserving Face Dataset Anonymization via Latent Code Optimization .

10.48550/arxiv.2303.11296

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91770

Yang Q, Tzelepis C, Nikolenko S, Patras I, Farseev A ( 2023 ) . "Just To See You Smile": SMILEY, a Voice-Guided <strike>GUY</strike> GAN . Conference: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining1196 - 1199 .

10.1145/3539597.3573031

Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I ( 2023 ) . PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs .

10.48550/arxiv.2206.00048

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91798

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment . Conference: 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG) vol. 00 , 1 - 8 .

10.1109/fg57933.2023.10042744

Xenos A, Stafylakis T, Patras I, Tzimiropoulos G ( 2023 ) . A Simple Baseline for Knowledge-Based Visual Question Answering . Conference: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing14871 - 14877 .

10.18653/v1/2023.emnlp-main.919

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91913

Batziou E, Ioannidis K, Patras I, Vrochidis S, Kompatsiaris I ( 2023 ) . Artistic neural style transfer using CycleGAN and FABEMD by adaptive information selection . Pattern Recognition Letters vol. 165 , 55 - 62 .

10.1016/j.patrec.2022.11.026

https://qmro.qmul.ac.uk/xmlui/handle/123456789/98312

Barattin S, Tzelepis C, Patras I, Sebe N ( 2023 ) . Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization . CVPR . 8001 - 8010 .

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces . ICCV . 7115 - 7125 .

Oldfield J, Tzelepis C, Panagakis Y, Nicolaou M, Patras I ( 2023 ) . PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs . ICLR .

Nicolaou M, Oldfield J, Panagakis Y, Patras I, Tzelepis C ( 2023 ) . Parts of Speech–Grounded Subspaces in Vision-Language Models . Conference: Advances in Neural Information Processing Systems 362700 - 2724 .

10.52202/075280-0121

Zhao Z, Patras I ( 2023 ) . Prompting Visual-Language Models for Dynamic Facial Expression Recognition . BMVC . 98 - 98 .

Kordopatis-Zilos G, Tolias G, Tzelepis C, Kompatsiaris I, Patras I, Papadopoulos S ( 2023 ) . Self-Supervised Video Similarity Learning . CVPR Workshops . 4756 - 4766 .

Metaxas IM, Bulat A, Patras I, Martínez B, Tzimiropoulos G ( 2023 ) . SimDETR: Simplifying self-supervised pretraining for DETR . CoRR vol. abs/2307.15697 ,

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2023 ) . StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment . FG . 1 - 8 .

Apostolidis E, Balaouras G, Mezaris V, Patras I ( 2022 ) . Explaining video summarization based on the focus of attention . Conference: 2022 IEEE International Symposium on Multimedia (ISM) vol. 00 , 146 - 150 .

10.1109/ism55400.2022.00029

Feng C, Patras I ( 2022 ) . Adaptive Soft Contrastive Learning . Conference: 2022 26th International Conference on Pattern Recognition (ICPR)

10.1109/ICPR56361.2022.9956660

Foteinopoulou NM, Patras I ( 2022 ) . Learning from Label Relationships in Human Affect . Conference: Proceedings of the 30th ACM International Conference on Multimedia80 - 89 .

10.1145/3503161.3548373

https://qmro.qmul.ac.uk/xmlui/handle/123456789/79854

Patras I ( 2022 ) . Video Summarization in the Deep Learning Era . Conference: Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos1 - 1 .

10.1145/3552463.3554166

Feng C, Tzimiropoulos G, Patras I ( 2022 ) . SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise .

10.48550/arxiv.2111.11288

https://qmro.qmul.ac.uk/xmlui/handle/123456789/84080

Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G ( 2022 ) . StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment .

10.48550/arxiv.2209.13375

Panwar H, Patras I ( 2022 ) . Capsule Network based Contrastive Learning of Unsupervised Visual Representations .

10.48550/arxiv.2209.11276

Feng C, Patras I ( 2022 ) . Adaptive Soft Contrastive Learning . Conference: 2022 26th International Conference on Pattern Recognition (ICPR) vol. 00 , 2721 - 2727 .

10.1109/icpr56361.2022.9956660

https://qmro.qmul.ac.uk/xmlui/handle/123456789/84079

Foteinopoulou NM, Patras I ( 2022 ) . Learning from Label Relationships in Human Affect .

10.48550/arxiv.2207.05577

Kordopatis-Zilos G, Tzelepis C, Papadopoulos S, Kompatsiaris I, Patras I ( 2022 ) . DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval . International Journal of Computer Vision vol. 130 , ( 10 ) 2385 - 2407 .

10.1007/s11263-022-01651-3

https://qmro.qmul.ac.uk/xmlui/handle/123456789/86239

Feng C, Patras I ( 2022 ) . Adaptive Soft Contrastive Learning .

10.48550/arxiv.2207.11163

Apostolidis E, Balaouras G, Mezaris V, Patras I ( 2022 ) . Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames . Conference: Proceedings of the 2022 International Conference on Multimedia Retrieval407 - 415 .

10.1145/3512527.3531404

Tzelepis C, Oldfield J, Tzimiropoulos G, Patras I ( 2022 ) . ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences .

10.48550/arxiv.2206.02104

Zoumpourlis G, Patras I ( 2022 ) . CovMix: Covariance Mixing Regularization for Motor Imagery Decoding . Conference: 2022 10th International Winter Conference on Brain-Computer Interface (BCI) vol. 00 , 1 - 7 .

10.1109/bci53720.2022.9734883

https://qmro.qmul.ac.uk/xmlui/handle/123456789/77747

Tzelepis C, Oldfield J, Tzimiropoulos G, Patras I ( 2022 ) . ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences . CoRR vol. abs/2206.02104 ,

Kordopatis-Zilos G, Tzelepis C, Papadopoulos S, Kompatsiaris I, Patras I ( 2022 ) . DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval . Int. J. Comput. Vis. vol. 130 , Article 10 , 2385 - 2407 .

Foteinopoulou NM, Patras I, Magalhães J, Bimbo AD, Satoh S, Sebe N, Alameda-Pineda X, Jin Q et al. ( 2022 ) . Learning from Label Relationships in Human Affect . ACM Multimedia . 80 - 89 .

Feng C, Tzimiropoulos G, Patras I ( 2022 ) . SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise . Bmvc 2022 33rd British Machine Vision Conference Proceedings .

Patras I, Kankanhalli MS, Liu J, Wong Y ( 2022 ) . Video Summarization in the Deep Learning Era: Current Landscape and Future Directions . NarSUM@MM . 1 - 1 .

Apostolidis E, Balaouras G, Mezaris V, Patras I ( 2021 ) . Combining Global and Local Attention with Positional Encoding for Video Summarization . Conference: 2021 IEEE International Symposium on Multimedia (ISM) vol. 00 , 226 - 234 .

10.1109/ism52913.2021.00045

Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I ( 2021 ) . Tensor Component Analysis for Interpreting the Latent Space of GANs .

10.48550/arxiv.2111.11736

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2021 ) . Video Summarization Using Deep Neural Networks: A Survey . Proceedings of the IEEE vol. 109 , ( 11 ) 1838 - 1863 .

10.1109/jproc.2021.3117472

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74438

Tzelepis C, Tzimiropoulos G, Patras I ( 2021 ) . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space . Conference: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 6373 - 6382 .

10.1109/iccv48922.2021.00633

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74209

Foteinopoulou NM, Tzelepis C, Patras I ( 2021 ) . Estimating continuous affect with label uncertainty . Conference: 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII) vol. 00 , 1 - 8 .

10.1109/acii52823.2021.9597425

https://qmro.qmul.ac.uk/xmlui/handle/123456789/75041

Zoumpourlis G, Patras I ( 2021 ) . Pairwise Ranking Network for Affect Recognition . Conference: 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII) vol. 00 , 1 - 8 .

10.1109/acii52823.2021.9597392

https://qmro.qmul.ac.uk/xmlui/handle/123456789/73976

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2021 ) . Video Summarization Using Deep Neural Networks: A Survey .

10.48550/arxiv.2101.06072

Tzelepis C, Tzimiropoulos G, Patras I ( 2021 ) . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space .

10.48550/arxiv.2109.13357

Xie T-T, Tzelepis C, Fu F, Patras I ( 2021 ) . Few-Shot Action Localization without Knowing Boundaries .

10.48550/arxiv.2106.04150

Apostolidis E, Adamantidou E, Mezaris V, Patras I ( 2021 ) . Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection . Conference: Proceedings of the 2021 International Conference on Multimedia Retrieval1 - 9 .

10.1145/3460426.3463630

Xie T-T, Tzelepis C, Fu F, Patras I ( 2021 ) . Few-Shot Action Localization without Knowing Boundaries . Conference: Proceedings of the 2021 International Conference on Multimedia Retrieval339 - 348 .

10.1145/3460426.3463643

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74428

( 2021 ) . DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval .

10.48550/arxiv.2106.13266

https://qmro.qmul.ac.uk/xmlui/handle/123456789/86239

Fu F, Xie T, Patras I, Jalali S ( 2021 ) . Relationship-based Neural Baby Talk .

10.48550/arxiv.2103.04846

Tzelepis C, Patras I ( 2021 ) . Uncertainty Propagation in Convolutional Neural Networks: Technical Report .

10.48550/arxiv.2102.06064

Batziou E, Alvanitopoulos P, Ioannidis K, Patras I, Vrochidis S, Kompatsiaris I ( 2021 ) . Cycle-Consistent Adversarial Networks and Fast Adaptive Bi-dimensional Empirical Mode Decomposition for Style Transfer . Conference: 2020 25th International Conference on Pattern Recognition (ICPR) vol. 00 , 2360 - 2367 .

10.1109/icpr48806.2021.9412904

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69132

Chen L, Liang Y, Shi X, Zhou Y, Wu C, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Crossed-Time Delay Neural Network for Speaker Recognition . MMM (1) . vol. 12572 , 1 - 10 .

Lu X, Zhang J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . DANet: Deformable Alignment Network for Video Inpainting . MMM (1) . vol. 12572 , 430 - 442 .

Feng D, Zhang Y, Zhu C, Zhang H, Song L, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . DVRCNN: Dark Video Post-processing Method for VVC . MMM (1) . vol. 12572 , 691 - 703 .

Yang K, Lu J, Hu S, Chen X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Deep 3D Modeling of Human Bodies from Freehand Sketching . MMM (2) . vol. 12573 , 36 - 48 .

Xue L, Yao W, Xia Y, Li X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Deep Attributed Network Embedding with Community Information . MMM (1) . vol. 12572 , 653 - 665 .

Wen Z, Feng A, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Deep Centralized Cross-modal Retrieval . MMM (1) . vol. 12572 , 443 - 455 .

Yang S, Xue H, Ling J, Song L, Xie R, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Deep Face Swapping via Cross-Identity Adversarial Training . MMM (2) . vol. 12573 , 74 - 86 .

Constantin MG, Stefan L-D, Ionescu B, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . DeepFusion: Deep Ensembles for Domain Independent System Fusion . MMM (1) . vol. 12572 , 240 - 252 .

Zhang Z, Ma J, Xu P, Wang W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Dense Attention-Guided Network for Boundary-Aware Salient Object Detection . MMM (1) . vol. 12572 , 148 - 161 .

Wang F, Ding Y, Liang H, Wen J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Discriminative and Selective Pseudo-Labeling for Domain Adaptation . MMM (1) . vol. 12572 , 365 - 377 .

Zhang X, Du T, Zhang Z, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . EEG Emotion Recognition Based on Channel Attention for E-Healthcare Applications . MMM (2) . vol. 12573 , 159 - 169 .

Khan OS, Jónsson BÞ, Larsen MD, Poulsen LAS, Koelma DC, Rudinac S, Worring M, Zahálka J et al. ( 2021 ) . Exquisitor at the Video Browser Showdown 2021: Relationships Between Semantic Classifiers . MMM (2) . vol. 12573 , 410 - 416 .

Zhao H, She X, Wang S, Ma K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Fast Discrete Matrix Factorization Hashing for Large-Scale Cross-Modal Retrieval . MMM (1) . vol. 12572 , 24 - 36 .

Wu S, Wang Z, Cai Y, Wang R, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Fast Mode Decision Algorithm for Intra Encoding of the 3rd Generation Audio Video Coding Standard . MMM (1) . vol. 12572 , 481 - 492 .

Qiu T, Ni B, Liu Z, Chen X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Fast Optimal Transport Artistic Style Transfer . MMM (1) . vol. 12572 , 37 - 49 .

Xie T-T, Tzelepis C, Fu F, Patras I, Cheng W-H, Kankanhalli MS, Wang M, Chu W-T et al. ( 2021 ) . Few-Shot Action Localization without Knowing Boundaries . ICMR . 339 - 348 .

Wang H, Lian J, Xiong S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Few-Shot Learning with Unlabeled Outlier Exposure . MMM (1) . vol. 12572 , 340 - 351 .

Sun W, Xu J, Yang G, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Fine-Grained Generation for Zero-Shot Learning . MMM (1) . vol. 12572 , 580 - 591 .

Zheng M, Jia Y, Jiang H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Fine-Grained Image-Text Retrieval via Complementary Feature Learning . MMM (1) . vol. 12572 , 592 - 604 .

Zhang L, Zhang H, Zhu C, Guo S, Chen J, Wang L, Lokoc J, Skopal T et al. ( 2021 ) . Fine-Grained Video Deblurring with Event Camera . MMM (1) . vol. 12572 , 352 - 364 .

Li F, Wang W, Liu Z, Wang H, Yan C, Wu B, Lokoc J, Skopal T et al. ( 2021 ) . Frame Aggregation and Multi-modal Fusion Framework for Video-Based Person Recognition . MMM (1) . vol. 12572 , 75 - 86 .

Giannakeris P, Tsanousa A, Mavropoulos T, Meditskos G, Ioannidis K, Vrochidis S, Kompatsiaris I, Lokoc J et al. ( 2021 ) . Fusion of Multimodal Sensor Data for Effective Human Action Recognition in the Service of Medical Platforms . MMM (2) . vol. 12573 , 367 - 378 .

Liu S, Claypool M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Game Input with Delay - A Model of the Time Distribution for Selecting a Moving Target with a Mouse . MMM (1) . vol. 12572 , 506 - 518 .

Shan X, Wen Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Gaussian Mixture Model Based Semi-supervised Sparse Representation for Face Recognition . MMM (1) . vol. 12572 , 716 - 727 .

Xiao Z, Li D, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Generative Image Inpainting by Hybrid Contextual Attention Network . MMM (1) . vol. 12572 , 162 - 173 .

Zhang C, Zhang W, Chen F, Cheng Y, Gao S, Zhang W, Lokoc J, Skopal T et al. ( 2021 ) . Global Cognition and Local Perception Network for Blind Image Deblurring . MMM (1) . vol. 12572 , 303 - 314 .

Wang X, Li X, Wu S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Graph Structure Reasoning Network for Face Alignment and Reconstruction . MMM (1) . vol. 12572 , 493 - 505 .

Nguyen M-D, Binh NT, Gurrin C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Graph-Based Indexing and Retrieval of Lifelog Data . MMM (2) . vol. 12573 , 256 - 267 .

Pei D, Li A, Wang Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Group Activity Recognition by Exploiting Position Distribution and Appearance Relation . MMM (1) . vol. 12572 , 123 - 135 .

Garcia-Ceja E, Thambawita V, Hicks SA, Jha D, Jakobsen P, Hammer HL, Halvorsen P, Riegler MA et al. ( 2021 ) . HTAD: A Home-Tasks Activities Dataset with Wrist-Accelerometer and Audio Features . MMM (2) . vol. 12573 , 196 - 205 .

Lee Y, Choi H, Park S, Ro YM, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . IVIST: Interactive Video Search Tool in VBS 2021 . MMM (2) . vol. 12573 , 423 - 428 .

Ressmann A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . IVOS - The ITEC Interactive Video Object Search System at VBS2021 . MMM (2) . vol. 12573 , 479 - 483 .

Qiu Y, Chen J, Wang X, Jang K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Illuminate Low-Light Image via Coarse-to-fine Multi-level Network . MMM (1) . vol. 12572 , 253 - 264 .

Jiang S, Wang C, Huang C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Image Registration Improved by Generative Adversarial Networks . MMM (2) . vol. 12573 , 26 - 35 .

Apostolakis A, Girtsou S, Kontoes C, Papoutsis I, Tsoutsos M, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Implementation of a Random Forest Classifier to Examine Wildfire Predictive Modelling in Greece Using Diachronically Collected Fire Occurrence and Fire Mapping Data . MMM (2) . vol. 12573 , 318 - 329 .

Feng C, Li D, Zheng J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Improving Supervised Cross-modal Retrieval with Semantic Graph Embedding . MMM (1) . vol. 12572 , 187 - 199 .

Zhu Z, Sun L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Initialize with Mask: For More Efficient Federated Learning . MMM (2) . vol. 12573 , 111 - 120 .

Smeaton AF, Krishnamurthy NG, Suryanarayana AH, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Keystroke Dynamics as Part of Lifelogging . MMM (2) . vol. 12573 , 183 - 195 .

Jha D, Ali S, Emanuelsen K, Hicks SA, Thambawita V, Garcia-Ceja E, Riegler MA, Lange TD et al. ( 2021 ) . Kvasir-Instrument: Diagnostic and Therapeutic Tool Segmentation Dataset in Gastrointestinal Endoscopy . MMM (2) . vol. 12573 , 218 - 229 .

Zhang P, Ouyang D, Jiang C, Shao J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Language Person Search with Pair-Based Weighting Loss . MMM (1) . vol. 12572 , 227 - 239 .

Liu Z-Y, Liu J-W, Zuo X, Li W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Learning 3D-Craft Generation with Predictive Action Neural Network . MMM (1) . vol. 12572 , 541 - 553 .

Lu L, Lu Y, Wang S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Learning Multi-level Interaction Relations and Feature Representations for Group Activity Recognition . MMM (1) . vol. 12572 , 617 - 628 .

Zheng W, Yan L, Wang F-Y, Gou C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Learning from the Negativity: Deep Negative Correlation Meta-Learning for Adversarial Image Classification . MMM (1) . vol. 12572 , 531 - 540 .

Leibetseder A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Less is More - diveXplore 5.0 at VBS 2021 . MMM (2) . vol. 12573 , 455 - 460 .

Chen X, Liu R, Song X, Han Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Locating Visual Explanations for Video Question Answering . MMM (1) . vol. 12572 , 290 - 302 .

Gu Q, Luo Z, Zhao W, Zhu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . MM-Net: Learning Adaptive Meta-metric for Few-Shot Biometric Recognition . MMM (1) . vol. 12572 , 265 - 277 .

Nguyen D-H, Tan LTN, Nguyen M-T, Nguyen T-B, Dao M-S, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . MNR-Air: An Economic and Dynamic Crowdsourcing Mechanism to Collect Personal Lifelog and Surrounding Environment Dataset. A Case Study in Ho Chi Minh City, Vietnam . MMM (2) . vol. 12573 , 206 - 217 .

Zhang Y, Zhao H, Zhou F, Zhang Q, Shi Y, Liang L, Lokoc J, Skopal T et al. ( 2021 ) . MSCANet: Adaptive Multi-scale Context Aggregation Network for Congested Crowd Counting . MMM (2) . vol. 12573 , 1 - 12 .

Song W, Dai S, Huang D, Song J, Liotta A, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Median-Pooling Grad-CAM: An Efficient Inference Level Visual Explanation for CNN Networks in Remote Sensing Image Classification . MMM (2) . vol. 12573 , 134 - 146 .

Codina-Filbà J, Escalera S, Escudero J, Antens C, Buch-Cardona P, Farrús M, Lokoc J, Skopal T et al. ( 2021 ) . Mobile eHealth Platform for Home Monitoring of Bipolar Disorder . MMM (2) . vol. 12573 , 330 - 341 .

Zhang F, Li M, Zhai G, Liu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization . MMM (1) . vol. 12572 , 136 - 147 .

Liu Y, Lu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Multi-grained Fusion for Conditional Image Retrieval . MMM (1) . vol. 12572 , 315 - 327 .

Zhang X, Zhang Y, Zhang Z, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Multi-granularity Recurrent Attention Graph Neural Network for Few-Shot Learning . MMM (2) . vol. 12573 , 147 - 158 .

Long J, Lu H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Multi-level Gate Feature Aggregation with Spatially Adaptive Batch-Instance Normalization for Semantic Image Synthesis . MMM (1) . vol. 12572 , 378 - 390 .

Gao R, Huang Z, Liu S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Multi-task Deep Learning for No-Reference Screen Content Image Quality Assessment . MMM (1) . vol. 12572 , 213 - 226 .

( 2021 ) . MultiMedia Modeling - 27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021, Proceedings, Part I . MMM (1) . vol. 12572 ,

( 2021 ) . MultiMedia Modeling - 27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021, Proceedings, Part II . MMM (2) . vol. 12573 ,

Yebda T, Benois-Pineau J, Pech M, Amieva H, Middleton L, Bergelt M, Lokoc J, Skopal T et al. ( 2021 ) . Multimodal Sensor Data Analysis for Detection of Risk Situations of Fragile People in @home Environments . MMM (2) . vol. 12573 , 342 - 353 .

Zhao Y, Guo J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . MusiCoder: A Universal Music-Acoustic Encoder Based on Transformer . MMM (1) . vol. 12572 , 417 - 429 .

Karisch C, Leibetseder A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . NoShot Video Browser at VBS2021 . MMM (2) . vol. 12573 , 405 - 409 .

Dobranský M, Skopal T, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . On Fusion of Learned and Designed Features for Video Data Analytics . MMM (2) . vol. 12573 , 268 - 280 .

Lokoč J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S, Patras I ( 2021 ) . Preface .

Fu F, Xie T, Patras I, Jalali S ( 2021 ) . Relationship-based Neural Baby Talk . CoRR vol. abs/2103.04846 ,

Zhao S, Li X, Chen Z, Liu C, Peng C, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Res2-Unet: An Enhanced Network for Generalized Nuclear Segmentation in Pathological Images . MMM (2) . vol. 12573 , 87 - 98 .

Park S, Kim JU, Kim Y, Moon S-K, Ro YM, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning . MMM (1) . vol. 12572 , 391 - 402 .

Feng C, Tzimiropoulos G, Patras I ( 2021 ) . S3: Supervised Self-supervised Learning under Label Noise . CoRR vol. abs/2111.11288 ,

Veselý P, Mejzlík F, Lokoc J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . SOMHunter V2 at Video Browser Showdown 2021 . MMM (2) . vol. 12573 , 461 - 466 .

Wu J, Nguyen PA, Ma Z, Ngo C-W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . SQL-Like Interpretable Interactive Video Search . MMM (2) . vol. 12573 , 391 - 397 .

Bishay M, Palasek P, Priebe S, Patras I ( 2021 ) . SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis . IEEE Trans. Affect. Comput. vol. 12 , Article 4 , 949 - 961 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/103133

Gisolf F, Geradts ZJMH, Worring M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Search and Explore Strategies for Interactive Analysis of Real-Life Image Collections with Unknown and Unique Categories . MMM (2) . vol. 12573 , 244 - 255 .

Wang T, Feng N, Yu J, He Y, Hu Y, Chen Y-PP, Lokoc J, Skopal T et al. ( 2021 ) . Shot Boundary Detection Through Multi-stage Deep Convolution Neural Network . MMM (1) . vol. 12572 , 456 - 468 .

Wang J, Li Y, Lu H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Spatial Gradient Guided Learning and Semantic Relation Transfer for Facial Landmark Detection . MMM (1) . vol. 12572 , 678 - 690 .

Gajdusek P, Peska L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . SpotifyGraph: Visualisation of User's Preferences in Music . MMM (2) . vol. 12573 , 379 - 384 .

Wu Y, Hu R, Wang X, Hu C, Li G, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Stacked Sparse Autoencoder for Audio Object Coding . MMM (1) . vol. 12572 , 50 - 61 .

Umemura K, Kastner MA, Ide I, Kawanishi Y, Hirayama T, Doman K, Deguchi D, Murase H et al. ( 2021 ) . Tell as You Imagine: Sentence Imageability-Aware Image Captioning . MMM (2) . vol. 12573 , 62 - 73 .

Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I . Tensor Component Analysis for Interpreting the Latent Space of GANs . Conference: Proceedings of the British Machine Vision Conference 2021

10.5244/c.35.430

Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I ( 2021 ) . Tensor Component Analysis for Interpreting the Latent Space of GANs . BMVC . 222 - 222 .

Nefkens M, Hürst W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . The MovieWall: A New Interface for Browsing Large Video Collections . MMM (2) . vol. 12573 , 170 - 182 .

Chu W-T, Huang P-S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al. ( 2021 ) . Thermal Face Recognition Based on Multi-scale Image Synthesis . MMM (1) . vol. 12572 , 99 - 110 .

Wei J, Yang X, Dong Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Time-Dependent Body Gesture Representation for Video Emotion Recognition . MMM (1) . vol. 12572 , 403 - 416 .

Heller S, Gasser R, Illi C, Pasquinelli M, Sauter L, Spiess F, Schuldt H, Lokoc J et al. ( 2021 ) . Towards Explainable Interactive Multi-modal Video Retrieval with Vitrivr . MMM (2) . vol. 12573 , 435 - 440 .

Amirpour H, Çetinkaya E, Timmerer C, Ghanbari M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Towards Optimal Multirate Encoding for HTTP Adaptive Streaming . MMM (1) . vol. 12572 , 469 - 480 .

Kraus M, Seldschopf P, Minker W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Towards the Development of a Trustworthy Chatbot for Mental Health Applications . MMM (2) . vol. 12573 , 354 - 366 .

Huang C, Chan S, Bai C, Ding W, Zhang J, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Tropical Cyclones Tracking Based on Satellite Cloud Images: Database and Comprehensive Study . MMM (2) . vol. 12573 , 13 - 25 .

Wang F, Luo L, Zhu E, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al. ( 2021 ) . Two-Stage Real-Time Multi-object Tracking with Candidate Selection . MMM (2) . vol. 12573 , 49 - 61 .

Tzelepis C, Patras I ( 2021 ) . Uncertainty Propagation in Convolutional Neural Networks: Technical Report . CoRR vol. abs/2102.06064 ,

Lu Y, Wang Y, Xin Y, Wu D, Lu G, Lokoc J, Skopal T, Schoeffmann K et al. ( 2021 ) . Unsupervised Gaze: Exploration of Geometric Constraints for 3D Gaze Estimation . MMM (2) . vol. 12573 , 121 - 133 .

Li X, Wang W, Li Q, Guo L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Unsupervised Multi-shot Person Re-identification via Dynamic Bi-directional Normalized Sparse Representation . MMM (1) . vol. 12572 , 554 - 566 .

Hu M, Hu R, Wang X, Sheng R, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Unsupervised Temporal Attention Summarization Model for User Created Videos . MMM (1) . vol. 12572 , 519 - 530 .

Andreadis S, Moumtzidou A, Gkountakos K, Pantelidis N, Apostolidis K, Galanopoulos D, Gialampoukidis I, Vrochidis S et al. ( 2021 ) . VERGE in VBS 2021 . MMM (2) . vol. 12573 , 398 - 404 .

Lokoc J, Bátoryová J, Smrz D, Dobranský M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Video Search with Collage Queries . MMM (2) . vol. 12573 , 429 - 434 .

Hezel N, Schall K, Jung K, Barthel KU, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al. ( 2021 ) . Video Search with Sub-Image Keyword Transfer Using Existing Image Archives . MMM (2) . vol. 12573 , 484 - 489 .

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2021 ) . Video Summarization Using Deep Neural Networks: A Survey . Proc. IEEE vol. 109 , Article 11 , 1838 - 1863 .

Rossetto L, Baumgartner M, Ashena N, Ruosch F, Pernisch R, Heitz L, Bernstein A, Lokoc J et al. ( 2021 ) . VideoGraph - Towards Using Knowledge Graphs for Interactive Video Retrieval . MMM (2) . vol. 12573 , 417 - 422 .

Tzelepis C, Tzimiropoulos G, Patras I ( 2021 ) . WarpedGANSpace: Finding non-linear RBF paths in GAN latent space . CoRR vol. abs/2109.13357 ,

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2020 ) . AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization . IEEE Transactions on Circuits and Systems for Video Technology vol. 31 , ( 8 ) 3278 - 3292 .

10.1109/tcsvt.2020.3037883

https://qmro.qmul.ac.uk/xmlui/handle/123456789/68506

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2020 ) . Performance over Random . Conference: Proceedings of the 28th ACM International Conference on Multimedia1056 - 1064 .

10.1145/3394171.3413632

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69084

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2020 ) . Performance over Random: A Robust Evaluation Protocol for Video Summarization Methods . Mm 2020 Proceedings of the 28th ACM International Conference on Multimedia . 1056 - 1064 .

10.1145/3394171.3413632

Xie T-T, Tzelepis C, Patras I ( 2020 ) . Boundary Uncertainty in a Single-Stage Temporal Action Localization Network .

10.48550/arxiv.2008.11170

Xie T-T, Tzelepis C, Patras I ( 2020 ) . Temporal Action Localization with Variance-Aware Networks .

10.48550/arxiv.2008.11254

Xie T, Tzelepis C, Patras I ( 2020 ) . Boundary Uncertainty in a Single-Stage Temporal Action Localization Network . CoRR vol. abs/2008.11170 ,

Gkalelis N, Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Pittaras N, Vrochidis S, Mezaris V et al. ( 2020 ) . ITI-CERTH participation to TRECVID 2014 . 2014 TREC Video Retrieval Evaluation, TRECVID 2014 .

Markatopoulou F, Ioannidou A, Tzelepis C, Mironidis T, Galanopoulos D, Arestis-Chartampilas S, Pittaras N, Avgerinakis K et al. ( 2020 ) . ITI-CERTH participation to TRECVID 2015 . 2015 TREC Video Retrieval Evaluation, TRECVID 2015 .

Xie T-T, Tzelepis C, Patras I ( 2020 ) . Temporal Action Localization with Variance-Aware Networks . CoRR vol. abs/2008.11254 ,

Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I ( 2020 ) . Unsupervised Video Summarization via Attention-Driven Adversarial Learning . Lecture Notes in Computer Science . vol. 11961 , 492 - 504 .

10.1007/978-3-030-37731-1_40

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62307

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning . Conference: 2019 IEEE/CVF International Conference on Computer Vision (ICCV) vol. 00 , 6350 - 6359 .

10.1109/iccv.2019.00645

https://qmro.qmul.ac.uk/xmlui/handle/123456789/66273

Tao Y, Ling Z, Patras I ( 2019 ) . Universal Foreground Segmentation Based on Deep Feature Fusion Network for Multi-Scene Videos . IEEE Access vol. 7 , 158326 - 158337 .

10.1109/access.2019.2950639

https://qmro.qmul.ac.uk/xmlui/handle/123456789/68662

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning . Conference: International Conference on Computer Vision ( Seoul. Korea ) from: 27/09/2019 to: 02/11/2019 , 6351 - 6360 .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/61984

Apostolidis E, Metsai AI, Adamantidou E, Mezaris V, Patras I ( 2019 ) . A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization . Conference: Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery17 - 25 .

10.1145/3347449.3357482

https://qmro.qmul.ac.uk/xmlui/handle/123456789/62042

Mercier G, Markatopoulou F, Cozien R, Zampoglou M, Apostolidis E, Metsai AI, Papadopoulos S, Mezaris V et al. ( 2019 ) . Detecting Manipulations in Video . Video Verification in the Fake News Era , Springer Nature

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69121

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . Finding Near-Duplicate Videos in Large-Scale Collections . Video Verification in the Fake News Era , Springer Nature

https://qmro.qmul.ac.uk/xmlui/handle/123456789/61983

Markatopoulou F, Zampoglou M, Apostolidis E, Papadopoulos S, Mezaris V, Patras I, Kompatsiaris I ( 2019 ) . Finding Semantically Related Videos in Closed Collections . Video Verification in the Fake News Era , Springer Nature

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69120

Apostolidis E, Apostolidis K, Patras I, Mezaris V ( 2019 ) . Video Fragmentation and Reverse Search on the Web . Video Verification in the Fake News Era , Springer Nature

https://qmro.qmul.ac.uk/xmlui/handle/123456789/69119

Bishay M, Zoumpourlis G, Patras I ( 2019 ) . TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition . Conference: 30th British Machine Vision Conference ( Cardiff, UK ) from: 09/09/2019 to: 12/09/2019 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/59752

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning .

10.48550/arxiv.1908.07410

Bishay M, Zoumpourlis G, Patras I ( 2019 ) . TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition .

10.48550/arxiv.1907.09021

Mou W, Gunes H, Patras I ( 2019 ) . Alone versus In-a-group . ACM Transactions on Multimedia Computing Communications and Applications vol. 15 , ( 2 ) 1 - 23 .

10.1145/3321509

https://qmro.qmul.ac.uk/xmlui/handle/123456789/56695

Marras I, Palasek P, Patras I ( 2019 ) . Deep Mixture of MRFs for Human Pose Estimation . Lecture Notes in Computer Science . vol. 11363 , 717 - 733 .

10.1007/978-3-030-20893-6_45

Xie T, Yang X, Zhang T, Xu C, Patras I ( 2019 ) . Exploring Feature Representation and Training strategies in Temporal Action Localization .

10.48550/arxiv.1905.10608

Mou W, Gunes H, Patras I ( 2019 ) . Your Fellows Matter: Affect Analysis across Subjects in Group Videos . Conference: 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019) vol. 00 , 1 - 5 .

10.1109/fg.2019.8756514

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57782

Bishay M, Priebe S, Patras I ( 2019 ) . Can Automatic Facial Expression Analysis Be Used for Treatment Outcome Estimation in Schizophrenia? . Conference: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) vol. 00 , 1632 - 1636 .

10.1109/icassp.2019.8682652

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57744

Jang Y, Gunes H, Patras I ( 2019 ) . Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild . Computer Vision and Image Understanding vol. 182 , 17 - 29 .

10.1016/j.cviu.2019.01.006

https://qmro.qmul.ac.uk/xmlui/handle/123456789/68671

Bishay M, Palasek P, Priebe S, Patras I ( 2019 ) . SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis . IEEE Transactions on Affective Computing vol. 12 , ( 4 ) 949 - 961 .

10.1109/taffc.2019.2907628

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57767

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . FIVR: Fine-grained Incident Video Retrieval .

10.48550/arxiv.1809.04094

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . FIVR: Fine-Grained Incident Video Retrieval . IEEE Transactions on Multimedia vol. 21 , ( 10 ) 2638 - 2652 .

10.1109/tmm.2019.2905741

https://qmro.qmul.ac.uk/xmlui/handle/123456789/56381

Jang Y, Gunes H, Patras I ( 2019 ) . Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild .

10.48550/arxiv.1902.04042

Xie T, Yang X, Zhang T, Xu C, Patras I ( 2019 ) . Exploring Feature Representation and Training Strategies in Temporal Action Localization . Conference: 2019 IEEE International Conference on Image Processing (ICIP) vol. 00 , 1605 - 1609 .

10.1109/icip.2019.8803745

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74429

Mou W, Tzelepis C, Mezaris V, Gunes H, Patras I ( 2019 ) . A deep generic to specific recognition model for group membership analysis using non-verbal cues . Image and Vision Computing vol. 81 , 42 - 50 .

10.1016/j.imavis.2018.09.005

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55287

Mou W, Gunes H, Patras I ( 2019 ) . Alone versus In-a-group: A Multi-modal Framework for Automatic Affect Recognition . ACM Trans. Multim. Comput. Commun. Appl. vol. 15 , Article 2 , 47:1 - 47:1 .

10.1145/3321509

Markatopoulou F, Galanopoulos D, Tzelepis C, Mezaris V, Patras I ( 2019 ) . Concept-Based and Event-Based Video Search in Large Video Collections . Big Data Analytics for Large Scale Multimedia Search ,

Xie T, Yang X, Zhang T, Xu C, Patras I ( 2019 ) . Exploring Feature Representation and Training strategies in Temporal Action Localization . CoRR vol. abs/1905.10608 ,

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I ( 2019 ) . FIVR: Fine-Grained Incident Video Retrieval . IEEE Trans. Multim. vol. 21 , Article 10 , 2638 - 2652 .

10.1109/TMM.2019.2905741

https://qmro.qmul.ac.uk/xmlui/handle/123456789/68673

Ahmadi A, Marras I, Patras I ( 2019 ) . LikeNet: A Siamese motion estimation network trained in an unsupervised way . British Machine Vision Conference 2018, BMVC 2018 .

Jang Y, Gunes H, Patras I ( 2019 ) . Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild . CoRR vol. abs/1902.04042 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/68670

Bishay M, Zoumpourlis G, Patras I ( 2019 ) . TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition . BMVC . 154 - 154 .

Mou W, Gunes H, Patras I ( 2019 ) . Your Fellows Matter: Affect Analysis across Subjects in Group Videos . FG . 1 - 5 .

Andreadis S, Moumtzidou A, Galanopoulos D, Markatopoulou F, Apostolidis K, Mavropoulos T, Gialampoukidis I, Vrochidis S et al. ( 2019 ) . VERGE in VBS 2019 . Lecture Notes in Computer Science . vol. 11296 , 602 - 608 .

10.1007/978-3-030-05716-9_53

Zampoglou M, Markatopoulou F, Mercier G, Touska D, Apostolidis E, Papadopoulos S, Cozien R, Patras I et al. ( 2019 ) . Detecting Tampered Videos with Multimedia Forensics and Deep Learning . Lecture Notes in Computer Science . vol. 11295 , 374 - 386 .

10.1007/978-3-030-05710-7_31

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55029

Nixon L, Apostolidis E, Markatopoulou F, Patras I, Mezaris V ( 2019 ) . Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario . Lecture Notes in Computer Science . vol. 11295 , 143 - 155 .

10.1007/978-3-030-05710-7_12

https://qmro.qmul.ac.uk/xmlui/handle/123456789/55027

Miranda-Correa JA, Abadi MK, Sebe N, Patras I ( 2018 ) . AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups . IEEE Transactions on Affective Computing vol. 12 , ( 2 ) 479 - 493 .

10.1109/taffc.2018.2884461

https://qmro.qmul.ac.uk/xmlui/handle/123456789/54054

Correa JAM, Abadi MK, Sebe N, Patras I ( 2018 ) . AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups . IEEE Transactions on Affective Computing vol. 12 , Article 2 , 479 - 493 .

10.1109/TAFFC.2018.2884461

https://qmro.qmul.ac.uk/xmlui/handle/123456789/86242

Bishay M, Palasek P, Priebe S, Patras I ( 2018 ) . SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis .

10.48550/arxiv.1808.02531

Markatopoulou F, Mezaris V, Patras I ( 2018 ) . Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation . IEEE Transactions on Circuits and Systems for Video Technology

10.1109/TCSVT.2018.2848458

https://qmro.qmul.ac.uk/xmlui/handle/123456789/42010

Vasilyev A, Hansard M, Mareschal I, Patras I ( 2018 ) . A Model of Visual Search in the Presence of Age-Related Macular Degeneration . PERCEPTION . Conference: Proceedings of the AVA Christmas meeting vol. 47 , 573 - 573 .

Miranda-Correa JA, Patras I ( 2018 ) . A Multi-Task Cascaded Network for Prediction of Affect, Personality, Mood and Social Context Using EEG Signals . Conference: 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018)373 - 380 .

10.1109/fg.2018.00060

Apostolidis K, Markatopoulou F, Tzelepis C, Mezaris V, Patras I ( 2018 ) . Multimedia Processing Essentials . Personal Multimedia Preservation , Springer Nature

Palasek P, Patras I ( 2018 ) . Semi-supervised Fisher vector network .

10.48550/arxiv.1801.04438

Moumtzidou A, Andreadis S, Markatopoulou F, Galanopoulos D, Gialampoukidis I, Vrochidis S, Mezaris V, Kompatsiaris I et al. ( 2018 ) . VERGE in VBS 2018 . Lecture Notes in Computer Science . vol. 10705 , 444 - 450 .

10.1007/978-3-319-73600-6_48

Ahmadi A, Marras I, Patras I ( 2018 ) . LikeNet: A Siamese motion estimation network trained in an unsupervised way . British Machine Vision Conference 2018 Bmvc 2018 .

Tzelepis C, Mezaris V, Patras I ( 2018 ) . Linear Maximum Margin Classifier for Learning from Uncertain Data . IEEE Trans. Pattern Anal. Mach. Intell. vol. 40 , Article 12 , 2948 - 2962 .

10.1109/TPAMI.2017.2772235

Palasek P, Patras I ( 2018 ) . Semi-supervised Fisher vector network . CoRR vol. abs/1801.04438 ,

Batziou E, Michail E, Avgerinakis K, Vrochidis S, Patras I, Kompatsiaris I ( 2018 ) . Visual and audio analysis of movies video for emotion detection @ Emotional Impact of Movies task MediaEval 2018 . Ceur Workshop Proceedings . vol. 2283 ,

Tzelepis C, Mezaris V, Patras I ( 2017 ) . Linear Maximum Margin Classifier for Learning from Uncertain Data .

10.48550/arxiv.1504.03892

Tzelepis C, Mezaris V, Patras I ( 2017 ) . Linear Maximum Margin Classifier for Learning from Uncertain Data . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 40 , ( 12 ) 2948 - 2962 .

10.1109/tpami.2017.2772235

https://qmro.qmul.ac.uk/xmlui/handle/123456789/29012

Marras I, Palasek P, Patras I ( 2017 ) . Deep Globally Constrained MRFs for Human Pose Estimation . Conference: 2017 IEEE International Conference on Computer Vision (ICCV)3486 - 3495 .

10.1109/iccv.2017.375

https://qmro.qmul.ac.uk/xmlui/handle/123456789/34403

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris Y ( 2017 ) . Near-Duplicate Video Retrieval with Deep Metric Learning . Conference: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)347 - 356 .

10.1109/iccvw.2017.49

https://qmro.qmul.ac.uk/xmlui/handle/123456789/32414

Jang Y, Gunes H, Patras I ( 2017 ) . SmileNet: Registration-Free Smiling Face Detection in the Wild . Conference: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)1581 - 1589 .

10.1109/iccvw.2017.186

https://qmro.qmul.ac.uk/xmlui/handle/123456789/36405

Tao Y, Palasek P, Ling Z, Parras I ( 2017 ) . Background Modelling Based on Generative Unet . Conference: 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)1 - 6 .

10.1109/avss.2017.8078483

Palasek P, Patras I ( 2017 ) . Discriminative convolutional Fisher vector network for action recognition .

10.48550/arxiv.1707.06119

Galanopoulos D, Markatopoulou F, Mezaris V, Patras I ( 2017 ) . Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection . Conference: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval397 - 401 .

10.1145/3078971.3079043

Markatopoulou F, Galanopoulos D, Mezaris V, Patras I ( 2017 ) . Query and Keyframe Representations for Ad-hoc Video Search . Conference: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval407 - 411 .

10.1145/3078971.3079041

Collyda C, Apostolidis E, Pournaras A, Markatopoulou F, Mezaris V, Patras I ( 2017 ) . VideoAnalysis4ALL . Conference: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval470 - 474 .

10.1145/3078971.3079015

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57011

Marras I, Palasek P, Patras I ( 2017 ) . Deep Refinement Convolutional Networks for Human Pose Estimation . Conference: 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)446 - 453 .

10.1109/fg.2017.148

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22151

Bishav M, Patras I ( 2017 ) . Fusing Multilabel Deep Networks for Facial Action Unit Detection . Conference: 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)681 - 688 .

10.1109/fg.2017.86

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57745

Mou W, Tzelepis C, Mezaris V, Gunes H, Patras I ( 2017 ) . Generic to Specific Recognition Models for Membership Analysis in Group Videos . Conference: 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)512 - 517 .

10.1109/fg.2017.69

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74207

Miranda-Correa JA, Abadi MK, Sebe N, Patras I ( 2017 ) . AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups .

10.48550/arxiv.1702.02510

Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Andreadis S, Gialampoukidis I, Tachos S, Vrochidis S et al. ( 2017 ) . ITI-CERTH participation in TRECVID 2017 . 2017 Trec Video Retrieval Evaluation Trecvid 2017 .

Pittaras N, Markatopoulou F, Mezaris V, Patras I ( 2017 ) . Comparison of Fine-Tuning and Extension Strategies for Deep Convolutional Neural Networks . Lecture Notes in Computer Science . vol. 10132 , 102 - 114 .

10.1007/978-3-319-51811-4_9

https://qmro.qmul.ac.uk/xmlui/handle/123456789/23570

Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris Y ( 2017 ) . Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers . Lecture Notes in Computer Science . vol. 10132 , 251 - 263 .

10.1007/978-3-319-51811-4_21

https://qmro.qmul.ac.uk/xmlui/handle/123456789/23619

Moumtzidou A, Mironidis T, Markatopoulou F, Andreadis S, Gialampoukidis I, Galanopoulos D, Ioannidou A, Vrochidis S et al. ( 2017 ) . VERGE in VBS 2017 . Lecture Notes in Computer Science . vol. 10133 , 486 - 492 .

10.1007/978-3-319-51814-5_46

https://qmro.qmul.ac.uk/xmlui/handle/123456789/23658

Mou W, Gunes H, Patras I . Automatic Recognition of Emotions and Membership in Group Videos . 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops . Conference: 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)1478 - 1486 .

10.1109/cvprw.2016.185

Kotsia I, Zafeiriou S, Goudelis G, Patras I, Karpouzis K ( 2016 ) . Multimodal Sensing in Affective Gaming . Emotion in Games , vol. 4 , Springer Nature

Mou W, Gunes H, Patras I ( 2016 ) . Alone versus In-a-group . Conference: Proceedings of the 24th ACM international conference on Multimedia521 - 525 .

10.1145/2964284.2967276

Markatopoulou F, Mezaris V, Patras I ( 2016 ) . Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection . Conference: Proceedings of the 24th ACM international conference on Multimedia501 - 505 .

10.1145/2964284.2967271

Tzelepis C, Galanopoulos D, Mezaris V, Patras I ( 2016 ) . Learning to detect video events from zero or very few video examples . Image and Vision Computing vol. 53 , 35 - 44 .

10.1016/j.imavis.2015.09.005

AHMADI A, Patras I ( 2016 ) . UNSUPERVISED CONVOLUTIONAL NEURAL NETWORKS FOR MOTION ESTIMATION . http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=7527113 . Conference: Image Processing (ICIP), 2016 IEEE International Conference on ( Phoenix, Arizona, USA ) from: 25/09/2016 to: 28/09/2016 ,

10.1109/ICIP.2016.7532634

https://qmro.qmul.ac.uk/xmlui/handle/123456789/13000

Stefic D, Patras I ( 2016 ) . Action recognition using saliency learned from recorded human gaze . Image and Vision Computing vol. 52 , 195 - 205 .

10.1016/j.imavis.2016.06.006

https://qmro.qmul.ac.uk/xmlui/handle/123456789/13497

Palasek P, Patras I ( 2016 ) . Action Recognition Using Convolutional Restricted Boltzmann Machines . Conference: Proceedings of the 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction3 - 8 .

10.1145/2927006.2927012

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22210

Wang L, Patras I, Zhang J, Mori G, Davis L ( 2016 ) . Special Issue on Individual and Group Activities in Video Event Analysis . Computer Vision and Image Understanding vol. 144 , 1 - 2 .

10.1016/j.cviu.2016.01.008

Vrochidis S, Patras I, Kompatsiaris I ( 2016 ) . Gaze movement-driven random forests for query clustering in automatic video annotation . Multimedia Tools and Applications vol. 76 , ( 2 ) 2861 - 2889 .

10.1007/s11042-015-3221-1

Ahmadi A, Patras I ( 2016 ) . Unsupervised convolutional neural networks for motion estimation .

10.48550/arxiv.1601.06087

Markatopoulou F, Mezaris V, Patras I ( 2016 ) . Ordering of Visual Descriptors in a Classifier Cascade Towards Improved Video Concept Detection . Lecture Notes in Computer Science . vol. 9516 , 874 - 885 .

10.1007/978-3-319-27671-7_73

Tzelepis C, Mezaris V, Patras I ( 2016 ) . Video Event Detection Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty (KSVM-iGSU) . Lecture Notes in Computer Science . vol. 9516 , 3 - 15 .

10.1007/978-3-319-27671-7_1

Markatopoulou F, Galanopoulos D, Patras I, Mezaris V ( 2016 ) . ITI-CERTH in TRECVID 2016 Ad-hoc video search (AVS) . 2016 Trec Video Retrieval Evaluation Trecvid 2016 .

Markatopoulou F, Moumtzidou A, Galanopoulos D, Mironidis T, Kaltsa V, Ioannidou A, Symeonidis S, Avgerinakis K et al. ( 2016 ) . ITI-CERTH participation in TRECVID 2016 . 2016 Trec Video Retrieval Evaluation Trecvid 2016 .

Tzelepis C, Galanopoulos D, Mezaris V, Patras I ( 2016 ) . Learning to detect video events from zero or very few video examples . Image Vis. Comput. vol. 53 , 35 - 44 .

Kuranuki Y, Patras I ( 2016 ) . Minimal Filtered Channel Features for Pedestrian Detection . Conference: 2016 23rd International Conference on Pattern Recognition (ICPR)681 - 686 .

10.1109/icpr.2016.7899713

Markatopoulou F, Mezaris V, Patras I ( 2016 ) . Online Multi-Task Learning for Semantic Concept Detection in Video . Conference: 2016 IEEE International Conference on Image Processing (ICIP)186 - 190 .

10.1109/icip.2016.7532344

Ahmadi A, Patras I ( 2016 ) . Unsupervised Convolutional Neural Networks for Motion Estimation . Conference: 2016 IEEE International Conference on Image Processing (ICIP)1629 - 1633 .

10.1109/icip.2016.7532634

https://qmro.qmul.ac.uk/xmlui/handle/123456789/18791

Moumtzidou A, Mironidis T, Apostolidis E, Markatopoulou F, Ioannidou A, Gialampoukidis I, Avgerinakis K, Vrochidis S et al. ( 2016 ) . VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval . Lecture Notes in Computer Science . vol. 9517 , 394 - 399 .

10.1007/978-3-319-27674-8_39

https://qmro.qmul.ac.uk/xmlui/handle/123456789/57028

Tzelepis C, Mavridaki E, Mezaris V, Patras I ( 2016 ) . Video Aesthetic Quality Assessment Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty (KSVM-IGSU) . Conference: 2016 IEEE International Conference on Image Processing (ICIP)2410 - 2414 .

10.1109/icip.2016.7532791

https://qmro.qmul.ac.uk/xmlui/handle/123456789/74208

Tzelepis C, Galanopoulos D, Mezaris V, Patras I ( 2015 ) . Learning to detect video events from zero or very few video examples .

10.48550/arxiv.1511.08032

Abadi MK, Subramanian R, Kia SM, Avesani P, Patras I, Sebe N ( 2015 ) . DECAF: MEG-Based Multimodal Database for Decoding Affective Physiological Responses . IEEE Transactions on Affective Computing vol. 6 , ( 3 ) 209 - 222 .

10.1109/taffc.2015.2392932

Kalpakis G, Tsikrika¹ T, Markatopoulou F, Pittaras N, Vrochidis S, Mezaris V, Parras I, Kompatsiaris I ( 2015 ) . Concept Detection in Multimedia Web Resources about Home Made Explosives . Conference: 2015 10th International Conference on Availability, Reliability and Security632 - 641 .

10.1109/ares.2015.85

Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P ( 2015 ) . Face Alignment Assisted by Head Pose Estimation .

10.48550/arxiv.1507.03148

Gu L, Kanade T ( 2015 ) . Face Alignment . Encyclopedia of Biometrics , Springer Nature

Patras I ( 2015 ) . Face Pose Analysis . Encyclopedia of Biometrics , Springer Nature

Palasek P, Yang H, Xu Z, Hajimirza N, Izquierdo E, Patras I ( 2015 ) . A Flexible Calibration Method of Multiple Kinects for 3D Human Reconstruction . Conference: 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)1 - 4 .

10.1109/icmew.2015.7169829

https://qmro.qmul.ac.uk/xmlui/handle/123456789/12694

Yang H, Zou C, Patras I ( 2015 ) . Cascade of forests for face alignment . IET Computer Vision vol. 9 , ( 3 ) 321 - 330 .

10.1049/iet-cvi.2014.0085

Yang H, Patras I ( 2015 ) . Mirror, Mirror on the Wall, Tell me, is the Error Small? . Conference: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)4685 - 4693 .

10.1109/cvpr.2015.7299100

Yang H, Jia X, Patras I, Chan K-P ( 2015 ) . Random Subspace Supervised Descent Method for Regression Problems in Computer Vision . IEEE Signal Processing Letters vol. 22 , ( 10 ) 1816 - 1820 .

10.1109/lsp.2015.2437883

Abadi MK, Correa JAM, Wache J, Yang H, Patras I, Sebe N ( 2015 ) . Inference of Personality Traits and Affect Schedule by Analysis of Spontaneous Reactions to Affective Videos . Conference: 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)1 - 8 .

10.1109/fg.2015.7163100

Yang H, He X, Jia X, Patras I ( 2015 ) . Robust Face Alignment Under Occlusion via Regional Predictive Power Estimation . IEEE Transactions on Image Processing vol. 24 , ( 8 ) 2393 - 2403 .

10.1109/tip.2015.2421438

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22467

Markatopoulou F, Mezaris V, Pittaras N, Patras I ( 2015 ) . Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video . IEEE Transactions on Emerging Topics in Computing vol. 3 , ( 2 ) 193 - 204 .

10.1109/tetc.2015.2418714

https://qmro.qmul.ac.uk/xmlui/handle/123456789/8879

Yang H, Patras I ( 2015 ) . Mirror, mirror on the wall, tell me, is the error small? .

10.48550/arxiv.1501.05152

Yang H, Patras I ( 2015 ) . Privileged Information-Based Conditional Structured Output Regression Forest for Facial Point Detection . IEEE Transactions on Circuits and Systems for Video Technology vol. 25 , ( 9 ) 1507 - 1520 .

10.1109/tcsvt.2015.2389492

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22402

Markatopoulou F, Pittaras N, Papadopoulou O, Mezaris V, Patras I ( 2015 ) . A Study on the Use of a Binary Local Descriptor and Color Extensions of Local Descriptors for Video Concept Detection . Lecture Notes in Computer Science . vol. 8935 , 282 - 293 .

10.1007/978-3-319-14445-0_25

https://qmro.qmul.ac.uk/xmlui/handle/123456789/8660

Markatopoulou F, Mezaris V, Patras I ( 2015 ) . Cascade of Classifiers Based on Binary, Non-Binary and Deep Convolutional Network Descriptors for Video Concept Detection . Conference: 2015 IEEE International Conference on Image Processing (ICIP)1786 - 1790 .

10.1109/icip.2015.7351108

Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P ( 2015 ) . Face Alignment Assisted by Head Pose Estimation . 130.1 - 130.13 .

10.5244/c.29.130

Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P, Xie X, Jones MW et al. ( 2015 ) . Face Alignment Assisted by Head Pose Estimation . BMVC . 130.1 - 130.1 .

Markatopoulou F, Ioannidou A, Tzelepis C, Mironidis T, Galanopoulos D, Arestis-Chartampilas S, Pittaras N, Avgerinakis K et al. ( 2015 ) . ITI-CERTH participation to TRECVID 2015 . 2015 Trec Video Retrieval Evaluation Trecvid 2015 .

Chen M, Han J, Guo L, Wang J, Patras I ( 2015 ) . Identifying Valence and Arousal Levels via Connectivity between EEG Channels . Conference: 2015 International Conference on Affective Computing and Intelligent Interaction (ACII)63 - 69 .

10.1109/acii.2015.7344552

Abadi MK, Correa JAM, Wache J, Yang H, Patras I, Sebe N, IEEE ( 2015 ) . Inference of Personality Traits and Affect Schedule by Analysis of Spontaneous Reactions to Affective Videos . 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 2 .

Yang H, Patras I ( 2015 ) . Mirror, mirror on the wall, tell me, is the error small? . CoRR vol. abs/1501.05152 ,

Patras MV, BǍnacu CS, Popescu AM, Patraş I ( 2015 ) . The effect of limiting the right of appeal on the quality of the public procurement management system: Case study of Romania . Proceedings of the 26th International Business Information Management Association Conference Innovation Management and Sustainable Economic Competitive Advantage from Regional Development to Global Growth Ibima 2015 . 521 - 530 .

Moumtzidou A, Avgerinakis K, Apostolidis E, Markatopoulou F, Apostolidis K, Mironidis T, Vrochidis S, Mezaris V et al. ( 2015 ) . VERGE: A Multimodal Interactive Video Search Engine . Lecture Notes in Computer Science . vol. 8936 , 249 - 254 .

10.1007/978-3-319-14442-9_23

https://qmro.qmul.ac.uk/xmlui/handle/123456789/10787

Yang H, Patras I ( 2014 ) . Fine-Tuning Regression Forests Votes for Object Alignment in the Wild . IEEE Transactions on Image Processing vol. 24 , ( 2 ) 619 - 631 .

10.1109/tip.2014.2383325

https://qmro.qmul.ac.uk/xmlui/handle/123456789/22607

Kaymak S, Patras I ( 2014 ) . Multimodal random forest based tensor regression . IET Computer Vision vol. 8 , ( 6 ) 650 - 657 .

10.1049/iet-cvi.2013.0320

Stefic D, Patras I ( 2014 ) . Learning Visual Saliency Using Topographic Independent Component Analysis . Conference: 2014 IEEE International Conference on Image Processing (ICIP)1130 - 1134 .

10.1109/icip.2014.7025225

https://qmro.qmul.ac.uk/xmlui/handle/123456789/12720

Burelli P, Triantafyllidis G, Patras I ( 2014 ) . Non-Invasive Player Experience Estimation from Body Motion and Game Context . Conference: 2014 IEEE Conference on Computational Intelligence and Games1 - 7 .

10.1109/cig.2014.6932871

Stefic D, Patras I ( 2014 ) . Learning visual saliency using topographic independent component analysis . 2014 IEEE International Conference on Image Processing Icip 2014 . 1130 - 1134 .

10.1109/ICIP.2014.7025225

Yang H, Zou C, Patras I ( 2014 ) . Face sketch landmarks localization in the wild . IEEE Signal Processing Letters vol. 21 , ( 11 ) 1321 - 1325 .

10.1109/LSP.2014.2333544

Gkalelis N, Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Pittaras N, Vrochidis S, Mezaris V et al. ( 2014 ) . ITI-CERTH participation to TRECVID 2014 . 2014 Trec Video Retrieval Evaluation Trecvid 2014 .

Jia X, Yang H, Chan K-P, Patras I ( 2014 ) . Structured Semi-supervised Forest for Facial Landmarks Localization with Face Mask Reasoning . Conference: Proceedings of the British Machine Vision Conference 201485.1 - 85.13 .

10.5244/c.28.85

Yang H, Patras I ( 2013 ) . Sieving Regression Forest Votes for Facial Feature Detection in the Wild . 1936 - 1943 .

10.1109/iccv.2013.243

Yang H, Patras I ( 2013 ) . Privileged Information-based Conditional Regression Forest for Facial Feature Detection . Conference: 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)1 - 6 .

10.1109/fg.2013.6553766

G VKB, Patras I ( 2013 ) . Supervised Dictionary Learning for Action Localization . Conference: 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)1 - 8 .

10.1109/fg.2013.6553745

Rudovic O, Pantic M, Patras IY ( 2013 ) . Coupled Gaussian Processes for Pose-Invariant Facial Expression Recognition . IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE vol. 35 , ( 6 ) 1357 - 1369 .

10.1109/TPAMI.2012.233

Kaymak S, Patras I ( 2013 ) . Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models . Lecture Notes in Computer Science . vol. 7729 , 160 - 170 .

10.1007/978-3-642-37484-5_14

Kotsia I, Patras I ( 2013 ) . Exploring the Similarities of Neighboring Spatiotemporal Points for Action Pair Matching . Lecture Notes in Computer Science . vol. 7726 , 624 - 635 .

10.1007/978-3-642-37431-9_48

Yang H, Patras I ( 2013 ) . Face Parts Localization Using Structured-Output Regression Forests . Lecture Notes in Computer Science . vol. 7725 , 667 - 679 .

10.1007/978-3-642-37444-9_52

Koelstra S, Patras I ( 2013 ) . Fusion of facial expressions and EEG for implicit affective tagging . IMAGE AND VISION COMPUTING vol. 31 , ( 2 ) 164 - 174 .

10.1016/j.imavis.2012.10.002

Nikolopoulos S, Zafeiriou S, Patras I, Kompatsiaris I ( 2013 ) . High order pLSA for indexing tagged images . SIGNAL PROCESSING vol. 93 , ( 8 ) 2212 - 2228 .

10.1016/j.sigpro.2012.08.004

Guo W, Hu W, Boulgouris NV, Patras I ( 2013 ) . Semi-Supervised Visual Recognition With Constrained Graph Regularized Non Negative Matrix Factorization . Conference: 2013 IEEE International Conference on Image Processing2743 - 2747 .

10.1109/icip.2013.6738565

Kotsia I, Guo W, Patras I ( 2012 ) . Higher rank Support Tensor Machines for visual recognition . Pattern Recognition vol. 45 , ( 12 ) 4192 - 4203 .

10.1016/j.patcog.2012.04.033

Oveisi F, Oveisi S, Efranian A, Patras I ( 2012 ) . Nonlinear Independent Component Analysis for EEG-Based Brain-Computer Interface Systems . Independent Component Analysis for Audio and Biosignal Applications , IntechOpen

G VKB, Patras I ( 2012 ) . Learning Codebook Weights for Action Detection . Conference: 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops vol. 1 , 27 - 32 .

10.1109/cvprw.2012.6239257

Kotsia I, Patras I, Fotopoulos S ( 2012 ) . AFFECTIVE GAMING: BEYOND USING SENSORS . Conference: 2012 5th International Symposium on Communications, Control and Signal Processing vol. 1 , 1 - 4 .

10.1109/isccsp.2012.6217768

Vrochidis S, Patras I, Kompatsiaris I ( 2012 ) . EXPLOITING GAZE MOVEMENTS FOR AUTOMATIC VIDEO ANNOTATION . Conference: 2012 13th International Workshop on Image Analysis for Multimedia Interactive Services vol. 1 , 1 - 4 .

10.1109/wiamis.2012.6226766

Guo W, Kotsia I, Patras I ( 2012 ) . Tensor learning for regression . IEEE Trans Image Process vol. 21 , ( 2 ) 816 - 827 .

10.1109/TIP.2011.2165291

Yang H, Liu X, Patras I ( 2012 ) . A Simple and Effective Extrinsic Calibration Method of a Camera and a Single Line Scanning Lidar . 21st International Conference on Pattern Recognition . Conference: ICPR 2012

Yang H, Zhang Y, Liu X, Patras I ( 2012 ) . Coupled 3D Tracking and Pose Optimization of Rigid Objects Using Particle Filter . 21st International Conference on Pattern Recognition . Conference: ICPR 2012

Koelstra S, Muhl C, Soleymani M, Lee J-S, Yazdani A, Ebrahimi T, Pun T, Nijholt A et al. ( 2012 ) . DEAP: A Database for Emotion Analysis Using Physiological Signals . IEEE TRANSACTIONS ON AFFECTIVE COMPUTING vol. 3 , ( 1 ) 18 - 31 .

10.1109/T-AFFC.2011.15

Kotsia I, Guo W, Patras I ( 2012 ) . Higher Rank Support Tensor Machines . Lecture Notes in Computer Science . vol. 7432 , 31 - 40 .

10.1007/978-3-642-33191-6_4

Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I ( 2012 ) . Image Interpretation by Combining Ontologies and Bayesian Networks . Lecture Notes in Computer Science . vol. 7297 , 307 - 314 .

10.1007/978-3-642-30448-4_39

Chatzilari E, Nikolopoulos S, Patras I, Kompatsiaris I ( 2012 ) . Leveraging social media for scalable object detection . PATTERN RECOGNITION vol. 45 , ( 8 ) 2962 - 2979 .

10.1016/j.patcog.2012.02.006

Kumar BGV, Kotsia I, Patras I ( 2012 ) . Max-margin Non-negative Matrix Factorization . IMAGE AND VISION COMPUTING vol. 30 , ( 4-5 ) 279 - 291 .

10.1016/j.imavis.2012.02.010

Kotsia I, Patras I ( 2012 ) . SUPPORT TENSOR ACTION SPOTTING . 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012) . 1397 - 1400 .

10.1109/ICIP.2012.6467130

Oveisi F, Oveisi S, Efranian A, Patras I ( 2012 ) . Tree-Structured Feature Extraction Using Mutual Information . IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS vol. 23 , ( 1 ) 127 - 137 .

10.1109/TNNLS.2011.2178447

Kunitsyn VE, Nesterov IA, Shalimov SL ( 2011 ) . Japan megathrust earthquake on March 11, 2011: GPS-TEC evidence for ionospheric disturbances . JETP Letters . vol. 94 , 616 - 620 .

10.1134/s0021364011200082

Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I ( 2011 ) . Evidence-driven image interpretation by combining implicit and explicit knowledge in a Bayesian network . IEEE Trans Syst Man Cybern B Cybern vol. 41 , ( 5 ) 1366 - 1381 .

10.1109/TSMCB.2011.2147781

Vrochidis S, Patras I, Kompatsiaris I ( 2011 ) . An eye-tracking-based approach to facilitate interactive video search . Conference: Proceedings of the 1st ACM International Conference on Multimedia Retrieval1 - 8 .

10.1145/1991996.1992039

Oikonomopoulos A, Patras I, Pantic M ( 2011 ) . Spatiotemporal localization and categorization of human actions in unsegmented image sequences . IEEE Transactions on Image Processing vol. 20 , ( 4 ) 1126 - 1140 .

10.1109/TIP.2010.2076821

Vrochidis S, Kompatsiaris I, Patras I ( 2011 ) . Utilizing Implicit User Feedback to Improve Interactive Video Retrieval . Advances in Multimedia vol. 2011 , ( 1 ) 1 - 18 .

10.1155/2011/310762

Soleymani M, Koelstra S, Patras I, Pun T ( 2011 ) . Continuous Emotion Detection in Response to Music Videos . Conference: Face and Gesture 2011 vol. 1 , 803 - 808 .

10.1109/fg.2011.5771352

Nikolopoulos S, Giannakidou E, Kompatsiaris I, Patras I, Vakali A ( 2011 ) . Combining Multi-modal Features for Social Media Analysis . Social Media Modeling and Computing , Springer Nature

Chatzilari E, Nikolopoulos S, Patras I, Kompatsiaris I ( 2011 ) . Enhancing Computer Vision Using the Collective Intelligence of Social Media . New Directions in Web Data Management 1 , vol. 331 , Springer Nature

Guo W, Kotsia I, Patras I ( 2011 ) . Higher order Support tensor regression for head pose estimation . International Workshop on Image Analysis for Multimedia Interactive Services .

Moumtzidou A, Sidiropoulos P, Vrochidis S, Gkalelis N, Nikolopoulos S, Mezaris V, Kompatsiaris I, Patras I ( 2011 ) . ITI-CERTH participation to TRECVID 2011 . 2011 Trec Video Retrieval Evaluation Notebook Papers .

Kumar VBG, Patras I, Kotsia I ( 2011 ) . Max-Margin Semi-NMF . Conference: Procedings of the British Machine Vision Conference 2011129.1 - 129.11 .

10.5244/c.25.129

Kotsia I, Patras I ( 2011 ) . Support Tucker Machines . 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) . 633 - 640 .

10.1109/CVPR.2011.5995663

Koelstra S, Pantic M, Patras I ( 2010 ) . A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 32 , ( 11 ) 1940 - 1954 .

10.1109/TPAMI.2010.50

Oikonomopoulos A, Patras I, Pantic M ( 2010 ) . Discriminative space-time voting for joint recognition and localization of actions . Conference: Proceedings of the 2nd international workshop on Social signal processing11 - 16 .

10.1145/1878116.1878122

Passino G, Patras I, Izquierdo E ( 2010 ) . Aspect coherence for graph-based semantic image labelling . IET COMPUT VIS vol. 4 , ( 3 ) 183 - 194 .

10.1049/iet-cvi.2008.0093

Patras I, Hancock ER ( 2010 ) . Coupled Prediction Classification for Robust Visual Tracking . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 32 , ( 9 ) 1553 - 1567 .

10.1109/TPAMI.2009.175

Kotsia I, Patras I ( 2010 ) . Multiplicative Update Rules for Multilinear Support Tensor Machines . 2010 20th International Conference on Pattern Recognition . Conference: 2010 20th International Conference on Pattern Recognition33 - 36 .

10.1109/icpr.2010.17

Passino G, Patras I, Izquierdo E ( 2010 ) . Pyramidal Model for Image Semantic Segmentation . 2010 20th International Conference on Pattern Recognition . Conference: 2010 20th International Conference on Pattern Recognition1554 - 1557 .

10.1109/icpr.2010.384

Rudovic O, Patras I, Pantic M ( 2010 ) . Regression-Based Multi-view Facial Expression Recognition . 2010 20th International Conference on Pattern Recognition . Conference: 2010 20th International Conference on Pattern Recognition4121 - 4124 .

10.1109/icpr.2010.1001

Vrochidis S, Kompatsiaris I, Patras I ( 2010 ) . Optimizing visual search with implicit user feedback in interactive video retrieval . Conference: Proceedings of the ACM International Conference on Image and Video Retrieval274 - 281 .

10.1145/1816041.1816082

Kotsia I, Patras I ( 2010 ) . Relative Margin Support Tensor Machines for gait and action recognition . Conference: Proceedings of the ACM International Conference on Image and Video Retrieval446 - 453 .

10.1145/1816041.1816107

Ognjen R, Ioannis P, Maja P ( 2010 ) . Facial Expression Invariant Head Pose Normalization using Gaussian Process Regression . Conference: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops28 - 33 .

10.1109/cvprw.2010.5543269

Koelstra S, Pantic M, Patras I ( 2010 ) . A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models . IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 32 , 1940-1954 - 1940-1954 .

10.1109/TPAMI.2010.50

Kumar.B.G V, Patras I ( 2010 ) . A discriminative voting scheme for object detection using hough forests . British Machine Vision Conference Bmvc 2010 Proceedings .

Rudovic O, Patras I, Pantic M ( 2010 ) . Coupled Gaussian Process Regression for Pose-Invariant Facial Expression Recognition . Lecture Notes in Computer Science . vol. 6312 , 350 - 363 .

10.1007/978-3-642-15552-9_26

Vrochidis S, Kompatsiaris I, Patras I ( 2010 ) . Exploiting implicit user feedback in interactive video retrieval . WIAMIS . 1 - 4 .

Guo W, Patras I ( 2010 ) . Learning output-kernel-dependent regression for human pose estimation . British Machine Vision Conference Bmvc 2010 Proceedings .

Koelstra S, Yazdani A, Soleymani M, Muhl C, Lee J-S, Nijholt A, Pun T, Ebrahimi T et al. ( 2010 ) . Single Trial Classification of EEG and Peripheral Physiological Signals for Recognition of Emotions Induced by Music Videos . BRAIN INFORMATICS, BI 2010 . vol. 6334 , 89 - 100 .

10.1007/978-3-642-15314-3_9

Oikonomopoulos A, Pantic M, Patras I ( 2009 ) . Sparse B-spline polynomial descriptors for human activity recognition . IMAGE VISION COMPUT vol. 27 , ( 12 ) 1814 - 1825 .

10.1016/j.imavis.2009.05.010

Guo W, Patras I ( 2009 ) . Discriminative 3D Human Pose Estimation from Monocular Images via Topological Preserving Hierarchical Affinity Clustering . Conference: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops9 - 15 .

10.1109/iccvw.2009.5457725

Koelstra S, Mühl C, Patras I ( 2009 ) . EEG analysis for implicit tagging of video data . Conference: 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops1 - 6 .

10.1109/acii.2009.5349482

Passino G, Patras I, Izquierdo E ( 2009 ) . Latent Semantics Local Distribution for CRF-based Image Semantic Segmentation . Proceedings of the British Machine Vision Conference (BMVC 2009) . Conference: BMVC 2009 ( London, England ) from: 07/09/2009 to: 10/09/2009 , 1 - 12 .

10.5244/C.23.26

Passino G, Piatrik T, Patras I, Izquierdo E ( 2009 ) . A Multimedia Content Semantics Extraction Framework for Enhanced Social Interaction . Adjunct proceedings EuroITV 2009 Networked Television . Conference: EuroITV 2009 ( Leuven, Belgium ) from: 03/06/2009 to: 05/06/2009 , 89 - 91 .

Oikonomopoulos A, Patras I, Pantic M ( 2009 ) . An implicit spatiotemporal shape model for human activity localization and recognition . 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops . Conference: 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops27 - 33 .

10.1109/cvprw.2009.5204262

Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I, Perner P ( 2009 ) . An Evidence-Driven Probabilistic Inference Framework for Semantic Image Understanding . MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION . vol. 5632 , 525 - 539 .

10.1007/978-3-642-03070-3_40

Oikonomopoulos A, Patras I, Pantic M ( 2009 ) . An Implicit Spatiotemporal Shape Model for Human Activity Localization and Recognition . 2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2 . 786 - 792 .

Oikonomopoulos A, Patras I, Pantic M ( 2009 ) . An implicit spatiotemporal shape model for human activity localization and recognition . 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Cvpr Workshops 2009 . 27 - 33 .

10.1109/CVPRW.2009.5204262

Passino G, Patras I, Izquierdo E ( 2009 ) . CONTEXT AWARENESS IN GRAPH-BASED IMAGE SEMANTIC SEGMENTATION VIA VISUAL WORD DISTRIBUTIONS . 2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES . 33 - 36 .

10.1109/WIAMIS.2009.5031425

( 2009 ) . Face Acquisition . Encyclopedia of Biometrics , Springer Nature

Patras I ( 2009 ) . Face Pose Analysis . Encyclopedia of Biometrics , Springer Nature

Koelstra S, Patras I ( 2009 ) . THE FAST-3D SPATIO-TEMPORAL INTEREST REGION DETECTOR . 2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES . 242 - 245 .

10.1109/WIAMIS.2009.5031478

Oikonomopoulos A, Pantic M, Patras I ( 2008 ) . Human Gesture Recognition using Sparse B-spline Polynomial Representations . Belgian Netherlands Artificial Intelligence Conference . 193 - 200 .

Andreopoulos Y, Patras I ( 2008 ) . Incremental refinement of image salient-point detection . IEEE Transactions on Image Processing vol. 17 , ( 9 ) 1685 - 1699 .

10.1109/TIP.2008.2001051

Passino G, Patras I, Izquierdo E ( 2008 ) . Aspect coherence for graph-based image labelling . Conference: 5th International Conference on Visual Information Engineering (VIE 2008)94 - 99 .

10.1049/cp:20080290

Oikonomopoulos A, Pantic M, Patras I ( 2008 ) . B-spline Polynomial Descriptors for Human Activity Recognition . 2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3 . 1622 - 1627 .

10.1109/CVPRW.2008.4563175

Oikonomopoulos A, Pantic M, Patras I ( 2008 ) . B-spline polynomial descriptors for human activity recognition . CVPR Workshops . 1 - 6 .

10.1109/CVPRW.2008.4563175

Patras I, Andreopoulos Y ( 2008 ) . Incremental salient point detection . 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 . 1337 - 1340 .

10.1109/ICASSP.2008.4517865

Passino G, Patras I, Izquierdo E ( 2008 ) . ON THE ROLE OF STRUCTURE IN PART-BASED OBJECT DETECTION . 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5 . 65 - 68 .

PATRAS I, Lagendijk RL, Hendriks EA ( 2007 ) . Bayesian Confidence Measures for Block-based Motion Estimation . IEEE trans. Circuits and Systems for Video Technology vol. 17 Issue 8 , 988 - 995 .

Patras I, Hendriks EA, Lagendijk RL ( 2007 ) . Probabilistic confidence measures for block matching motion estimation . IEEE T CIRC SYST VID vol. 17 , ( 8 ) 988 - 995 .

10.1109/TCSVT.2007.903121

PATRAS I, Hancock ER ( 2007 ) . Regression-ased Template Tracking in Presence of Occlusions . Conference: International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), Santorini, Greece

10.1109/WIAMIS.2007.74

Pogalin E, Redert A, Patras I, Hendriks EA, Pollefeys M, Daniilidis K ( 2007 ) . Gaze tracking by using factorized likelihoods particle filtering and stereo vision . Third International Symposium on 3D Data Processing, Visualization, and Transmission, Proceedings . 57 - 64 .

10.1109/3DPVT.2006.66

PATRAS I, Paragios N, Oikonomopoulos A, Pantic M, Huang TS, Nijholt A, Pantic M, Pentland A ( 2007 ) . Particle Filtering Tracking Scheme for Trajectory-based Recognition of Human Actions . Artificial Intelligence for Human Computing , Springer ( Lecture Notes in Artifical Intelligence - Volume 4451 ),

Patras I, Hancock ER ( 2007 ) . Regression tracking with data relevance determination . 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8 . 2062 - 2069 .

10.1109/CVPR.2007.383239

Patras I, Hancock E ( 2007 ) . Template tracking with observation relevance determination . 2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7 . 501 - 504 .

10.1109/ICIP.2007.4379001

Oikonomopoulos A, Patras I, Pantic M, Paragios N, Huang TS, Nijholt A, Pantic M, Pentland A ( 2007 ) . Trajectory-based representation of human actions . Artificial Intelligence for Human Computing . vol. 4451 , 133 - 154 .

10.1007/978-3-540-72348-6_7

PATRAS I, Pantic M, Oikonomopoulos A ( 2006 ) . Kernel-based Recognition of Human Actions Using Spatiotemporal Salient Points . Conference: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Workshop on Vision for HCI, New York, USA - June 2006

10.1109/CVPRW.2006.114

Oikonomopoulos A, Patras I, Pantic M ( 2006 ) . Spatiotemporal salient points for visual recognition of human actions . IEEE Trans Syst Man Cybern B Cybern vol. 36 , ( 3 ) 710 - 719 .

10.1109/tsmcb.2005.861864

Pantic M, Patras I ( 2006 ) . Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences . IEEE Trans Syst Man Cybern B Cybern vol. 36 , ( 2 ) 433 - 449 .

10.1109/tsmcb.2005.859075

Diplaros A, Gevers T, Patras I ( 2006 ) . Combining color and shape information for illumination-viewpoint invariant object recognition . IEEE Trans Image Process vol. 15 , ( 1 ) 1 - 11 .

10.1109/tip.2005.860320

PATRAS I, Valstar MF, Pantic M ( 2005 ) . Learning Spatiotemporal Models of Facial Expressions . Conference: International Conference on Measuring Behaviour, Wageningen - September 2005

PATRAS I, Pantic M, Valstar MF ( 2005 ) . Facial Action Unit Detection Using Probabilistically Actively Learned Support Vector Machines on Tracked Facial Point Data . Conference: Proceedings of IEEE International Confernece on Computer Visision and Pattern Recognition, workshop on Vision for HCI, San Diego, USA - June 2005

10.1109/CVPR.2005.457

Pantic M, Patras I ( 2005 ) . Detecting Facial Actions and their Temporal Segments in Nearly Frontal-View Face Image Sequences . Conference: 2005 IEEE International Conference on Systems, Man and Cybernetics vol. 4 , 1 - 6 .

10.1109/icsmc.2005.1571665

Pantic M, Patras I ( 2005 ) . Detecting facial actions and their temporal segments in nearly frontal-view face image sequences . INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS . 3358 - 3363 .

Oikonomopoulos A, Patras I, Pantic M ( 2005 ) . Spatiotemporal saliency for human action recognition . 2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2 . 430 - 433 .

10.1109/ICME.2005.1521452

Patras I, Pantic M ( 2005 ) . Tracking deformable motion . INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS . 1066 - 1071 .

10.1109/icsmc.2005.1571287

Patras I, Worring M, van den Boomgaard R ( 2004 ) . Dense motion estimation using regularization constraints on local parametric models . IEEE Trans Image Process vol. 13 , ( 11 ) 1432 - 1443 .

10.1109/tip.2004.836179

PATRAS I, Pantic M, Valstar M ( 2004 ) . Multilevel Motion History for Facial Action Detection from Face Video . Conference: IEEE International Conference on Systems, Management and Cybernetics, Den Haag, The Netherlands - October 2005

Diplaros A, Gevers T, Patras I, Santini S, Schettini R ( 2004 ) . Combining color and shape information for content-based image retrieval on the Internet . INTERNET IMAGING V . vol. 5304 , 132 - 141 .

10.1117/12.525667

Valstar M, Patras I, Pantic M ( 2004 ) . Facial action unit recognition using temporal templates . RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS . 253 - 258 .

10.1109/roman.2004.1374768

Valstar M, Pantic M, Patras I ( 2004 ) . Motion History for Facial Action Detection in Video . Conference: 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583) vol. 1 , 635 - 640 .

10.1109/icsmc.2004.1398371

Winkelman F, Patras I ( 2004 ) . Online globally consistent mosaicing using an efficient representation . 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 . 3116 - 3121 .

10.1109/icsmc.2004.1400818

Patras I, Pantic M ( 2004 ) . Particle filtering with factorized likelihoods for tracking facial features . SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS . 97 - 102 .

10.1109/AFGR.2004.1301515

Pantic M, Patras I ( 2004 ) . Temporal modeling of facial actions from face profile image sequences . 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3 . 49 - 52 .

10.1109/icme.2004.1394122

PATRAS I, Gevers T, Diplaros A ( 2003 ) . Color-Shape Context for Object Recognition . Conference: IEEE Workshop on Color and Photometric Methods in Computer Vision (in conjunction with ICCV 2003), Nice, France

Patras I, Hendriks EA, Lagendijk RL ( 2003 ) . Semi-automatic object-based video segmentation with labeling of color segments . SIGNAL PROCESS-IMAGE vol. 18 , ( 1 ) 51 - 65 .

10.1016/S0923-5965(02)00092-9

PATRAS I, Raaijmakers S, Snoek C, van Rest J, Worring M, van Leeuwen D, den Hartog J, Vendring J ( 2002 ) . TREC Feature Extraction by Active Learning . Conference: 11th Text Retrieval Confernece (TREC), Gaithersburg, MD - November 2002

Patras I, Hendriks EA, Lagendijk RL ( 2002 ) . Confidence measures for block matching motion estimation . 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS . 277 - 280 .

10.1109/icip.2002.1039941

Pantic M, Patras I, Rothkrantz L ( 2002 ) . Facial action recognition in face profile image sequences . IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS . 37 - 40 .

10.1109/ICME.2002.1035712

Patras I, Worring M, Kasturi R, Laurendeau D, Suen C ( 2002 ) . Regularized patch motion estimation . 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS . 323 - 326 .

10.1109/ICPR.2002.10010

Patras I, Hendriks EA, Lagendijk RL ( 2001 ) . Video segmentation by MAP labeling of watershed segments . IEEE T PATTERN ANAL vol. 23 , ( 3 ) 326 - 332 .

10.1109/34.910886

PATRAS I, List J, Geusebroek J-M, den Hartog J, Hiemstra D, van Ballegooij A, Worring M, Snoek C et al. ( 2001 ) . Lazy Users and Automatic Video Retrieval Tools in (the) Lowlands . Conference: Proceedings of the 10th Text Retrieval Conference (TREC), NIST 2001

PATRAS I, Hendriks EA, Broekhoven M, Hupkens T ( 2001 ) . Robust Region Merging for Motion Based Sementation Using the Kolmogorov-Smirnov Test . Image Processing and Communications vol. 6 , ( 3-4 ) 27 - 34 .

Patras I, Hendriks EA, Lagendijk RL ( 1998 ) . Iterative motion estimation - segmentation method using watershed segments . IEEE International Conference on Image Processing . vol. 2 , 642 - 646 .

Patras IK, Hendriks EA, Tziritas GG ( 1997 ) . Construction of multiple views using jointly estimated motion and disparity fields . Proceedings of SPIE--the International Society for Optical Engineering . Conference: Visual Communications and Image Processing '97 vol. 3024 , 380 - 390 .

10.1117/12.263250

Xenos A, Stafylakis T, Patras I, Tzimiropoulos G . A Simple Baseline for Knowledge-Based Visual Question Answering . Conference: Empirical Methods in Natural Language Processing from: 06/12/2023 to: 10/12/2023 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/92288

Sun Z, Song S, Patras I, Tzimiropoulos G . CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition . Advances in Neural Information Processing Systems .

https://qmro.qmul.ac.uk/xmlui/handle/123456789/101179

Zhang Z, Li C, Liu X, Shen C, Liu Z, Patras I . Confidence Should Be Calibrated More Than One Turn Deep . Conference: The 64th Annual Meeting of the Association for Computational Linguistics

Zhang Z, Liu Z, Patras I . GrACE: A Generative Approach to Better Confidence Elicitation and Efficient Test-Time Scaling in Large Language Models . Conference: The 64th Annual Meeting of the Association for Computational Linguistics

Izquierdo EE, Patras IE, Hao PE, Gunes HE, Asioli SE, BANGERT TE, Klavdianos PE, Brenner ME et al. . MMV Members .

Oldfield J, Tzelepis C, Panagakis Y, Nicolaou, A. M, Patras I . PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs . Conference: International Conference on Learning Representations (ICLR) ( Kigali, Rwanda )

https://qmro.qmul.ac.uk/xmlui/handle/123456789/84510

Oldfield J . Parts of Speech–Grounded Subspaces in Vision-Language Models . Conference: 37th Conference on Neural Information Processing Systems

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91681

Zhao Z, Patras I . Prompting Visual-Language Models for Dynamic Facial Expression Recognition . Conference: The 34th British Machine Vision Conference ( Aberdeen, UK ) from: 20/11/2023 to: 24/10/2023 ,

https://qmro.qmul.ac.uk/xmlui/handle/123456789/91783

Global main menu

Areas of study

Study at Queen Mary

Experience Queen Mary

Research and Innovation

Research by faculties and centres

Collaborations and partnerships

Publications: PROF Ioannis Patras