Publications: Prof. Ioannis Patras
(
2026
)
.
HDD-Unet: A Unet-based architecture for low-light image enhancement
.
Image and Vision Computing
vol.
167
,
Oldfield J, Im S, Li S, Nicolaou MA, Patras I, Chrysos GG
(
2026
)
.
Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
.
Papadopoulos S, Patsiouras E, Ioannidis K, Vrochidis S, Kompatsiaris I, Patras I
(
2026
)
.
Unsupervised Object Localization driven by self-supervised foundation models: A comprehensive review
.
Image and Vision Computing
vol.
165
,
Ge J, Zhang X, Cao J, Zhu X, Liu W, Gao Q, Cao B, Wang K et al.
(
2025
)
.
Gen4Track: A Tuning-free Data Augmentation Framework via Self-correcting Diffusion Model for Vision-Language Tracking
.
3037
-
3046
.
Feng C, Sebe N, Tzimiropoulos G, Rodrigues MRD, Patras I
(
2025
)
.
Unveiling Open-set Noise: Theoretical Insights into Label Noise
.
Conference:
Proceedings of the 33rd ACM International Conference on Multimedia
3290
-
3299
.
Sahili ZA, Patras I, Purver M
(
2025
)
.
FairCoT: Enhancing Fairness in Text-to-Image Generation via Chain of Thought Reasoning with Multimodal Large Language Models
.
Xenos A, Foteinopoulou NM, Ntinou I, Patras I, Tzimiropoulos G
(
2025
)
.
VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning
.
1
-
10
.
Goulas A, Mezaris V, Patras I
(
2025
)
.
VidCtx: Context-aware Video Question Answering with Image Models
.
Conference:
2025 IEEE International Conference on Multimedia and Expo (ICME)
1
-
6
.
Sahili ZA, Patras I, Purver M
(
2025
)
.
Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges
.
Diko A, Wang T, Swaileh W, Sun S, Patras I
(
2025
)
.
ReWind: Understanding Long Videos with Instructed Learnable Memory
.
13734
-
13743
.
Cao Y, Zhao Z, Patras I, Gong S
(
2025
)
.
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
.
Conference:
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
7707
-
7716
.
Galanopoulos D, Goulas A, Leventakis A, Patras I, Mezaris V
(
2025
)
.
An LLM Framework for Long-Form Video Retrieval and Audio-Visual Question Answering Using Qwen2/2.5
.
Conference:
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
3730
-
3739
.
Ntrougkas MV, Mezaris V, Patras I
(
2025
)
.
P-TAME: Explain Any Image Classifier with Trained Perturbations
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2025
)
.
DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
.
Conference:
2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition (FG)
1
-
11
.
Cioni D, Tzelepis C, Seidenari L, Patras I
(
2025
)
.
Are CLIP Features All You Need for Universal Synthetic Image Origin Attribution?
.
Lecture Notes in Computer Science
vol.
15643
,
363
-
382
.
Meng D, Tzelepis C, Patras I, Tzimiropoulos G
(
2025
)
.
MM2Latent: Text-to-Facial Image Generation and Editing in GANs with Multimodal Assistance
.
Lecture Notes in Computer Science
vol.
15631
,
88
-
106
.
Kollias D, Psaroudakis A, Arsenos A, Theofilou P, Shao C, Hu G, Patras I
(
2025
)
.
MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation
.
Computer Vision – ECCV 2024 Workshops
,
vol.
15637
,
Springer Nature
Ntrougkas MV, Mezaris V, Patras I
(
2025
)
.
P-TAME: Explain Any Image Classifier With Trained Perturbations
.
IEEE Open Journal of Signal Processing
vol.
6
,
536
-
545
.
Foteinopoulou NM, Patras I
(
2025
)
.
Machine learning approaches for fine-grained symptom estimation in schizophrenia: A comprehensive review
.
Artificial Intelligence in Medicine
vol.
165
,
Zhao Z, Liu Z, Cao Y, Gong S, Patras I
(
2025
)
.
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
.
Zhao Z, Cao Y, Gong S, Patras I
(
2025
)
.
Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
.
Conference:
2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
815
-
824
.
Papadopoulos S, Ioannidis K, Vrochidis S, Kompatsiaris I, Patras I
(
2025
)
.
Vision-Language Pretraining for Variable-Shot Image Classification
.
MultiMedia Modeling
,
vol.
15523
,
Springer Nature
Al Sahili Z, Patras I, Purver M
(
2025
)
.
Data Matters Most: Auditing Social Bias in Contrastive Vision–Language Models
.
Transactions on Machine Learning Research
vol.
2025-October
,
Zhang Z, Liu Z, Patras I
(
2025
)
.
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
.
Proceedings of the International Conference on Computational Linguistics (COLING)
.
10924
-
10939
.
Ionescu B, Patras I, Müller H, Del Bimbo A
(
2024
)
.
Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation
.
ACM Transactions on Multimedia Computing Communications and Applications
vol.
21
,
(
1
)
1
-
7
.
Alwazzan O, Gallagher-Syed A, Millner TO, Brandner S, Patras I, Marino S, Slabaugh G
(
2024
)
.
Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology
.
(
2024
)
.
Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
.
(
2024
)
.
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
.
Maniadis Metaxas I, Tzimiropoulos G, Patras I
(
2024
)
.
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
.
Lecture Notes in Computer Science
vol.
15090
,
436
-
454
.
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I
(
2024
)
.
Bilinear Models of Parts and Appearances in Generative Adversarial Networks
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
vol.
46
,
(
12
)
8568
-
8579
.
Sun Z, Song S, Patras I, Tzimiropoulos G
(
2024
)
.
CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition
.
(
2024
)
.
CLIPCleaner: Cleaning Noisy Labels with CLIP
.
Conference:
Proceedings of the 32nd ACM International Conference on Multimedia
876
-
885
.
Oldfield J, Georgopoulos M, Chrysos G, Tzelepis C, Panagakis Y, Nicolaou MA, Deng J, Patras I
(
2024
)
.
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
.
Vancouver, CA
,
38th Conference on Neural Information Processing Systems (NeurIPS)
Maniadis Metaxas I, Tzimiropoulos G, Patras I
(
2024
)
.
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
.
Conference:
European Conference on Computer Vision 2024
from:
29/09/2024
to:
04/10/2024
,
Meng D, Tzelepis C, Patras I, Tzimiropoulos G
(
2024
)
.
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
.
Feng C, Tzimiropoulos G, Patras I
(
2024
)
.
CLIPCleaner: Cleaning Noisy Labels with CLIP
.
Kollias D, Psaroudakis A, Arsenos A, Theofilou P, Shao C, Hu G, Patras I
(
2024
)
.
MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation
.
Cioni D, Tzelepis C, Seidenari L, Patras I
(
2024
)
.
Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
.
Zhang Z, Liu Z, Patras I
(
2024
)
.
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2024
)
.
One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space
.
Int. J. Comput. Vis.
vol.
132
,
Article
8
,
3324
-
3354
.
Metaxas IM, Tzimiropoulos G, Patras I
(
2024
)
.
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
.
(
2024
)
.
NoiseBox: Toward More Efficient and Effective Learning With Noisy Labels
.
IEEE Transactions on Circuits and Systems for Video Technology
vol.
34
,
(
11
)
11914
-
11928
.
Metaxas IM, Bulat A, Patras I, Martinez B, Tzimiropoulos G
(
2024
)
.
Aligned Unsupervised Pretraining of Object Detectors with Self-training
.
Apostolidis E, Balaouras G, Patras I, Mezaris V
(
2024
)
.
Explainable Video Summarization for Advancing Media Content Production
.
Encyclopedia of Information Science and Technology, Sixth Edition
,
IGI Global
(
2024
)
.
LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition
.
Conference:
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
1639
-
1649
.
(
2024
)
.
Self-Supervised Facial Representation Learning with Facial Region Awareness
.
Conference:
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2081
-
2092
.
Foteinopoulou NM, Patras I
(
2024
)
.
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
.
1
-
10
.
(
2024
)
.
FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification
.
1
-
5
.
Zoumpourlis G, Patras I
(
2024
)
.
Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training
.
Conference:
12th International Winter Conference on Brain-Computer Interface (BCI)
from:
26/02/2024
to:
28/02/2024
,
(
2024
)
.
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
.
Sun Z, Feng C, Patras I, Tzimiropoulos G
(
2024
)
.
LAFS: Landmark-based Facial Self-supervised Learning for Face Recognition
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2024
)
.
One-Shot Neural Face Reenactment via Finding Directions in GAN’s Latent Space
.
International Journal of Computer Vision
vol.
132
,
(
8
)
3324
-
3354
.
Alwazzan O, Khan A, Patras I, Slabaugh G
(
2024
)
.
MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading
.
Alwazzan O, Patras I, Slabaugh G
(
2024
)
.
FOAA: Flattened Outer Arithmetic Attention For Multimodal Tumor Classification
.
Gao Z, Patras I
(
2024
)
.
Self-Supervised Facial Representation Learning with Facial Region Awareness
.
Zoumpourlis G, Patras I
(
2024
)
.
Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training
.
1
-
8
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2024
)
.
One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space
.
D’Incà M, Tzelepis C, Patras I, Sebe N
(
2024
)
.
Improving Fairness using Vision-Language Driven Image Augmentation
.
4683
-
4692
.
Gao Z, Feng C, Patras I
(
2024
)
.
Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features
.
Conference:
2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
1762
-
1772
.
Kollias D, Shao C, Kaloidas O, Patras I
(
2024
)
.
Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit
.
CoRR
vol.
abs/2409.17717
,
Feng C, Tzimiropoulos G, Patras I, Cai J, Kankanhalli MS, Prabhakaran B, Boll S, Subramanian R et al.
(
2024
)
.
CLIPCleaner: Cleaning Noisy Labels with CLIP
.
ACM Multimedia
.
876
-
885
.
(
2024
)
.
CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition
.
Conference:
Advances in Neural Information Processing Systems 37
35612
-
35638
.
Alwazzan O, Patras I, Slabaugh GG
(
2024
)
.
FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification
.
ISBI
.
1
-
5
.
Singh AK, Patras I
(
2024
)
.
FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion
.
CoRR
vol.
abs/2404.18591
,
D'Incà M, Tzelepis C, Patras I, Sebe N
(
2024
)
.
Improving Fairness using Vision-Language Driven Image Augmentation
.
WACV
.
4683
-
4692
.
Sun Z, Feng C, Patras I, Tzimiropoulos G
(
2024
)
.
LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition
.
CVPR
.
1639
-
1649
.
Alwazzan O, Khan A, Patras I, Slabaugh GG
(
2024
)
.
MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading
.
CoRR
vol.
abs/2403.06349
,
Chrysos G, Deng J, Georgopoulos M, Nicolaou M, Oldfield J, Panagakis Y, Patras I, Tzelepis C
(
2024
)
.
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
.
53022
-
53063
.
Sahili ZA, Patras I, Purver M
(
2024
)
.
Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges
.
Alwazzan O, Gallagher-Syed A, Millner T, Patras I, Marino S, Slabaugh GG
(
2024
)
.
Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology
.
CoRR
vol.
abs/2411.17418
,
Gao Z, Patras I
(
2024
)
.
Self-Supervised Facial Representation Learning with Facial Region Awareness
.
CVPR
.
2081
-
2092
.
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I
(
2023
)
.
Parts of Speech-Grounded Subspaces in Vision-Language Models
.
D'Incà M, Tzelepis C, Patras I, Sebe N
(
2023
)
.
Improving Fairness using Vision-Language Driven Image Augmentation
.
Apostolidis E, Mezaris V, Patras I
(
2023
)
.
A Study on the Use of Attention for Explaining Video Summarization
.
Conference:
Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos
41
-
49
.
Kankanhalli MS, Patras I, Liu J, Wong Y, Komamizu T
(
2023
)
.
NarSUM 2023 Chairs Welcome
.
NarSUM 2023: Proceedings of the 2nd Workshop on User-Centric Narrative Summarization of Long Videos, Co-located with MM 2023
Kankanhalli MS, Patras I, Liu J, Wong Y, Komamizu T, Yamazaki S, Stephen K, Kansal K
(
2023
)
.
NarSUM '23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos
.
Conference:
Proceedings of the 31st ACM International Conference on Multimedia
9731
-
9733
.
Foteinopoulou NM, Patras I
(
2023
)
.
Machine Learning Approaches for Fine-Grained Symptom Estimation in Schizophrenia: A Comprehensive Review
.
Xenos A, Stafylakis T, Patras I, Tzimiropoulos G
(
2023
)
.
A Simple Baseline for Knowledge-Based Visual Question Answering
.
Apostolidis E, Balaouras G, Mezaris V, Patras I
(
2023
)
.
Selecting A Diverse Set Of Aesthetically-Pleasing and Representative Video Thumbnails Using Reinforcement Learning
.
Conference:
2023 IEEE International Conference on Image Processing (ICIP)
2460
-
2464
.
(
2023
)
.
HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
.
Conference:
2023 IEEE/CVF International Conference on Computer Vision (ICCV)
7115
-
7125
.
(
2023
)
.
Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features
.
Barattin S, Tzelepis C, Patras I, Sebe N
(
2023
)
.
Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization
.
Conference:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
8001
-
8010
.
(
2023
)
.
DivClust: Controlling Diversity in Deep Clustering
.
Conference:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
3418
-
3428
.
Feng C, Patras I
(
2023
)
.
MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset
.
Conference:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
19913
-
19922
.
Kordopatis-Zilos G, Tolias G, Tzelepis C, Kompatsiaris I, Patras I, Papadopoulos S
(
2023
)
.
Self-Supervised Video Similarity Learning
.
Conference:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
4756
-
4766
.
Patras I
(
2023
)
.
Controllable image generation and manipulation
.
Conference:
Proceedings of the 2nd ACM International Workshop on Multimedia AI against Disinformation
1
-
1
.
(
2023
)
.
MOAB: Multi-Modal Outer Arithmetic Block for Fusion of Histopathological Images and Genetic Data for Brain Tumor Grading
.
Conference:
2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI)
1
-
5
.
Metaxas IM, Tzimiropoulos G, Patras I
(
2023
)
.
DivClust: Controlling Diversity in Deep Clustering
.
(
2023
)
.
Low-Light Image Enhancement Based on U-Net and Haar Wavelet Pooling
.
Lecture Notes in Computer Science
.
vol.
13834
,
510
-
522
.
(
2023
)
.
MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset
.
Barattin S, Tzelepis C, Patras I, Sebe N
(
2023
)
.
Attribute-preserving Face Dataset Anonymization via Latent Code Optimization
.
Yang Q, Tzelepis C, Nikolenko S, Patras I, Farseev A
(
2023
)
.
"Just To See You Smile": SMILEY, a Voice-Guided <strike>GUY</strike> GAN
.
Conference:
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining
1196
-
1199
.
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I
(
2023
)
.
PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2023
)
.
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment
.
Conference:
2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)
1
-
8
.
(
2023
)
.
A Simple Baseline for Knowledge-Based Visual Question Answering
.
Conference:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
14871
-
14877
.
Batziou E, Ioannidis K, Patras I, Vrochidis S, Kompatsiaris I
(
2023
)
.
Artistic neural style transfer using CycleGAN and FABEMD by adaptive information selection
.
Pattern Recognition Letters
vol.
165
,
55
-
62
.
Barattin S, Tzelepis C, Patras I, Sebe N
(
2023
)
.
Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization
.
CVPR
.
8001
-
8010
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2023
)
.
HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
.
ICCV
.
7115
-
7125
.
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou M, Patras I
(
2023
)
.
PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs
.
ICLR
.
Zhao Z, Patras I
(
2023
)
.
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
.
BMVC
.
98
-
98
.
Kordopatis-Zilos G, Tolias G, Tzelepis C, Kompatsiaris I, Patras I, Papadopoulos S
(
2023
)
.
Self-Supervised Video Similarity Learning
.
CVPR Workshops
.
4756
-
4766
.
Metaxas IM, Bulat A, Patras I, Martínez B, Tzimiropoulos G
(
2023
)
.
SimDETR: Simplifying self-supervised pretraining for DETR
.
CoRR
vol.
abs/2307.15697
,
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2023
)
.
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment
.
FG
.
1
-
8
.
Apostolidis E, Balaouras G, Mezaris V, Patras I
(
2022
)
.
Explaining video summarization based on the focus of attention
.
Conference:
2022 IEEE International Symposium on Multimedia (ISM)
146
-
150
.
(
2022
)
.
Learning from Label Relationships in Human Affect
.
Conference:
Proceedings of the 30th ACM International Conference on Multimedia
80
-
89
.
Patras I
(
2022
)
.
Video Summarization in the Deep Learning Era
.
Conference:
Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos
1
-
1
.
Bounareli S, Tzelepis C, Argyriou V, Patras I, Tzimiropoulos G
(
2022
)
.
StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment
.
Panwar H, Patras I
(
2022
)
.
Capsule Network based Contrastive Learning of Unsupervised Visual Representations
.
Feng C, Patras I
(
2022
)
.
Adaptive Soft Contrastive Learning
.
Conference:
2022 26th International Conference on Pattern Recognition (ICPR)
2721
-
2727
.
Foteinopoulou NM, Patras I
(
2022
)
.
Learning from Label Relationships in Human Affect
.
Kordopatis-Zilos G, Tzelepis C, Papadopoulos S, Kompatsiaris I, Patras I
(
2022
)
.
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval
.
International Journal of Computer Vision
vol.
130
,
(
10
)
2385
-
2407
.
Apostolidis E, Balaouras G, Mezaris V, Patras I
(
2022
)
.
Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames
.
Conference:
Proceedings of the 2022 International Conference on Multimedia Retrieval
407
-
415
.
Zoumpourlis G, Patras I
(
2022
)
.
CovMix: Covariance Mixing Regularization for Motor Imagery Decoding
.
Conference:
2022 10th International Winter Conference on Brain-Computer Interface (BCI)
1
-
7
.
Tzelepis C, Oldfield J, Tzimiropoulos G, Patras I
(
2022
)
.
ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences
.
CoRR
vol.
abs/2206.02104
,
Kordopatis-Zilos G, Tzelepis C, Papadopoulos S, Kompatsiaris I, Patras I
(
2022
)
.
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval
.
Int. J. Comput. Vis.
vol.
130
,
Article
10
,
2385
-
2407
.
Foteinopoulou NM, Patras I, Magalhães J, Bimbo AD, Satoh S, Sebe N, Alameda-Pineda X, Jin Q et al.
(
2022
)
.
Learning from Label Relationships in Human Affect
.
ACM Multimedia
.
80
-
89
.
Feng C, Tzimiropoulos G, Patras I
(
2022
)
.
SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise
.
BMVC 2022: 33rd British Machine Vision Conference Proceedings
.
Patras I, Kankanhalli MS, Liu J, Wong Y
(
2022
)
.
Video Summarization in the Deep Learning Era: Current Landscape and Future Directions
.
NarSUM@MM
.
1
-
1
.
Apostolidis E, Balaouras G, Mezaris V, Patras I
(
2021
)
.
Combining Global and Local Attention with Positional Encoding for Video Summarization
.
Conference:
2021 IEEE International Symposium on Multimedia (ISM)
226
-
234
.
Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I
(
2021
)
.
Tensor Component Analysis for Interpreting the Latent Space of GANs
.
(
2021
)
.
Video Summarization Using Deep Neural Networks: A Survey
.
Proceedings of the IEEE
vol.
109
,
(
11
)
1838
-
1863
.
(
2021
)
.
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
.
Conference:
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
6373
-
6382
.
(
2021
)
.
Estimating continuous affect with label uncertainty
.
Conference:
2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII)
1
-
8
.
(
2021
)
.
Pairwise Ranking Network for Affect Recognition
.
Conference:
2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII)
1
-
8
.
Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I
(
2021
)
.
Video Summarization Using Deep Neural Networks: A Survey
.
Tzelepis C, Tzimiropoulos G, Patras I
(
2021
)
.
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
.
Apostolidis E, Adamantidou E, Mezaris V, Patras I
(
2021
)
.
Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection
.
Conference:
Proceedings of the 2021 International Conference on Multimedia Retrieval
1
-
9
.
Xie T-T, Tzelepis C, Fu F, Patras I
(
2021
)
.
Few-Shot Action Localization without Knowing Boundaries
.
Conference:
Proceedings of the 2021 International Conference on Multimedia Retrieval
339
-
348
.
(
2021
)
.
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval
.
Fu F, Xie T, Patras I, Jalali S
(
2021
)
.
Relationship-based Neural Baby Talk
.
Tzelepis C, Patras I
(
2021
)
.
Uncertainty Propagation in Convolutional Neural Networks: Technical Report
.
(
2021
)
.
Cycle-Consistent Adversarial Networks and Fast Adaptive Bi-dimensional Empirical Mode Decomposition for Style Transfer
.
Conference:
2020 25th International Conference on Pattern Recognition (ICPR)
2360
-
2367
.
Chen L, Liang Y, Shi X, Zhou Y, Wu C, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Crossed-Time Delay Neural Network for Speaker Recognition
.
MMM (1)
.
vol.
12572
,
1
-
10
.
Lu X, Zhang J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
DANet: Deformable Alignment Network for Video Inpainting
.
MMM (1)
.
vol.
12572
,
430
-
442
.
Feng D, Zhang Y, Zhu C, Zhang H, Song L, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
DVRCNN: Dark Video Post-processing Method for VVC
.
MMM (1)
.
vol.
12572
,
691
-
703
.
Yang K, Lu J, Hu S, Chen X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Deep 3D Modeling of Human Bodies from Freehand Sketching
.
MMM (2)
.
vol.
12573
,
36
-
48
.
Xue L, Yao W, Xia Y, Li X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Deep Attributed Network Embedding with Community Information
.
MMM (1)
.
vol.
12572
,
653
-
665
.
Wen Z, Feng A, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Deep Centralized Cross-modal Retrieval
.
MMM (1)
.
vol.
12572
,
443
-
455
.
Yang S, Xue H, Ling J, Song L, Xie R, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Deep Face Swapping via Cross-Identity Adversarial Training
.
MMM (2)
.
vol.
12573
,
74
-
86
.
Constantin MG, Stefan L-D, Ionescu B, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
DeepFusion: Deep Ensembles for Domain Independent System Fusion
.
MMM (1)
.
vol.
12572
,
240
-
252
.
Zhang Z, Ma J, Xu P, Wang W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Dense Attention-Guided Network for Boundary-Aware Salient Object Detection
.
MMM (1)
.
vol.
12572
,
148
-
161
.
Wang F, Ding Y, Liang H, Wen J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Discriminative and Selective Pseudo-Labeling for Domain Adaptation
.
MMM (1)
.
vol.
12572
,
365
-
377
.
Zhang X, Du T, Zhang Z, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
EEG Emotion Recognition Based on Channel Attention for E-Healthcare Applications
.
MMM (2)
.
vol.
12573
,
159
-
169
.
Khan OS, Jónsson BÞ, Larsen MD, Poulsen LAS, Koelma DC, Rudinac S, Worring M, Zahálka J et al.
(
2021
)
.
Exquisitor at the Video Browser Showdown 2021: Relationships Between Semantic Classifiers
.
MMM (2)
.
vol.
12573
,
410
-
416
.
Zhao H, She X, Wang S, Ma K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Fast Discrete Matrix Factorization Hashing for Large-Scale Cross-Modal Retrieval
.
MMM (1)
.
vol.
12572
,
24
-
36
.
Wu S, Wang Z, Cai Y, Wang R, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Fast Mode Decision Algorithm for Intra Encoding of the 3rd Generation Audio Video Coding Standard
.
MMM (1)
.
vol.
12572
,
481
-
492
.
Qiu T, Ni B, Liu Z, Chen X, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Fast Optimal Transport Artistic Style Transfer
.
MMM (1)
.
vol.
12572
,
37
-
49
.
Xie T-T, Tzelepis C, Fu F, Patras I, Cheng W-H, Kankanhalli MS, Wang M, Chu W-T et al.
(
2021
)
.
Few-Shot Action Localization without Knowing Boundaries
.
ICMR
.
339
-
348
.
Wang H, Lian J, Xiong S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Few-Shot Learning with Unlabeled Outlier Exposure
.
MMM (1)
.
vol.
12572
,
340
-
351
.
Sun W, Xu J, Yang G, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Fine-Grained Generation for Zero-Shot Learning
.
MMM (1)
.
vol.
12572
,
580
-
591
.
Zheng M, Jia Y, Jiang H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Fine-Grained Image-Text Retrieval via Complementary Feature Learning
.
MMM (1)
.
vol.
12572
,
592
-
604
.
Zhang L, Zhang H, Zhu C, Guo S, Chen J, Wang L, Lokoc J, Skopal T et al.
(
2021
)
.
Fine-Grained Video Deblurring with Event Camera
.
MMM (1)
.
vol.
12572
,
352
-
364
.
Li F, Wang W, Liu Z, Wang H, Yan C, Wu B, Lokoc J, Skopal T et al.
(
2021
)
.
Frame Aggregation and Multi-modal Fusion Framework for Video-Based Person Recognition
.
MMM (1)
.
vol.
12572
,
75
-
86
.
Giannakeris P, Tsanousa A, Mavropoulos T, Meditskos G, Ioannidis K, Vrochidis S, Kompatsiaris I, Lokoc J et al.
(
2021
)
.
Fusion of Multimodal Sensor Data for Effective Human Action Recognition in the Service of Medical Platforms
.
MMM (2)
.
vol.
12573
,
367
-
378
.
Liu S, Claypool M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Game Input with Delay - A Model of the Time Distribution for Selecting a Moving Target with a Mouse
.
MMM (1)
.
vol.
12572
,
506
-
518
.
Shan X, Wen Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Gaussian Mixture Model Based Semi-supervised Sparse Representation for Face Recognition
.
MMM (1)
.
vol.
12572
,
716
-
727
.
Xiao Z, Li D, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Generative Image Inpainting by Hybrid Contextual Attention Network
.
MMM (1)
.
vol.
12572
,
162
-
173
.
Zhang C, Zhang W, Chen F, Cheng Y, Gao S, Zhang W, Lokoc J, Skopal T et al.
(
2021
)
.
Global Cognition and Local Perception Network for Blind Image Deblurring
.
MMM (1)
.
vol.
12572
,
303
-
314
.
Wang X, Li X, Wu S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Graph Structure Reasoning Network for Face Alignment and Reconstruction
.
MMM (1)
.
vol.
12572
,
493
-
505
.
Nguyen M-D, Binh NT, Gurrin C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Graph-Based Indexing and Retrieval of Lifelog Data
.
MMM (2)
.
vol.
12573
,
256
-
267
.
Pei D, Li A, Wang Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Group Activity Recognition by Exploiting Position Distribution and Appearance Relation
.
MMM (1)
.
vol.
12572
,
123
-
135
.
Garcia-Ceja E, Thambawita V, Hicks SA, Jha D, Jakobsen P, Hammer HL, Halvorsen P, Riegler MA et al.
(
2021
)
.
HTAD: A Home-Tasks Activities Dataset with Wrist-Accelerometer and Audio Features
.
MMM (2)
.
vol.
12573
,
196
-
205
.
Lee Y, Choi H, Park S, Ro YM, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
IVIST: Interactive Video Search Tool in VBS 2021
.
MMM (2)
.
vol.
12573
,
423
-
428
.
Ressmann A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
IVOS - The ITEC Interactive Video Object Search System at VBS2021
.
MMM (2)
.
vol.
12573
,
479
-
483
.
Qiu Y, Chen J, Wang X, Jang K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Illuminate Low-Light Image via Coarse-to-fine Multi-level Network
.
MMM (1)
.
vol.
12572
,
253
-
264
.
Jiang S, Wang C, Huang C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Image Registration Improved by Generative Adversarial Networks
.
MMM (2)
.
vol.
12573
,
26
-
35
.
Apostolakis A, Girtsou S, Kontoes C, Papoutsis I, Tsoutsos M, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Implementation of a Random Forest Classifier to Examine Wildfire Predictive Modelling in Greece Using Diachronically Collected Fire Occurrence and Fire Mapping Data
.
MMM (2)
.
vol.
12573
,
318
-
329
.
Feng C, Li D, Zheng J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Improving Supervised Cross-modal Retrieval with Semantic Graph Embedding
.
MMM (1)
.
vol.
12572
,
187
-
199
.
Zhu Z, Sun L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Initialize with Mask: For More Efficient Federated Learning
.
MMM (2)
.
vol.
12573
,
111
-
120
.
Smeaton AF, Krishnamurthy NG, Suryanarayana AH, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Keystroke Dynamics as Part of Lifelogging
.
MMM (2)
.
vol.
12573
,
183
-
195
.
Jha D, Ali S, Emanuelsen K, Hicks SA, Thambawita V, Garcia-Ceja E, Riegler MA, Lange TD et al.
(
2021
)
.
Kvasir-Instrument: Diagnostic and Therapeutic Tool Segmentation Dataset in Gastrointestinal Endoscopy
.
MMM (2)
.
vol.
12573
,
218
-
229
.
Zhang P, Ouyang D, Jiang C, Shao J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Language Person Search with Pair-Based Weighting Loss
.
MMM (1)
.
vol.
12572
,
227
-
239
.
Liu Z-Y, Liu J-W, Zuo X, Li W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Learning 3D-Craft Generation with Predictive Action Neural Network
.
MMM (1)
.
vol.
12572
,
541
-
553
.
Lu L, Lu Y, Wang S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Learning Multi-level Interaction Relations and Feature Representations for Group Activity Recognition
.
MMM (1)
.
vol.
12572
,
617
-
628
.
Zheng W, Yan L, Wang F-Y, Gou C, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Learning from the Negativity: Deep Negative Correlation Meta-Learning for Adversarial Image Classification
.
MMM (1)
.
vol.
12572
,
531
-
540
.
Leibetseder A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Less is More - diveXplore 5.0 at VBS 2021
.
MMM (2)
.
vol.
12573
,
455
-
460
.
Chen X, Liu R, Song X, Han Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Locating Visual Explanations for Video Question Answering
.
MMM (1)
.
vol.
12572
,
290
-
302
.
Gu Q, Luo Z, Zhao W, Zhu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
MM-Net: Learning Adaptive Meta-metric for Few-Shot Biometric Recognition
.
MMM (1)
.
vol.
12572
,
265
-
277
.
Nguyen D-H, Tan LTN, Nguyen M-T, Nguyen T-B, Dao M-S, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
MNR-Air: An Economic and Dynamic Crowdsourcing Mechanism to Collect Personal Lifelog and Surrounding Environment Dataset. A Case Study in Ho Chi Minh City, Vietnam
.
MMM (2)
.
vol.
12573
,
206
-
217
.
Zhang Y, Zhao H, Zhou F, Zhang Q, Shi Y, Liang L, Lokoc J, Skopal T et al.
(
2021
)
.
MSCANet: Adaptive Multi-scale Context Aggregation Network for Congested Crowd Counting
.
MMM (2)
.
vol.
12573
,
1
-
12
.
Song W, Dai S, Huang D, Song J, Liotta A, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Median-Pooling Grad-CAM: An Efficient Inference Level Visual Explanation for CNN Networks in Remote Sensing Image Classification
.
MMM (2)
.
vol.
12573
,
134
-
146
.
Codina-Filbà J, Escalera S, Escudero J, Antens C, Buch-Cardona P, Farrús M, Lokoc J, Skopal T et al.
(
2021
)
.
Mobile eHealth Platform for Home Monitoring of Bipolar Disorder
.
MMM (2)
.
vol.
12573
,
330
-
341
.
Zhang F, Li M, Zhai G, Liu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization
.
MMM (1)
.
vol.
12572
,
136
-
147
.
Liu Y, Lu Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Multi-grained Fusion for Conditional Image Retrieval
.
MMM (1)
.
vol.
12572
,
315
-
327
.
Zhang X, Zhang Y, Zhang Z, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Multi-granularity Recurrent Attention Graph Neural Network for Few-Shot Learning
.
MMM (2)
.
vol.
12573
,
147
-
158
.
Long J, Lu H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Multi-level Gate Feature Aggregation with Spatially Adaptive Batch-Instance Normalization for Semantic Image Synthesis
.
MMM (1)
.
vol.
12572
,
378
-
390
.
Gao R, Huang Z, Liu S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Multi-task Deep Learning for No-Reference Screen Content Image Quality Assessment
.
MMM (1)
.
vol.
12572
,
213
-
226
.
(
2021
)
.
MultiMedia Modeling - 27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021, Proceedings, Part I
.
MMM (1)
.
vol.
12572
,
(
2021
)
.
MultiMedia Modeling - 27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021, Proceedings, Part II
.
MMM (2)
.
vol.
12573
,
Yebda T, Benois-Pineau J, Pech M, Amieva H, Middleton L, Bergelt M, Lokoc J, Skopal T et al.
(
2021
)
.
Multimodal Sensor Data Analysis for Detection of Risk Situations of Fragile People in @home Environments
.
MMM (2)
.
vol.
12573
,
342
-
353
.
Zhao Y, Guo J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
MusiCoder: A Universal Music-Acoustic Encoder Based on Transformer
.
MMM (1)
.
vol.
12572
,
417
-
429
.
Karisch C, Leibetseder A, Schoeffmann K, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
NoShot Video Browser at VBS2021
.
MMM (2)
.
vol.
12573
,
405
-
409
.
Dobranský M, Skopal T, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
On Fusion of Learned and Designed Features for Video Data Analytics
.
MMM (2)
.
vol.
12573
,
268
-
280
.
Lokoč J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S, Patras I
(
2021
)
.
Preface
.
Fu F, Xie T, Patras I, Jalali S
(
2021
)
.
Relationship-based Neural Baby Talk
.
CoRR
vol.
abs/2103.04846
,
Zhao S, Li X, Chen Z, Liu C, Peng C, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Res2-Unet: An Enhanced Network for Generalized Nuclear Segmentation in Pathological Images
.
MMM (2)
.
vol.
12573
,
87
-
98
.
Park S, Kim JU, Kim Y, Moon S-K, Ro YM, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning
.
MMM (1)
.
vol.
12572
,
391
-
402
.
Feng C, Tzimiropoulos G, Patras I
(
2021
)
.
S3: Supervised Self-supervised Learning under Label Noise
.
CoRR
vol.
abs/2111.11288
,
Veselý P, Mejzlík F, Lokoc J, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
SOMHunter V2 at Video Browser Showdown 2021
.
MMM (2)
.
vol.
12573
,
461
-
466
.
Wu J, Nguyen PA, Ma Z, Ngo C-W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
SQL-Like Interpretable Interactive Video Search
.
MMM (2)
.
vol.
12573
,
391
-
397
.
Bishay M, Palasek P, Priebe S, Patras I
(
2021
)
.
SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis
.
IEEE Trans. Affect. Comput.
vol.
12
,
Article
4
,
949
-
961
.
Gisolf F, Geradts ZJMH, Worring M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Search and Explore Strategies for Interactive Analysis of Real-Life Image Collections with Unknown and Unique Categories
.
MMM (2)
.
vol.
12573
,
244
-
255
.
Wang T, Feng N, Yu J, He Y, Hu Y, Chen Y-PP, Lokoc J, Skopal T et al.
(
2021
)
.
Shot Boundary Detection Through Multi-stage Deep Convolution Neural Network
.
MMM (1)
.
vol.
12572
,
456
-
468
.
Wang J, Li Y, Lu H, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Spatial Gradient Guided Learning and Semantic Relation Transfer for Facial Landmark Detection
.
MMM (1)
.
vol.
12572
,
678
-
690
.
Gajdusek P, Peska L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
SpotifyGraph: Visualisation of User's Preferences in Music
.
MMM (2)
.
vol.
12573
,
379
-
384
.
Wu Y, Hu R, Wang X, Hu C, Li G, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Stacked Sparse Autoencoder for Audio Object Coding
.
MMM (1)
.
vol.
12572
,
50
-
61
.
Umemura K, Kastner MA, Ide I, Kawanishi Y, Hirayama T, Doman K, Deguchi D, Murase H et al.
(
2021
)
.
Tell as You Imagine: Sentence Imageability-Aware Image Captioning
.
MMM (2)
.
vol.
12573
,
62
-
73
.
Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I
(
2021
)
.
Tensor Component Analysis for Interpreting the Latent Space of GANs
.
Conference:
Proceedings of the British Machine Vision Conference 2021
Oldfield J, Georgopoulos M, Panagakis Y, Nicolaou MA, Patras I
(
2021
)
.
Tensor Component Analysis for Interpreting the Latent Space of GANs
.
BMVC
.
222
-
222
.
Nefkens M, Hürst W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
The MovieWall: A New Interface for Browsing Large Video Collections
.
MMM (2)
.
vol.
12573
,
170
-
182
.
Chu W-T, Huang P-S, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X, Vrochidis S et al.
(
2021
)
.
Thermal Face Recognition Based on Multi-scale Image Synthesis
.
MMM (1)
.
vol.
12572
,
99
-
110
.
Wei J, Yang X, Dong Y, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Time-Dependent Body Gesture Representation for Video Emotion Recognition
.
MMM (1)
.
vol.
12572
,
403
-
416
.
Heller S, Gasser R, Illi C, Pasquinelli M, Sauter L, Spiess F, Schuldt H, Lokoc J et al.
(
2021
)
.
Towards Explainable Interactive Multi-modal Video Retrieval with Vitrivr
.
MMM (2)
.
vol.
12573
,
435
-
440
.
Amirpour H, Çetinkaya E, Timmerer C, Ghanbari M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Towards Optimal Multirate Encoding for HTTP Adaptive Streaming
.
MMM (1)
.
vol.
12572
,
469
-
480
.
Kraus M, Seldschopf P, Minker W, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Towards the Development of a Trustworthy Chatbot for Mental Health Applications
.
MMM (2)
.
vol.
12573
,
354
-
366
.
Huang C, Chan S, Bai C, Ding W, Zhang J, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Tropical Cyclones Tracking Based on Satellite Cloud Images: Database and Comprehensive Study
.
MMM (2)
.
vol.
12573
,
13
-
25
.
Wang F, Luo L, Zhu E, Lokoc J, Skopal T, Schoeffmann K, Mezaris V, Li X et al.
(
2021
)
.
Two-Stage Real-Time Multi-object Tracking with Candidate Selection
.
MMM (2)
.
vol.
12573
,
49
-
61
.
Tzelepis C, Patras I
(
2021
)
.
Uncertainty Propagation in Convolutional Neural Networks: Technical Report
.
CoRR
vol.
abs/2102.06064
,
Lu Y, Wang Y, Xin Y, Wu D, Lu G, Lokoc J, Skopal T, Schoeffmann K et al.
(
2021
)
.
Unsupervised Gaze: Exploration of Geometric Constraints for 3D Gaze Estimation
.
MMM (2)
.
vol.
12573
,
121
-
133
.
Li X, Wang W, Li Q, Guo L, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Unsupervised Multi-shot Person Re-identification via Dynamic Bi-directional Normalized Sparse Representation
.
MMM (1)
.
vol.
12572
,
554
-
566
.
Hu M, Hu R, Wang X, Sheng R, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Unsupervised Temporal Attention Summarization Model for User Created Videos
.
MMM (1)
.
vol.
12572
,
519
-
530
.
Andreadis S, Moumtzidou A, Gkountakos K, Pantelidis N, Apostolidis K, Galanopoulos D, Gialampoukidis I, Vrochidis S et al.
(
2021
)
.
VERGE in VBS 2021
.
MMM (2)
.
vol.
12573
,
398
-
404
.
Lokoc J, Bátoryová J, Smrz D, Dobranský M, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Video Search with Collage Queries
.
MMM (2)
.
vol.
12573
,
429
-
434
.
Hezel N, Schall K, Jung K, Barthel KU, Lokoc J, Skopal T, Schoeffmann K, Mezaris V et al.
(
2021
)
.
Video Search with Sub-Image Keyword Transfer Using Existing Image Archives
.
MMM (2)
.
vol.
12573
,
484
-
489
.
Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I
(
2021
)
.
Video Summarization Using Deep Neural Networks: A Survey
.
Proc. IEEE
vol.
109
,
Article
11
,
1838
-
1863
.
Rossetto L, Baumgartner M, Ashena N, Ruosch F, Pernisch R, Heitz L, Bernstein A, Lokoc J et al.
(
2021
)
.
VideoGraph - Towards Using Knowledge Graphs for Interactive Video Retrieval
.
MMM (2)
.
vol.
12573
,
417
-
422
.
Tzelepis C, Tzimiropoulos G, Patras I
(
2021
)
.
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
.
CoRR
vol.
abs/2109.13357
,
(
2020
)
.
AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization
.
IEEE Transactions on Circuits and Systems for Video Technology
vol.
31
,
(
8
)
3278
-
3292
.
Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I
(
2020
)
.
Performance over Random
.
Conference:
Proceedings of the 28th ACM International Conference on Multimedia
1056
-
1064
.
Apostolidis E, Adamantidou E, Metsai AI, Mezaris V, Patras I
(
2020
)
.
Performance over Random: A Robust Evaluation Protocol for Video Summarization Methods
.
MM 2020: Proceedings of the 28th ACM International Conference on Multimedia
.
1056
-
1064
.
Xie T-T, Tzelepis C, Patras I
(
2020
)
.
Boundary Uncertainty in a Single-Stage Temporal Action Localization Network
.
Xie T-T, Tzelepis C, Patras I
(
2020
)
.
Temporal Action Localization with Variance-Aware Networks
.
Xie T, Tzelepis C, Patras I
(
2020
)
.
Boundary Uncertainty in a Single-Stage Temporal Action Localization Network
.
CoRR
vol.
abs/2008.11170
,
Gkalelis N, Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Pittaras N, Vrochidis S, Mezaris V et al.
(
2020
)
.
ITI-CERTH participation to TRECVID 2014
.
2014 TREC Video Retrieval Evaluation, TRECVID 2014
.
Markatopoulou F, Ioannidou A, Tzelepis C, Mironidis T, Galanopoulos D, Arestis-Chartampilas S, Pittaras N, Avgerinakis K et al.
(
2020
)
.
ITI-CERTH participation to TRECVID 2015
.
2015 TREC Video Retrieval Evaluation, TRECVID 2015
.
Xie T-T, Tzelepis C, Patras I
(
2020
)
.
Temporal Action Localization with Variance-Aware Networks
.
CoRR
vol.
abs/2008.11254
,
(
2020
)
.
Unsupervised Video Summarization via Attention-Driven Adversarial Learning
.
Lecture Notes in Computer Science
.
vol.
11961
,
492
-
504
.
Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I
(
2019
)
.
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
.
Conference:
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
vol.
00
,
6350
-
6359
.
Tao Y, Ling Z, Patras I
(
2019
)
.
Universal Foreground Segmentation Based on Deep Feature Fusion Network for Multi-Scene Videos
.
IEEE Access
vol.
7
,
158326
-
158337
.
Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I
(
2019
)
.
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
.
Conference:
International Conference on Computer Vision
(
Seoul, Korea
)
from:
27/10/2019
to:
02/11/2019
,
6351
-
6360
.
(
2019
)
.
A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization
.
Conference:
Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery
17
-
25
.
(
2019
)
.
Detecting Manipulations in Video
.
Video Verification in the Fake News Era
,
Springer Nature
(
2019
)
.
Finding Near-Duplicate Videos in Large-Scale Collections
.
Video Verification in the Fake News Era
,
Springer Nature
(
2019
)
.
Finding Semantically Related Videos in Closed Collections
.
Video Verification in the Fake News Era
,
Springer Nature
(
2019
)
.
Video Fragmentation and Reverse Search on the Web
.
Video Verification in the Fake News Era
,
Springer Nature
Bishay M, Zoumpourlis G, Patras I
(
2019
)
.
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
.
Conference:
30th British Machine Vision Conference
(
Cardiff, UK
)
from:
09/09/2019
to:
12/09/2019
,
Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I
(
2019
)
.
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
.
Bishay M, Zoumpourlis G, Patras I
(
2019
)
.
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
.
(
2019
)
.
Alone versus In-a-group
.
ACM Transactions on Multimedia Computing Communications and Applications
vol.
15
,
(
2
)
1
-
23
.
Marras I, Palasek P, Patras I
(
2019
)
.
Deep Mixture of MRFs for Human Pose Estimation
.
Lecture Notes in Computer Science
.
vol.
11363
,
717
-
733
.
Xie T, Yang X, Zhang T, Xu C, Patras I
(
2019
)
.
Exploring Feature Representation and Training strategies in Temporal Action Localization
.
(
2019
)
.
Your Fellows Matter: Affect Analysis across Subjects in Group Videos
.
Conference:
2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019)
vol.
00
,
1
-
5
.
(
2019
)
.
Can Automatic Facial Expression Analysis Be Used for Treatment Outcome Estimation in Schizophrenia?
.
Conference:
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
vol.
00
,
1632
-
1636
.
Jang Y, Gunes H, Patras I
(
2019
)
.
Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild
.
Computer Vision and Image Understanding
vol.
182
,
17
-
29
.
(
2019
)
.
SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis
.
IEEE Transactions on Affective Computing
vol.
12
,
(
4
)
949
-
961
.
Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I
(
2019
)
.
FIVR: Fine-grained Incident Video Retrieval
.
(
2019
)
.
FIVR: Fine-Grained Incident Video Retrieval
.
IEEE Transactions on Multimedia
vol.
21
,
(
10
)
2638
-
2652
.
Jang Y, Gunes H, Patras I
(
2019
)
.
Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild
.
Xie T, Yang X, Zhang T, Xu C, Patras I
(
2019
)
.
Exploring Feature Representation and Training Strategies in Temporal Action Localization
.
Conference:
2019 IEEE International Conference on Image Processing (ICIP)
vol.
00
,
1605
-
1609
.
(
2019
)
.
A deep generic to specific recognition model for group membership analysis using non-verbal cues
.
Image and Vision Computing
vol.
81
,
42
-
50
.
Mou W, Gunes H, Patras I
(
2019
)
.
Alone versus In-a-group: A Multi-modal Framework for Automatic Affect Recognition
.
ACM Trans. Multim. Comput. Commun. Appl.
vol.
15
,
Article
2
,
47:1
-
47:1
.
Markatopoulou F, Galanopoulos D, Tzelepis C, Mezaris V, Patras I
(
2019
)
.
Concept-Based and Event-Based Video Search in Large Video Collections
.
Big Data Analytics for Large Scale Multimedia Search
,
Xie T, Yang X, Zhang T, Xu C, Patras I
(
2019
)
.
Exploring Feature Representation and Training strategies in Temporal Action Localization
.
CoRR
vol.
abs/1905.10608
,
Kordopatis-Zilos G, Papadopoulos S, Patras I, Kompatsiaris I
(
2019
)
.
FIVR: Fine-Grained Incident Video Retrieval
.
IEEE Trans. Multim.
vol.
21
,
Article
10
,
2638
-
2652
.
Ahmadi A, Marras I, Patras I
(
2019
)
.
LikeNet: A Siamese motion estimation network trained in an unsupervised way
.
British Machine Vision Conference 2018, BMVC 2018
.
Jang Y, Gunes H, Patras I
(
2019
)
.
Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild
.
CoRR
vol.
abs/1902.04042
,
Bishay M, Zoumpourlis G, Patras I
(
2019
)
.
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
.
BMVC
.
154
-
154
.
Mou W, Gunes H, Patras I
(
2019
)
.
Your Fellows Matter: Affect Analysis across Subjects in Group Videos
.
FG
.
1
-
5
.
Andreadis S, Moumtzidou A, Galanopoulos D, Markatopoulou F, Apostolidis K, Mavropoulos T, Gialampoukidis I, Vrochidis S et al.
(
2019
)
.
VERGE in VBS 2019
.
Lecture Notes in Computer Science
.
vol.
11296
,
602
-
608
.
(
2019
)
.
Detecting Tampered Videos with Multimedia Forensics and Deep Learning
.
Lecture Notes in Computer Science
.
vol.
11295
,
374
-
386
.
(
2019
)
.
Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario
.
Lecture Notes in Computer Science
.
vol.
11295
,
143
-
155
.
(
2018
)
.
AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups
.
IEEE Transactions on Affective Computing
vol.
12
,
(
2
)
479
-
493
.
Correa JAM, Abadi MK, Sebe N, Patras I
(
2018
)
.
AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups
.
IEEE Transactions on Affective Computing
vol.
12
,
Article
2
,
479
-
493
.
Bishay M, Palasek P, Priebe S, Patras I
(
2018
)
.
SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis
.
Markatopoulou F, Mezaris V, Patras I
(
2018
)
.
Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation
.
IEEE Transactions on Circuits and Systems for Video Technology
Vasilyev A, Hansard M, Mareschal I, Patras I
(
2018
)
.
A Model of Visual Search in the Presence of Age-Related Macular Degeneration
.
Perception
.
Conference:
Proceedings of the AVA Christmas meeting
vol.
47
,
573
-
573
.
Miranda-Correa JA, Patras I
(
2018
)
.
A Multi-Task Cascaded Network for Prediction of Affect, Personality, Mood and Social Context Using EEG Signals
.
Conference:
2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018)
373
-
380
.
Apostolidis K, Markatopoulou F, Tzelepis C, Mezaris V, Patras I
(
2018
)
.
Multimedia Processing Essentials
.
Personal Multimedia Preservation
,
Springer Nature
Palasek P, Patras I
(
2018
)
.
Semi-supervised Fisher vector network
.
Moumtzidou A, Andreadis S, Markatopoulou F, Galanopoulos D, Gialampoukidis I, Vrochidis S, Mezaris V, Kompatsiaris I et al.
(
2018
)
.
VERGE in VBS 2018
.
Lecture Notes in Computer Science
.
vol.
10705
,
444
-
450
.
Ahmadi A, Marras I, Patras I
(
2018
)
.
LikeNet: A Siamese motion estimation network trained in an unsupervised way
.
British Machine Vision Conference 2018, BMVC 2018
.
Tzelepis C, Mezaris V, Patras I
(
2018
)
.
Linear Maximum Margin Classifier for Learning from Uncertain Data
.
IEEE Trans. Pattern Anal. Mach. Intell.
vol.
40
,
Article
12
,
2948
-
2962
.
Palasek P, Patras I
(
2018
)
.
Semi-supervised Fisher vector network
.
CoRR
vol.
abs/1801.04438
,
Batziou E, Michail E, Avgerinakis K, Vrochidis S, Patras I, Kompatsiaris I
(
2018
)
.
Visual and audio analysis of movies video for emotion detection @ Emotional Impact of Movies task MediaEval 2018
.
CEUR Workshop Proceedings
.
vol.
2283
,
Tzelepis C, Mezaris V, Patras I
(
2017
)
.
Linear Maximum Margin Classifier for Learning from Uncertain Data
.
(
2017
)
.
Linear Maximum Margin Classifier for Learning from Uncertain Data
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
vol.
40
,
(
12
)
2948
-
2962
.
(
2017
)
.
Deep Globally Constrained MRFs for Human Pose Estimation
.
Conference:
2017 IEEE International Conference on Computer Vision (ICCV)
3486
-
3495
.
(
2017
)
.
Near-Duplicate Video Retrieval with Deep Metric Learning
.
Conference:
2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
347
-
356
.
Jang Y, Gunes H, Patras I
(
2017
)
.
SmileNet: Registration-Free Smiling Face Detection in the Wild
.
Conference:
2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
1581
-
1589
.
Tao Y, Palasek P, Ling Z, Patras I
(
2017
)
.
Background Modelling Based on Generative Unet
.
Conference:
2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)
1
-
6
.
Palasek P, Patras I
(
2017
)
.
Discriminative convolutional Fisher vector network for action recognition
.
Palasek P, Patras I
(
2017
)
.
Discriminative convolutional Fisher vector network for action recognition
.
Galanopoulos D, Markatopoulou F, Mezaris V, Patras I
(
2017
)
.
Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection
.
Conference:
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
397
-
401
.
Markatopoulou F, Galanopoulos D, Mezaris V, Patras I
(
2017
)
.
Query and Keyframe Representations for Ad-hoc Video Search
.
Conference:
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
407
-
411
.
Collyda C, Apostolidis E, Pournaras A, Markatopoulou F, Mezaris V, Patras I
(
2017
)
.
VideoAnalysis4ALL
.
Conference:
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
470
-
474
.
(
2017
)
.
Deep Refinement Convolutional Networks for Human Pose Estimation
.
Conference:
2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)
446
-
453
.
(
2017
)
.
Fusing Multilabel Deep Networks for Facial Action Unit Detection
.
Conference:
2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)
681
-
688
.
Mou W, Tzelepis C, Mezaris V, Gunes H, Patras I
(
2017
)
.
Generic to Specific Recognition Models for Membership Analysis in Group Videos
.
Conference:
2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017)
512
-
517
.
Miranda-Correa JA, Abadi MK, Sebe N, Patras I
(
2017
)
.
AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups
.
Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Andreadis S, Gialampoukidis I, Tachos S, Vrochidis S et al.
(
2017
)
.
ITI-CERTH participation in TRECVID 2017
.
2017 TREC Video Retrieval Evaluation, TRECVID 2017
.
Pittaras N, Markatopoulou F, Mezaris V, Patras I
(
2017
)
.
Comparison of Fine-Tuning and Extension Strategies for Deep Convolutional Neural Networks
.
Lecture Notes in Computer Science
.
vol.
10132
,
102
-
114
.
(
2017
)
.
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
.
Lecture Notes in Computer Science
.
vol.
10132
,
251
-
263
.
Moumtzidou A, Mironidis T, Markatopoulou F, Andreadis S, Gialampoukidis I, Galanopoulos D, Ioannidou A, Vrochidis S et al.
(
2017
)
.
VERGE in VBS 2017
.
Lecture Notes in Computer Science
.
vol.
10133
,
486
-
492
.
Mou W, Gunes H, Patras I
.
Automatic Recognition of Emotions and Membership in Group Videos
.
2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
.
Conference:
2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
1478
-
1486
.
Kotsia I, Zafeiriou S, Goudelis G, Patras I, Karpouzis K
(
2016
)
.
Multimodal Sensing in Affective Gaming
.
Emotion in Games
,
vol.
4
,
Springer Nature
Mou W, Gunes H, Patras I
(
2016
)
.
Alone versus In-a-group
.
Conference:
Proceedings of the 24th ACM international conference on Multimedia
521
-
525
.
Markatopoulou F, Mezaris V, Patras I
(
2016
)
.
Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection
.
Conference:
Proceedings of the 24th ACM international conference on Multimedia
501
-
505
.
(
2016
)
.
Learning to detect video events from zero or very few video examples
.
Image and Vision Computing
vol.
53
,
35
-
44
.
Ahmadi A, Patras I
(
2016
)
.
Unsupervised Convolutional Neural Networks for Motion Estimation
.
http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=7527113
.
Conference:
Image Processing (ICIP), 2016 IEEE International Conference on
(
Phoenix, Arizona, USA
)
from:
25/09/2016
to:
28/09/2016
,
(
2016
)
.
Action recognition using saliency learned from recorded human gaze
.
Image and Vision Computing
vol.
52
,
195
-
205
.
Palasek P, Patras I
(
2016
)
.
Action Recognition Using Convolutional Restricted Boltzmann Machines
.
Conference:
Proceedings of the 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction
3
-
8
.
Wang L, Patras I, Zhang J, Mori G, Davis L
(
2016
)
.
Special Issue on Individual and Group Activities in Video Event Analysis
.
Computer Vision and Image Understanding
vol.
144
,
1
-
2
.
Vrochidis S, Patras I, Kompatsiaris I
(
2016
)
.
Gaze movement-driven random forests for query clustering in automatic video annotation
.
Multimedia Tools and Applications
vol.
76
,
(
2
)
2861
-
2889
.
Ahmadi A, Patras I
(
2016
)
.
Unsupervised convolutional neural networks for motion estimation
.
Markatopoulou F, Mezaris V, Patras I
(
2016
)
.
Ordering of Visual Descriptors in a Classifier Cascade Towards Improved Video Concept Detection
.
Lecture Notes in Computer Science
.
vol.
9516
,
874
-
885
.
Tzelepis C, Mezaris V, Patras I
(
2016
)
.
Video Event Detection Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty (KSVM-iGSU)
.
Lecture Notes in Computer Science
.
vol.
9516
,
3
-
15
.
Markatopoulou F, Galanopoulos D, Patras I, Mezaris V
(
2016
)
.
ITI-CERTH in TRECVID 2016 Ad-hoc video search (AVS)
.
2016 TREC Video Retrieval Evaluation, TRECVID 2016
.
Markatopoulou F, Moumtzidou A, Galanopoulos D, Mironidis T, Kaltsa V, Ioannidou A, Symeonidis S, Avgerinakis K et al.
(
2016
)
.
ITI-CERTH participation in TRECVID 2016
.
2016 TREC Video Retrieval Evaluation, TRECVID 2016
.
Tzelepis C, Galanopoulos D, Mezaris V, Patras I
(
2016
)
.
Learning to detect video events from zero or very few video examples
.
Image Vis. Comput.
vol.
53
,
35
-
44
.
Kuranuki Y, Patras I
(
2016
)
.
Minimal Filtered Channel Features for Pedestrian Detection
.
Conference:
2016 23rd International Conference on Pattern Recognition (ICPR)
681
-
686
.
Markatopoulou F, Mezaris V, Patras I
(
2016
)
.
Online Multi-Task Learning for Semantic Concept Detection in Video
.
Conference:
2016 IEEE International Conference on Image Processing (ICIP)
186
-
190
.
Ahmadi A, Patras I
(
2016
)
.
Unsupervised Convolutional Neural Networks for Motion Estimation
.
Conference:
2016 IEEE International Conference on Image Processing (ICIP)
1629
-
1633
.
Moumtzidou A, Mironidis T, Apostolidis E, Markatopoulou F, Ioannidou A, Gialampoukidis I, Avgerinakis K, Vrochidis S et al.
(
2016
)
.
VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval
.
Lecture Notes in Computer Science
.
vol.
9517
,
394
-
399
.
Tzelepis C, Mavridaki E, Mezaris V, Patras I
(
2016
)
.
Video Aesthetic Quality Assessment Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty (KSVM-IGSU)
.
Conference:
2016 IEEE International Conference on Image Processing (ICIP)
2410
-
2414
.
Tzelepis C, Galanopoulos D, Mezaris V, Patras I
(
2015
)
.
Learning to detect video events from zero or very few video examples
.
Abadi MK, Subramanian R, Kia SM, Avesani P, Patras I, Sebe N
(
2015
)
.
DECAF: MEG-Based Multimodal Database for Decoding Affective Physiological Responses
.
IEEE Transactions on Affective Computing
vol.
6
,
(
3
)
209
-
222
.
Kalpakis G, Tsikrika T, Markatopoulou F, Pittaras N, Vrochidis S, Mezaris V, Patras I, Kompatsiaris I
(
2015
)
.
Concept Detection in Multimedia Web Resources about Home Made Explosives
.
Conference:
2015 10th International Conference on Availability, Reliability and Security
632
-
641
.
Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P
(
2015
)
.
Face Alignment Assisted by Head Pose Estimation
.
Gu L, Kanade T
(
2015
)
.
Face Alignment
.
Encyclopedia of Biometrics
,
Springer Nature
Patras I
(
2015
)
.
Face Pose Analysis
.
Encyclopedia of Biometrics
,
Springer Nature
Palasek P, Yang H, Xu Z, Hajimirza N, Izquierdo E, Patras I
(
2015
)
.
A Flexible Calibration Method of Multiple Kinects for 3D Human Reconstruction
.
Conference:
2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
1
-
4
.
Yang H, Zou C, Patras I
(
2015
)
.
Cascade of forests for face alignment
.
IET Computer Vision
vol.
9
,
(
3
)
321
-
330
.
Yang H, Patras I
(
2015
)
.
Mirror, Mirror on the Wall, Tell me, is the Error Small?
.
Conference:
2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
4685
-
4693
.
Yang H, Jia X, Patras I, Chan K-P
(
2015
)
.
Random Subspace Supervised Descent Method for Regression Problems in Computer Vision
.
IEEE Signal Processing Letters
vol.
22
,
(
10
)
1816
-
1820
.
Abadi MK, Correa JAM, Wache J, Yang H, Patras I, Sebe N
(
2015
)
.
Inference of Personality Traits and Affect Schedule by Analysis of Spontaneous Reactions to Affective Videos
.
Conference:
2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)
1
-
8
.
Yang H, He X, Jia X, Patras I
(
2015
)
.
Robust Face Alignment Under Occlusion via Regional Predictive Power Estimation
.
IEEE Transactions on Image Processing
vol.
24
,
(
8
)
2393
-
2403
.
Markatopoulou F, Mezaris V, Pittaras N, Patras I
(
2015
)
.
Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video
.
IEEE Transactions on Emerging Topics in Computing
vol.
3
,
(
2
)
193
-
204
.
Yang H, Patras I
(
2015
)
.
Mirror, mirror on the wall, tell me, is the error small?
.
Yang H, Patras I
(
2015
)
.
Privileged Information-Based Conditional Structured Output Regression Forest for Facial Point Detection
.
IEEE Transactions on Circuits and Systems for Video Technology
vol.
25
,
(
9
)
1507
-
1520
.
Markatopoulou F, Pittaras N, Papadopoulou O, Mezaris V, Patras I
(
2015
)
.
A Study on the Use of a Binary Local Descriptor and Color Extensions of Local Descriptors for Video Concept Detection
.
Lecture Notes in Computer Science
.
vol.
8935
,
282
-
293
.
Markatopoulou F, Mezaris V, Patras I
(
2015
)
.
Cascade of Classifiers Based on Binary, Non-Binary and Deep Convolutional Network Descriptors for Video Concept Detection
.
Conference:
2015 IEEE International Conference on Image Processing (ICIP)
1786
-
1790
.
Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P
(
2015
)
.
Face Alignment Assisted by Head Pose Estimation
.
130.1
-
130.13
.
Yang H, Mou W, Zhang Y, Patras I, Gunes H, Robinson P, Xie X, Jones MW et al.
(
2015
)
.
Face Alignment Assisted by Head Pose Estimation
.
BMVC
.
130.1
-
130.13
.
Markatopoulou F, Ioannidou A, Tzelepis C, Mironidis T, Galanopoulos D, Arestis-Chartampilas S, Pittaras N, Avgerinakis K et al.
(
2015
)
.
ITI-CERTH participation to TRECVID 2015
.
2015 Trec Video Retrieval Evaluation Trecvid 2015
.
Chen M, Han J, Guo L, Wang J, Patras I
(
2015
)
.
Identifying Valence and Arousal Levels via Connectivity between EEG Channels
.
Conference:
2015 International Conference on Affective Computing and Intelligent Interaction (ACII)
63
-
69
.
Abadi MK, Correa JAM, Wache J, Yang H, Patras I, Sebe N
(
2015
)
.
Inference of Personality Traits and Affect Schedule by Analysis of Spontaneous Reactions to Affective Videos
.
2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 2
.
Yang H, Patras I
(
2015
)
.
Mirror, mirror on the wall, tell me, is the error small?
.
CoRR
vol.
abs/1501.05152
,
Patras MV, Bănacu CS, Popescu AM, Patraş I
(
2015
)
.
The effect of limiting the right of appeal on the quality of the public procurement management system: Case study of Romania
.
Proceedings of the 26th International Business Information Management Association Conference Innovation Management and Sustainable Economic Competitive Advantage from Regional Development to Global Growth Ibima 2015
.
521
-
530
.
Moumtzidou A, Avgerinakis K, Apostolidis E, Markatopoulou F, Apostolidis K, Mironidis T, Vrochidis S, Mezaris V et al.
(
2015
)
.
VERGE: A Multimodal Interactive Video Search Engine
.
Lecture Notes in Computer Science
.
vol.
8936
,
249
-
254
.
Yang H, Patras I
(
2014
)
.
Fine-Tuning Regression Forests Votes for Object Alignment in the Wild
.
IEEE Transactions on Image Processing
vol.
24
,
(
2
)
619
-
631
.
Kaymak S, Patras I
(
2014
)
.
Multimodal random forest based tensor regression
.
IET Computer Vision
vol.
8
,
(
6
)
650
-
657
.
Stefic D, Patras I
(
2014
)
.
Learning Visual Saliency Using Topographic Independent Component Analysis
.
Conference:
2014 IEEE International Conference on Image Processing (ICIP)
1130
-
1134
.
Burelli P, Triantafyllidis G, Patras I
(
2014
)
.
Non-Invasive Player Experience Estimation from Body Motion and Game Context
.
Conference:
2014 IEEE Conference on Computational Intelligence and Games
1
-
7
.
Stefic D, Patras I
(
2014
)
.
Learning visual saliency using topographic independent component analysis
.
2014 IEEE International Conference on Image Processing Icip 2014
.
1130
-
1134
.
Yang H, Zou C, Patras I
(
2014
)
.
Face sketch landmarks localization in the wild
.
IEEE Signal Processing Letters
vol.
21
,
(
11
)
1321
-
1325
.
Gkalelis N, Markatopoulou F, Moumtzidou A, Galanopoulos D, Avgerinakis K, Pittaras N, Vrochidis S, Mezaris V et al.
(
2014
)
.
ITI-CERTH participation to TRECVID 2014
.
2014 Trec Video Retrieval Evaluation Trecvid 2014
.
Jia X, Yang H, Chan K-P, Patras I
(
2014
)
.
Structured Semi-supervised Forest for Facial Landmarks Localization with Face Mask Reasoning
.
Conference:
Proceedings of the British Machine Vision Conference 2014
85.1
-
85.13
.
Yang H, Patras I
(
2013
)
.
Sieving Regression Forest Votes for Facial Feature Detection in the Wild
.
1936
-
1943
.
(
2013
)
.
Privileged Information-based Conditional Regression Forest for Facial Feature Detection
.
Conference:
2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)
1
-
6
.
G VKB, Patras I
(
2013
)
.
Supervised Dictionary Learning for Action Localization
.
Conference:
2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)
1
-
8
.
Rudovic O, Pantic M, Patras IY
(
2013
)
.
Coupled Gaussian Processes for Pose-Invariant Facial Expression Recognition
.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
vol.
35
,
(
6
)
1357
-
1369
.
Kaymak S, Patras I
(
2013
)
.
Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models
.
Lecture Notes in Computer Science
.
vol.
7729
,
160
-
170
.
Kotsia I, Patras I
(
2013
)
.
Exploring the Similarities of Neighboring Spatiotemporal Points for Action Pair Matching
.
Lecture Notes in Computer Science
.
vol.
7726
,
624
-
635
.
(
2013
)
.
Face Parts Localization Using Structured-Output Regression Forests
.
Lecture Notes in Computer Science
.
vol.
7725
,
667
-
679
.
Koelstra S, Patras I
(
2013
)
.
Fusion of facial expressions and EEG for implicit affective tagging
.
IMAGE AND VISION COMPUTING
vol.
31
,
(
2
)
164
-
174
.
Nikolopoulos S, Zafeiriou S, Patras I, Kompatsiaris I
(
2013
)
.
High order pLSA for indexing tagged images
.
SIGNAL PROCESSING
vol.
93
,
(
8
)
2212
-
2228
.
Guo W, Hu W, Boulgouris NV, Patras I
(
2013
)
.
Semi-Supervised Visual Recognition With Constrained Graph Regularized Non Negative Matrix Factorization
.
Conference:
2013 IEEE International Conference on Image Processing
2743
-
2747
.
Kotsia I, Guo W, Patras I
(
2012
)
.
Higher rank Support Tensor Machines for visual recognition
.
Pattern Recognition
vol.
45
,
(
12
)
4192
-
4203
.
Oveisi F, Oveisi S, Erfanian A, Patras I
(
2012
)
.
Nonlinear Independent Component Analysis for EEG-Based Brain-Computer Interface Systems
.
Independent Component Analysis for Audio and Biosignal Applications
,
IntechOpen
G VKB, Patras I
(
2012
)
.
Learning Codebook Weights for Action Detection
.
Conference:
2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
vol.
1
,
27
-
32
.
Kotsia I, Patras I, Fotopoulos S
(
2012
)
.
AFFECTIVE GAMING: BEYOND USING SENSORS
.
Conference:
2012 5th International Symposium on Communications, Control and Signal Processing
vol.
1
,
1
-
4
.
Vrochidis S, Patras I, Kompatsiaris I
(
2012
)
.
EXPLOITING GAZE MOVEMENTS FOR AUTOMATIC VIDEO ANNOTATION
.
Conference:
2012 13th International Workshop on Image Analysis for Multimedia Interactive Services
vol.
1
,
1
-
4
.
Guo W, Kotsia I, Patras I
(
2012
)
.
Tensor learning for regression
.
IEEE Trans Image Process
vol.
21
,
(
2
)
816
-
827
.
Yang H, Liu X, Patras I
(
2012
)
.
A Simple and Effective Extrinsic Calibration Method of a Camera and a Single Line Scanning Lidar
.
21st International Conference on Pattern Recognition
.
Conference:
ICPR 2012
Yang H, Zhang Y, Liu X, Patras I
(
2012
)
.
Coupled 3D Tracking and Pose Optimization of Rigid Objects Using Particle Filter
.
21st International Conference on Pattern Recognition
.
Conference:
ICPR 2012
Koelstra S, Muhl C, Soleymani M, Lee J-S, Yazdani A, Ebrahimi T, Pun T, Nijholt A et al.
(
2012
)
.
DEAP: A Database for Emotion Analysis Using Physiological Signals
.
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
vol.
3
,
(
1
)
18
-
31
.
Kotsia I, Guo W, Patras I
(
2012
)
.
Higher Rank Support Tensor Machines
.
Lecture Notes in Computer Science
.
vol.
7432
,
31
-
40
.
Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I
(
2012
)
.
Image Interpretation by Combining Ontologies and Bayesian Networks
.
Lecture Notes in Computer Science
.
vol.
7297
,
307
-
314
.
Chatzilari E, Nikolopoulos S, Patras I, Kompatsiaris I
(
2012
)
.
Leveraging social media for scalable object detection
.
PATTERN RECOGNITION
vol.
45
,
(
8
)
2962
-
2979
.
Kumar BGV, Kotsia I, Patras I
(
2012
)
.
Max-margin Non-negative Matrix Factorization
.
IMAGE AND VISION COMPUTING
vol.
30
,
(
4-5
)
279
-
291
.
Kotsia I, Patras I
(
2012
)
.
SUPPORT TENSOR ACTION SPOTTING
.
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012)
.
1397
-
1400
.
Oveisi F, Oveisi S, Erfanian A, Patras I
(
2012
)
.
Tree-Structured Feature Extraction Using Mutual Information
.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
vol.
23
,
(
1
)
127
-
137
.
(
2011
)
.
Japan megathrust earthquake on March 11, 2011: GPS-TEC evidence for ionospheric disturbances
.
JETP Letters
.
vol.
94
,
616
-
620
.
Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I
(
2011
)
.
Evidence-driven image interpretation by combining implicit and explicit knowledge in a Bayesian network
.
IEEE Trans Syst Man Cybern B Cybern
vol.
41
,
(
5
)
1366
-
1381
.
Vrochidis S, Patras I, Kompatsiaris I
(
2011
)
.
An eye-tracking-based approach to facilitate interactive video search
.
Conference:
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
1
-
8
.
Oikonomopoulos A, Patras I, Pantic M
(
2011
)
.
Spatiotemporal localization and categorization of human actions in unsegmented image sequences
.
IEEE Transactions on Image Processing
vol.
20
,
(
4
)
1126
-
1140
.
Vrochidis S, Kompatsiaris I, Patras I
(
2011
)
.
Utilizing Implicit User Feedback to Improve Interactive Video Retrieval
.
Advances in Multimedia
vol.
2011
,
(
1
)
1
-
18
.
Soleymani M, Koelstra S, Patras I, Pun T
(
2011
)
.
Continuous Emotion Detection in Response to Music Videos
.
Conference:
Face and Gesture 2011
vol.
1
,
803
-
808
.
Nikolopoulos S, Giannakidou E, Kompatsiaris I, Patras I, Vakali A
(
2011
)
.
Combining Multi-modal Features for Social Media Analysis
.
Social Media Modeling and Computing
,
Springer Nature
Chatzilari E, Nikolopoulos S, Patras I, Kompatsiaris I
(
2011
)
.
Enhancing Computer Vision Using the Collective Intelligence of Social Media
.
New Directions in Web Data Management 1
,
vol.
331
,
Springer Nature
Guo W, Kotsia I, Patras I
(
2011
)
.
Higher order Support tensor regression for head pose estimation
.
International Workshop on Image Analysis for Multimedia Interactive Services
.
Moumtzidou A, Sidiropoulos P, Vrochidis S, Gkalelis N, Nikolopoulos S, Mezaris V, Kompatsiaris I, Patras I
(
2011
)
.
ITI-CERTH participation to TRECVID 2011
.
2011 Trec Video Retrieval Evaluation Notebook Papers
.
Kumar VBG, Patras I, Kotsia I
(
2011
)
.
Max-Margin Semi-NMF
.
Conference:
Proceedings of the British Machine Vision Conference 2011
129.1
-
129.11
.
Kotsia I, Patras I
(
2011
)
.
Support Tucker Machines
.
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)
.
633
-
640
.
Koelstra S, Pantic M, Patras I
(
2010
)
.
A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
vol.
32
,
(
11
)
1940
-
1954
.
Oikonomopoulos A, Patras I, Pantic M
(
2010
)
.
Discriminative space-time voting for joint recognition and localization of actions
.
Conference:
Proceedings of the 2nd international workshop on Social signal processing
11
-
16
.
Passino G, Patras I, Izquierdo E
(
2010
)
.
Aspect coherence for graph-based semantic image labelling
.
IET COMPUT VIS
vol.
4
,
(
3
)
183
-
194
.
Patras I, Hancock ER
(
2010
)
.
Coupled Prediction Classification for Robust Visual Tracking
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
vol.
32
,
(
9
)
1553
-
1567
.
Kotsia I, Patras I
(
2010
)
.
Multiplicative Update Rules for Multilinear Support Tensor Machines
.
2010 20th International Conference on Pattern Recognition
.
Conference:
2010 20th International Conference on Pattern Recognition
33
-
36
.
(
2010
)
.
Pyramidal Model for Image Semantic Segmentation
.
2010 20th International Conference on Pattern Recognition
.
Conference:
2010 20th International Conference on Pattern Recognition
1554
-
1557
.
Rudovic O, Patras I, Pantic M
(
2010
)
.
Regression-Based Multi-view Facial Expression Recognition
.
2010 20th International Conference on Pattern Recognition
.
Conference:
2010 20th International Conference on Pattern Recognition
4121
-
4124
.
Vrochidis S, Kompatsiaris I, Patras I
(
2010
)
.
Optimizing visual search with implicit user feedback in interactive video retrieval
.
Conference:
Proceedings of the ACM International Conference on Image and Video Retrieval
274
-
281
.
Kotsia I, Patras I
(
2010
)
.
Relative Margin Support Tensor Machines for gait and action recognition
.
Conference:
Proceedings of the ACM International Conference on Image and Video Retrieval
446
-
453
.
Rudovic O, Patras I, Pantic M
(
2010
)
.
Facial Expression Invariant Head Pose Normalization using Gaussian Process Regression
.
Conference:
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops
28
-
33
.
Koelstra S, Pantic M, Patras I
(
2010
)
.
A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
vol.
32
,
1940
-
1954
.
Kumar BGV, Patras I
(
2010
)
.
A discriminative voting scheme for object detection using hough forests
.
British Machine Vision Conference Bmvc 2010 Proceedings
.
Rudovic O, Patras I, Pantic M
(
2010
)
.
Coupled Gaussian Process Regression for Pose-Invariant Facial Expression Recognition
.
Lecture Notes in Computer Science
.
vol.
6312
,
350
-
363
.
Vrochidis S, Kompatsiaris I, Patras I
(
2010
)
.
Exploiting implicit user feedback in interactive video retrieval
.
WIAMIS
.
1
-
4
.
Guo W, Patras I
(
2010
)
.
Learning output-kernel-dependent regression for human pose estimation
.
British Machine Vision Conference Bmvc 2010 Proceedings
.
Koelstra S, Yazdani A, Soleymani M, Muhl C, Lee J-S, Nijholt A, Pun T, Ebrahimi T et al.
(
2010
)
.
Single Trial Classification of EEG and Peripheral Physiological Signals for Recognition of Emotions Induced by Music Videos
.
BRAIN INFORMATICS, BI 2010
.
vol.
6334
,
89
-
100
.
Oikonomopoulos A, Pantic M, Patras I
(
2009
)
.
Sparse B-spline polynomial descriptors for human activity recognition
.
IMAGE VISION COMPUT
vol.
27
,
(
12
)
1814
-
1825
.
Guo W, Patras I
(
2009
)
.
Discriminative 3D Human Pose Estimation from Monocular Images via Topological Preserving Hierarchical Affinity Clustering
.
Conference:
2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops
9
-
15
.
Koelstra S, Mühl C, Patras I
(
2009
)
.
EEG analysis for implicit tagging of video data
.
Conference:
2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops
1
-
6
.
Passino G, Patras I, Izquierdo E
(
2009
)
.
Latent Semantics Local Distribution for CRF-based Image Semantic Segmentation
.
Proceedings of the British Machine Vision Conference (BMVC 2009)
.
Conference:
BMVC 2009
(
London, England
)
from:
07/09/2009
to:
10/09/2009
,
1
-
12
.
Passino G, Piatrik T, Patras I, Izquierdo E
(
2009
)
.
A Multimedia Content Semantics Extraction Framework for Enhanced Social Interaction
.
Adjunct proceedings EuroITV 2009 Networked Television
.
Conference:
EuroITV 2009
(
Leuven, Belgium
)
from:
03/06/2009
to:
05/06/2009
,
89
-
91
.
Oikonomopoulos A, Patras I, Pantic M
(
2009
)
.
An implicit spatiotemporal shape model for human activity localization and recognition
.
2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
.
Conference:
2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
27
-
33
.
Nikolopoulos S, Papadopoulos GT, Kompatsiaris I, Patras I, Perner P
(
2009
)
.
An Evidence-Driven Probabilistic Inference Framework for Semantic Image Understanding
.
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION
.
vol.
5632
,
525
-
539
.
Oikonomopoulos A, Patras I, Pantic M
(
2009
)
.
An Implicit Spatiotemporal Shape Model for Human Activity Localization and Recognition
.
2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2
.
786
-
792
.
Oikonomopoulos A, Patras I, Pantic M
(
2009
)
.
An implicit spatiotemporal shape model for human activity localization and recognition
.
2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Cvpr Workshops 2009
.
27
-
33
.
Passino G, Patras I, Izquierdo E
(
2009
)
.
CONTEXT AWARENESS IN GRAPH-BASED IMAGE SEMANTIC SEGMENTATION VIA VISUAL WORD DISTRIBUTIONS
.
2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES
.
33
-
36
.
(
2009
)
.
Face Acquisition
.
Encyclopedia of Biometrics
,
Springer Nature
Patras I
(
2009
)
.
Face Pose Analysis
.
Encyclopedia of Biometrics
,
Springer Nature
Koelstra S, Patras I
(
2009
)
.
THE FAST-3D SPATIO-TEMPORAL INTEREST REGION DETECTOR
.
2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES
.
242
-
245
.
Oikonomopoulos A, Pantic M, Patras I
(
2008
)
.
Human Gesture Recognition using Sparse B-spline Polynomial Representations
.
Belgian Netherlands Artificial Intelligence Conference
.
193
-
200
.
Andreopoulos Y, Patras I
(
2008
)
.
Incremental refinement of image salient-point detection
.
IEEE Transactions on Image Processing
vol.
17
,
(
9
)
1685
-
1699
.
(
2008
)
.
Aspect coherence for graph-based image labelling
.
Conference:
5th International Conference on Visual Information Engineering (VIE 2008)
94
-
99
.
Oikonomopoulos A, Pantic M, Patras I
(
2008
)
.
B-spline Polynomial Descriptors for Human Activity Recognition
.
2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3
.
1622
-
1627
.
Oikonomopoulos A, Pantic M, Patras I
(
2008
)
.
B-spline polynomial descriptors for human activity recognition
.
CVPR Workshops
.
1
-
6
.
Patras I, Andreopoulos Y
(
2008
)
.
Incremental salient point detection
.
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12
.
1337
-
1340
.
Passino G, Patras I, Izquierdo E
(
2008
)
.
ON THE ROLE OF STRUCTURE IN PART-BASED OBJECT DETECTION
.
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5
.
65
-
68
.
Patras I, Lagendijk RL, Hendriks EA
(
2007
)
.
Bayesian Confidence Measures for Block-based Motion Estimation
.
IEEE Trans. Circuits and Systems for Video Technology
vol.
17
,
(
8
)
988
-
995
.
Patras I, Hendriks EA, Lagendijk RL
(
2007
)
.
Probabilistic confidence measures for block matching motion estimation
.
IEEE T CIRC SYST VID
vol.
17
,
(
8
)
988
-
995
.
Patras I, Hancock ER
(
2007
)
.
Regression-based Template Tracking in Presence of Occlusions
.
Conference:
International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), Santorini, Greece
Pogalin E, Redert A, Patras I, Hendriks EA, Pollefeys M, Daniilidis K
(
2007
)
.
Gaze tracking by using factorized likelihoods particle filtering and stereo vision
.
Third International Symposium on 3D Data Processing, Visualization, and Transmission, Proceedings
.
57
-
64
.
Patras I, Paragios N, Oikonomopoulos A, Pantic M, Huang TS, Nijholt A, Pantic M, Pentland A
(
2007
)
.
Particle Filtering Tracking Scheme for Trajectory-based Recognition of Human Actions
.
Artificial Intelligence for Human Computing
,
Springer
(
Lecture Notes in Artificial Intelligence - Volume 4451
),
Patras I, Hancock ER
(
2007
)
.
Regression tracking with data relevance determination
.
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8
.
2062
-
2069
.
Patras I, Hancock E
(
2007
)
.
Template tracking with observation relevance determination
.
2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7
.
501
-
504
.
Oikonomopoulos A, Patras I, Pantic M, Paragios N, Huang TS, Nijholt A, Pantic M, Pentland A
(
2007
)
.
Trajectory-based representation of human actions
.
Artificial Intelligence for Human Computing
.
vol.
4451
,
133
-
154
.
Patras I, Pantic M, Oikonomopoulos A
(
2006
)
.
Kernel-based Recognition of Human Actions Using Spatiotemporal Salient Points
.
Conference:
Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Workshop on Vision for HCI, New York, USA - June 2006
Oikonomopoulos A, Patras I, Pantic M
(
2006
)
.
Spatiotemporal salient points for visual recognition of human actions
.
IEEE Trans Syst Man Cybern B Cybern
vol.
36
,
(
3
)
710
-
719
.
Pantic M, Patras I
(
2006
)
.
Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences
.
IEEE Trans Syst Man Cybern B Cybern
vol.
36
,
(
2
)
433
-
449
.
Diplaros A, Gevers T, Patras I
(
2006
)
.
Combining color and shape information for illumination-viewpoint invariant object recognition
.
IEEE Trans Image Process
vol.
15
,
(
1
)
1
-
11
.
Patras I, Valstar MF, Pantic M
(
2005
)
.
Learning Spatiotemporal Models of Facial Expressions
.
Conference:
International Conference on Measuring Behaviour, Wageningen - September 2005
Patras I, Pantic M, Valstar MF
(
2005
)
.
Facial Action Unit Detection Using Probabilistically Actively Learned Support Vector Machines on Tracked Facial Point Data
.
Conference:
Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Workshop on Vision for HCI, San Diego, USA - June 2005
Pantic M, Patras I
(
2005
)
.
Detecting Facial Actions and their Temporal Segments in Nearly Frontal-View Face Image Sequences
.
Conference:
2005 IEEE International Conference on Systems, Man and Cybernetics
vol.
4
,
1
-
6
.
Pantic M, Patras I
(
2005
)
.
Detecting facial actions and their temporal segments in nearly frontal-view face image sequences
.
INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS
.
3358
-
3363
.
Oikonomopoulos A, Patras I, Pantic M
(
2005
)
.
Spatiotemporal saliency for human action recognition
.
2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2
.
430
-
433
.
Patras I, Pantic M
(
2005
)
.
Tracking deformable motion
.
INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS
.
1066
-
1071
.
Patras I, Worring M, van den Boomgaard R
(
2004
)
.
Dense motion estimation using regularization constraints on local parametric models
.
IEEE Trans Image Process
vol.
13
,
(
11
)
1432
-
1443
.
Patras I, Pantic M, Valstar M
(
2004
)
.
Multilevel Motion History for Facial Action Detection from Face Video
.
Conference:
IEEE International Conference on Systems, Man and Cybernetics, Den Haag, The Netherlands - October 2004
Diplaros A, Gevers T, Patras I, Santini S, Schettini R
(
2004
)
.
Combining color and shape information for content-based image retrieval on the Internet
.
INTERNET IMAGING V
.
vol.
5304
,
132
-
141
.
Valstar M, Patras I, Pantic M
(
2004
)
.
Facial action unit recognition using temporal templates
.
RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS
.
253
-
258
.
Valstar M, Pantic M, Patras I
(
2004
)
.
Motion History for Facial Action Detection in Video
.
Conference:
2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583)
vol.
1
,
635
-
640
.
Winkelman F, Patras I
(
2004
)
.
Online globally consistent mosaicing using an efficient representation
.
2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7
.
3116
-
3121
.
Patras I, Pantic M
(
2004
)
.
Particle filtering with factorized likelihoods for tracking facial features
.
SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS
.
97
-
102
.
Pantic M, Patras I
(
2004
)
.
Temporal modeling of facial actions from face profile image sequences
.
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1-3
.
49
-
52
.
Patras I, Gevers T, Diplaros A
(
2003
)
.
Color-Shape Context for Object Recognition
.
Conference:
IEEE Workshop on Color and Photometric Methods in Computer Vision (in conjunction with ICCV 2003), Nice, France
Patras I, Hendriks EA, Lagendijk RL
(
2003
)
.
Semi-automatic object-based video segmentation with labeling of color segments
.
SIGNAL PROCESS-IMAGE
vol.
18
,
(
1
)
51
-
65
.
Patras I, Raaijmakers S, Snoek C, van Rest J, Worring M, van Leeuwen D, den Hartog J, Vendrig J
(
2002
)
.
TREC Feature Extraction by Active Learning
.
Conference:
11th Text Retrieval Conference (TREC), Gaithersburg, MD - November 2002
Patras I, Hendriks EA, Lagendijk RL
(
2002
)
.
Confidence measures for block matching motion estimation
.
2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS
.
277
-
280
.
Pantic M, Patras I, Rothkrantz L
(
2002
)
.
Facial action recognition in face profile image sequences
.
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS
.
37
-
40
.
Patras I, Worring M, Kasturi R, Laurendeau D, Suen C
(
2002
)
.
Regularized patch motion estimation
.
16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS
.
323
-
326
.
Patras I, Hendriks EA, Lagendijk RL
(
2001
)
.
Video segmentation by MAP labeling of watershed segments
.
IEEE T PATTERN ANAL
vol.
23
,
(
3
)
326
-
332
.
Patras I, List J, Geusebroek J-M, den Hartog J, Hiemstra D, van Ballegooij A, Worring M, Snoek C et al.
(
2001
)
.
Lazy Users and Automatic Video Retrieval Tools in (the) Lowlands
.
Conference:
Proceedings of the 10th Text Retrieval Conference (TREC), NIST 2001
Patras I, Hendriks EA, Broekhoven M, Hupkens T
(
2001
)
.
Robust Region Merging for Motion Based Segmentation Using the Kolmogorov-Smirnov Test
.
Image Processing and Communications
vol.
6
,
(
3-4
)
27
-
34
.
Patras I, Hendriks EA, Lagendijk RL
(
1998
)
.
Iterative motion estimation - segmentation method using watershed segments
.
IEEE International Conference on Image Processing
.
vol.
2
,
642
-
646
.
Patras IK, Hendriks EA, Tziritas GG
(
1997
)
.
Construction of multiple views using jointly estimated motion and disparity fields
.
Proceedings of SPIE--the International Society for Optical Engineering
.
Conference:
Visual Communications and Image Processing '97
vol.
3024
,
380
-
390
.
Xenos A, Stafylakis T, Patras I, Tzimiropoulos G
.
A Simple Baseline for Knowledge-Based Visual Question Answering
.
Conference:
Empirical Methods in Natural Language Processing
from:
06/12/2023
to:
10/12/2023
,
Sun Z, Song S, Patras I, Tzimiropoulos G
.
CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition
.
Advances in Neural Information Processing Systems
.
Patras I, Gao Z, Song J, Zhang Z, Deng J
.
Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation
.
Conference:
International Conference on Computer Vision
(
Hawaii
)
from:
19/10/2025
to:
23/10/2025
,
Izquierdo EE, Patras IE, Hao PE, Gunes HE, Asioli SE, Bangert TE, Klavdianos PE, Brenner ME et al.
.
MMV Members
.
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I
.
PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs
.
Conference:
International Conference on Learning Representations (ICLR)
(
Kigali, Rwanda
)
Oldfield J
.
Parts of Speech–Grounded Subspaces in Vision-Language Models
.
Conference:
37th Conference on Neural Information Processing Systems
Oldfield J, Tzelepis C, Panagakis Y, Nicolaou MA, Patras I
.
Parts of Speech–Grounded Subspaces in Vision-Language Models
.
Conference:
Thirty-seventh Conference on Neural Information Processing Systems
Zhao Z, Patras I
.
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
.
Conference:
The 34th British Machine Vision Conference
(
Aberdeen, UK
)
from:
20/11/2023
to:
24/11/2023
,