About
I am having fun at Fondazione Bruno Kessler (FBK) working on efficient multi-modal learning. I am part of the Deep Visual Learning (DVL) unit, advised by Prof. Elisa Ricci and Dr. Yiming Wang.
Before joining FBK, I obtained my PhD in Computer Vision at the University of Genova and the Italian Institute of Technology (IIT) under the supervision of Dr. Alessio Del Bue and Prof. Stuart James. During the PhD, I visited the University of Amsterdam to work with Prof. Sara Magliacane. I completed both my BS in Information Engineering and my MS in Computer Engineering at the University of Padova.
My research interests are in representation learning and multimodal large language models, with a focus on making them more human-centric.
News
- May 2026 - Our tutorial on Human-centric Embodied Multimodal Interaction (HEMI) has been accepted to ICMI! More info coming soon.
- May 2026 - CVTS26 was a success! Proud to support the Trento CV community :)
- May 2026 - Honored to be an Outstanding reviewer for CVPR26!
- Jan 2026 - Happy to share that MemCoach has been accepted as a Highlight at CVPR26! Keep pushing Francesco :)
Selected Publications
- How to Take a Memorable Picture? Empowering Users with Actionable Feedback, F. Laiti, D. Talon, J. Staiano, E. Ricci, The IEEE/CVF Conference on Computer Vision and Pattern Recognition (Highlight), 2026. [Homepage] [PDF] [Code] [Data]
- LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing, F. Girella, D. Talon, Z. Liu, Z. Ruan, Y. Wang, M. Cristani, The IEEE/CVF International Conference on Computer Vision (Oral), 2025. [Homepage] [PDF] [Data]
- Training-Free Personalization via Retrieval and Reasoning on Fingerprints, D. Das, D. Talon, Y. Wang, M. Mancini, E. Ricci, The IEEE/CVF International Conference on Computer Vision, 2025. [Homepage] [PDF] [Code] [Data]
- Evaluating Attribute Confusion in Fashion Text-to-Image Generation, Z. Liu, F. Girella, Y. Wang, D. Talon, International Conference on Image Analysis and Processing, 2025. [Homepage] [PDF] [Code]
- Seeing the Abstract: Translating the Abstract Language for Vision Language Models, D. Talon, F. Girella, Z. Liu, M. Cristani, Y. Wang, The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025. [Homepage] [PDF] [Code] [Poster] [Video]
For a complete list, see my Google Scholar profile or download my CV.
Teaching & Service
Teaching
- Advanced Multimodal Learning, Fall 2025, IECS Doctoral School - University of Trento
- Introduction to Machine Learning, Spring 2025, Data Science Master's Degree - University of Trento
Service to the Community
- Conference reviewer: BMVC (2021, 2026), ECCV (2024-26), IROS (2024), ICRA (2025), CVPR (2024-26), ACMMM (2025), ICPR (2026), NeurIPS (2026)
- Area Chair: BMVC (2025)
- Organizer: CVTS 2026, HEMI Tutorial at ICMI 2026, GreenFOMO Workshop at ECCV 2024
More
Outside research, I enjoy eating, drinking, outdoor sports and travelling.