Davide Talon

Postdoctoral Researcher in Computer Vision

Fondazione Bruno Kessler, Trento, Italy

CV Scholar Linkedin Github X

About

I am having fun at Fondazione Bruno Kessler (FBK) working on efficient multi-modal learning. I am part of the Deep Visual Learning (DVL) unit, advised by Prof. Elisa Ricci and Dr. Yiming Wang.

Before joining FBK, I obtained my PhD in Computer Vision at the University of Genova and the Italian Institute of Technology (IIT) under the supervision of Dr. Alessio Del Bue and Prof. Stuart James. During the PhD, I visited the University of Amsterdam to work with Prof. Sara Magliacane. I completed both my BS in Information Engineering and my MS in Computer Engineering at the University of Padova.

My research interests are in representation learning and multimodal large language models, and in making them more human-centric.

News

Jul 2026 - So excited about my first grant: preeloox.ai was selected among the POC By Trentino projects!
Jul 2026 - Glad to share that our paper on sign-language text-motion retrieval has been accepted to ACMMM 26! Congrats Chang!!!
Jun 2026 - Good news :) Our “Personalizing MLLMs via Reinforced Multimodal Reference Game” has been accepted to ECCV 26! Well done Deepayan!
Jun 2026 - New preprint on reasoning! Check it out on arXiv
May 2026 - Our tutorial on Human-centric Embodied Multimodal Interaction (HEMI) has been accepted to ICMI! More info coming soon

Selected Publications

Kinematics-Centric Continuous Sign Language Retrieval with Gloss-Guided Boundary-Aware Alignment, C. Liu, K. Han, D. Talon, E. Ricci, N. Sebe, ACM International Conference on Multimedia, 2026. Coming soon!
Personalizing MLLMs via Reinforced Multimodal Reference Game, D. Das, D. Talon, Y. Wang, M. Mancini, E. Ricci, European Conference on Computer Vision, 2026. [Homepage] [PDF]
How to Take a Memorable Picture? Empowering Users with Actionable Feedback, F. Laiti, D. Talon, J. Staiano, E. Ricci, The IEEE/CVF Conference on Computer Vision and Pattern Recognition (Highlight), 2026. [Homepage] [PDF] [Code] [Data]
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing, F. Girella, D. Talon, Z. Liu, Z. Ruan, Y. Wang, M. Cristani, The IEEE/CVF International Conference on Computer Vision (Oral), 2025. [Homepage] [PDF] [Data]
Training-Free Personalization via Retrieval and Reasoning on Fingerprints, D. Das, D. Talon, Y. Wang, M. Mancini, E. Ricci, The IEEE/CVF International Conference on Computer Vision, 2025. [Homepage] [PDF] [Code] [Data]
Evaluating Attribute Confusion in Fashion Text-to-Image Generation, Z. Liu, F. Girella, Y. Wang, D. Talon, International Conference on Image Analysis and Processing, 2025. [Homepage] [PDF] [Code]
Seeing the Abstract: Translating the Abstract Language for Vision Language Models, D. Talon, F. Girella, Z. Liu, M. Cristani, Y. Wang, The IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025. [Homepage] [PDF] [Code] [Poster] [Video]

For a complete list, see my Google Scholar profile or download my CV.

Teaching & Service

Teaching

Advanced Multimodal Learning, Fall 2025, IECS Doctoral School - University of Trento
Introduction to Machine Learning, Spring 2025, Data Science Master's Degree - University of Trento

Service to the Community

Conference reviewer: BMVC (2021, 2026), ECCV (2024-26), IROS (2024), ICRA (2025), CVPR (2024-26), ACMMM (2025), ICPR (2026), NeurIPS (2026), WACV (2026)
Area Chair: BMVC (2025)
Organizer: CVTS 2026, HEMI Tutorial at ICMI 2026, GreenFOMO Workshop at ECCV 2024

Outside research, I enjoy eating, drinking, outdoor sports and travelling.