I am a PhD student in Computer Vision at the University of Barcelona. My research interests include computer vision, deep learning and artificial intelligence. I work under the supervision of Dr. Petia Ivanova Radeva.
PhD in Computer Vision, TBD
University of Barcelona
MSc in Artificial Intelligence, 2023
Polytechnic University of Catalonia (UPC)
BSc in Computer Science, 2020
University of Murcia
BSc in Mathematics, 2020
University of Murcia
[11/06/2025] Visit the MetaFood workshop at CVPR'25 to know more about our challenge on food recognition! 🇺🇸🥘️
[20/01/2025] This week we are hosting the Winter School “Demistifying Artificial Intelligence” for college students from China. 🇨🇳
[05/04/2024] Our latest method, LOFI, has been accepted as an oral presentation at the CVPR'24 Workshop MTF. See you in Seattle! 🇺🇸
[29/10/2023] Very excited to present our work Dining on Details at MADiMa'23 in ACM Multimedia! 🇨🇦 🚀
[01/10/2023] We go to Paris to attend ICCV'23 and present a bunch of interesting projects! 🇫🇷 🚀
[19/09/2023] We presented a poster of our work Dining on Details at the 10th ACMCV in the Computer Vision Center.
2025: CVPR (Conference on Computer Vision and Pattern Recognition), IJCNN, CVPRW (Conference on Computer Vision and Pattern Recognition Workshops), IbPRIA 2025, MTF CVPRW Challenge Organizer, NeurIPS (Conference on Neural Information Processing Systems)
2024: WACV (Winter Conference on Applications of Computer Vision)
2023: IEEE Transactions on Multimedia
Recent self-supervised learning methods rely on massive general-domain datasets for robust model pretraining. However, these datasets may lack specificity required in specialized domains. Collecting large, supervised datasets to compensate for this limitation is also cumbersome. This raises a key question: Can automatically crafted domain-specific datasets serve as efficient and effective SSL pretrainers, performing comparable to—or even surpassing—much larger state-of-the-art general-domain datasets? To address this challenge, we propose Precision at Scale (PaS), a novel modular pipeline for automatic creation of domain-specific datasets on-demand. PaS leverages Large Language Models (LLMs) and Vision-Language Models (VLMs) through three distinct phases: Concept Generation, where LLMs identify relevant domain concepts; Image Collection, utilizing VLMs and Generative models to gather appropriate images; Data Curation, ensuring quality and relevance by eliminating unrelated or redundant images. We conduct extensive experiments across three complex domains — food, insects, and birds — proving that PaS datasets compete and often surpass existing domain-specific datasets in diversity, scale, and effectiveness as pretrainers. Models pretrained on PaS datasets outperform those trained on large-scale general-domain datasets (ImageNet-1K) by up to 21% and surpass same-scale domain-specific datasets by 6.7% across classification tasks. Notably, despite being an order of magnitude smaller, PaS datasets outperform ImageNet-21K pretraining, with improvements of 3.3 % in fine-tuning and 9.5% in few-shot learning, and showing superior performance on specialized dense tasks. Furthermore, by efficiently fine-tuning pretrained VLMs like CLIP and SigLIP using low-rank methods, we achieve performance gains (+4.2 % over CLIP) in specialized domains with minimal overhead, demonstrating the versatility of PaS datasets.
Dining on Details (DoD) is an innovative fine-grained food classification approach using large language models to sort dataset classes into subsets. Powered by the robust ImageBind embedding space, DoD excels in distinguishing similar classes. Universally compatible, DoD integrates seamlessly with any existing classification architecture. Extensive testing on various food datasets and backbones shows performance boosts of 0.5% to 1.61%, and even achieves SoTA results on the Food-101 dataset.
Generative AI Lecture, Winter School “Demistifying Artificial Intelligence”, 2025 - University of Barcelona: Invited lecturer.
Computer Vision, Bachelor’s Degree in Computer Science, 2024-2025 - University of Barcelona: Lab teacher.