Jesús M. Rodríguez-de-Vera

PhD Candidate in Computer Vision

University of Barcelona

Biography

I am a PhD student in Computer Vision at the University of Barcelona. My research interests include computer vision, deep learning and artificial intelligence. I work under the supervision of Dr. Petia Ivanova Radeva.

In this page I will publish the main results of my research, as well as other projects I am working on. If you want to know more about me, you can check my LinkedIn profile.

Interests

Artificial Intelligence
Computer Vision
Deep Learning

Education

PhD in Computer Vision, TBD
University of Barcelona
MSc in Artificial Intelligence, 2023
Polytechnic University of Catalonia (UPC)
BSc in Computer Science, 2020
University of Murcia
BSc in Mathematics, 2020
University of Murcia

Recent News

[11/06/2025] Visit the MetaFood workshop at CVPR'25 to know more about our challenge on food recognition! 🇺🇸🥘️

[20/01/2025] This week we are hosting the Winter School “Demistifying Artificial Intelligence” for college students from China. 🇨🇳

[05/04/2024] Our latest method, LOFI, has been accepted as an oral presentation at the CVPR'24 Workshop MTF. See you in Seattle! 🇺🇸

[29/10/2023] Very excited to present our work Dining on Details at MADiMa'23 in ACM Multimedia! 🇨🇦 🚀

[01/10/2023] We go to Paris to attend ICCV'23 and present a bunch of interesting projects! 🇫🇷 🚀

[19/09/2023] We presented a poster of our work Dining on Details at the 10th ACMCV in the Computer Vision Center.

Reviewing Experience

2025: CVPR (Conference on Computer Vision and Pattern Recognition), IJCNN, CVPRW (Conference on Computer Vision and Pattern Recognition Workshops), IbPRIA 2025, MTF CVPRW Challenge Organizer, NeurIPS (Conference on Neural Information Processing Systems)
2024: WACV (Winter Conference on Applications of Computer Vision)
2023: IEEE Transactions on Multimedia

Featured Publications

Imanol G. Estepa, Jesús M. Rodríguez-de-Vera, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

March, 2025 ArXiv

Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis

Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

July, 2024 ArXiv

Precision at Scale: Domain-Specific Datasets On-Demand

In the realm of self-supervised learning (SSL), conventional wisdom has gravitated towards the utility of massive, general domain datasets for pretraining robust backbones. In this paper, we challenge this idea by exploring if it is possible to bridge the scale between general-domain datasets and (traditionally smaller) domain-specific datasets to reduce the current performance gap. More specifically, we propose Precision at Scale (PaS), a novel method for the autonomous creation of domain-specific datasets on-demand. The modularity of the PaS pipeline enables leveraging state-of-the-art foundational and generative models to create a collection of images of any given size belonging to any given domain with minimal human intervention. Extensive analysis in two complex domains, proves the superiority of PaS datasets over existing traditional domain-specific datasets in terms of diversity, scale, and effectiveness in training visual transformers and convolutional neural networks. Most notably, we prove that automatically generated domain-specific datasets lead to better pretraining than large-scale supervised datasets such as ImageNet-1k and ImageNet-21k. Concretely, models trained on domain-specific datasets constructed by PaS pipeline, beat ImageNet-1k pretrained backbones by at least 12% in all the considered domains and classification tasks and lead to better food domain performance than supervised ImageNet-21k pretrain while being 12 times smaller.

Jesús M. Rodríguez-de-Vera, Pablo Villacorta, Imanol G. Estepa, Marc Bolaños, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

October, 2023 In Proceedings of the 8th International Workshop on Multimedia Assisted Dietary Management (MADiMa ‘23), co-located with ACM Multimedia 2023

Dining on Details: LLM-Guided Expert Networks for Fine-Grained Food Recognition

Dining on Details (DoD) is an innovative fine-grained food classification approach using large language models to sort dataset classes into subsets. Powered by the robust ImageBind embedding space, DoD excels in distinguishing similar classes. Universally compatible, DoD integrates seamlessly with any existing classification architecture. Extensive testing on various food datasets and backbones shows performance boosts of 0.5% to 1.61%, and even achieves SoTA results on the Food-101 dataset.

Recent Publications

Quickly discover relevant content by filtering publications.

Imanol G. Estepa, Jesús M. Rodríguez-de-Vera, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva (2025). Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis. ArXiv.

Cite ArXiv

Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva (2024). Precision at Scale: Domain-Specific Datasets On-Demand. ArXiv.

Cite ArXiv

Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Marc Bolaños, Bhalaji Nagarajan, Petia Radeva (2024). LOFI: LOng-tailed FIne-Grained Network for Food Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.

Cite CVF Open Access

Jesús M. Rodríguez-de-Vera, Pablo Villacorta, Imanol G. Estepa, Marc Bolaños, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva (2023). Dining on Details: LLM-Guided Expert Networks for Fine-Grained Food Recognition. In Proceedings of the 8th International Workshop on Multimedia Assisted Dietary Management (MADiMa ‘23), co-located with ACM Multimedia 2023.

Cite DOI

Imanol G Estepa, Jesús M. Rodríguez-de-Vera, Bhalaji Nagarajan, Petia Radeva (2023). Good Fences Make Good Neighbours. 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop, co-located with ICCV'23.

Cite OpenReview

See all publications

Recent & Upcoming Conferences

[MetaFood'24 CVPR-W] LOFI: LOng-tailed FIne-Grained Network for Food Recognition

Oral presentation of our work ‘LOFI: LOng-tailed FIne-Grained Network for Food Recognition’ at the MetaFood 2024 Workshop in the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024).

Jun 17, 2024 3:30 PM — 8:35 PM Seattle Convention Center

Jesús M. Rodríguez-de-Vera, Imanol G. Estepa, Marc Bolaños, Bhalaji Nagarajan, Petia Radeva

[MADiMa'23] Dining on Details: LLM-Guided Expert Networks for Fine-Grained Food Recognition

Oral and poster presentation of our work ‘Dining on Details’ at MADiMa 2023 in the 31st ACM Multimedia 2023.

Oct 29, 2023 1:30 PM — 5:00 PM

Jesús M. Rodríguez-de-Vera, Pablo Villacorta, Imanol G. Estepa, Marc Bolaños, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

[LatinX ICCV'23] Harnessing Automated Hierarchies for Triplet Contrast-based Fine-grained Recognition

Poster presentation of our extended abstract ‘Harnessing Automated Hierarchies for Triplet Contrast-based Fine-grained Recognition’ at the 2023 International Conference on Computer Vision (ICCV 2023).

Oct 3, 2023 3:55 PM — 5:10 PM Paris Convention Centre

Jesús M. Rodríguez-de-Vera, Imanol G Estepa, Bhalaji Nagarajan, Petia Radeva

[LatinX ICCV'23] Harnessing Automated Hierarchies for Triplet Contrast-based Fine-grained Recognition

[VIPriors'23] Good Fences Make Good Neighbours

Poster presentation of our work ‘Good Fences Make Good Neighbours’ at the 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop in the 2023 International Conference on Computer Vision (ICCV 2023).

Oct 2, 2023 8:45 AM — 1:00 PM Paris Convention Centre

Imanol G Estepa, Jesús M. Rodríguez-de-Vera, Bhalaji Nagarajan, Petia Radeva

[VIPriors'23] Good Fences Make Good Neighbours

[ACMCV'23] Dining on Details: LLM-Guided Expert Networks for Fine-Grained Food Recognition

Poster presentation of our work (accepted as oral at MADiMa'23, ACM Multimedia 2023) at the 10th Annual Catalan Meeting on Computer Vision (ACMCV 2023).

Sep 19, 2023 5:00 PM — 7:00 PM Computer Vision Center

Jesús M. Rodríguez-de-Vera, Pablo Villacorta, Imanol G. Estepa, Marc Bolaños, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

[ACMCV'23] Dining on Details: LLM-Guided Expert Networks for Fine-Grained Food Recognition

Teaching Experience

Generative AI Lecture, Winter School “Demistifying Artificial Intelligence”, 2025 - University of Barcelona: Invited lecturer.
Computer Vision, Bachelor’s Degree in Computer Science, 2024-2025 - University of Barcelona: Lab teacher.

Contact

My preferred way of contact is via LinkedIn. You can also send me an email to jesusmolrdv@gmail.com.