[LatinX ICCV'23] Harnessing Automated Hierarchies for Triplet Contrast-based Fine-grained Recognition

Start: 2023-10-03T15:55:00Z
End: 2023-10-03T17:10:00Z
Location: Paris Convention Centre

Jesús M. Rodríguez-de-Vera, Imanol G Estepa, Bhalaji Nagarajan, Petia Radeva

Abstract

Fine-grained classification is a challenging problem in which the objective is to distinguish between classes that are very similar to each other. The ability of triplet loss to model relations between samples makes it a good fit for fine-grained settings. In this work, we propose building an adaptive three-level hierarchy of samples and exploiting it via multi-level triplet contrast. Positives and negatives are sampled from a queue, which gives finer control over the variety and computational cost of sampling. We exploit cross-modal information by using a Universal Sentence Encoder to find similar categories and group them together. In addition, we use K-means to dynamically find subclasses within fine-grained categories. Experiments show that the proposed method significantly improves accuracy on two popular fine-grained classification benchmarks, including a gain of +0.58 on CUB-200-2011.
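As an illustration only, the following Python sketch shows one way a multi-level triplet loss over a sample queue could look. It is not the authors' implementation: the label layout (group, category, subclass), the hardest-sample selection, the margin value, the averaging across levels, and all function names are assumptions made for this example. The hierarchy itself, which the abstract obtains from sentence embeddings and K-means, is taken as given here.

```python
from collections import deque

import numpy as np


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet margin loss on Euclidean distances."""
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(0.0, float(d_pos - d_neg + margin))


def multi_level_triplet_loss(anchor, labels, queue, margin=0.2):
    """Average a triplet loss over every level of a label hierarchy.

    labels: tuple (group, category, subclass); hypothetical layout.
    queue:  iterable of (embedding, labels) pairs from earlier batches.
    """
    per_level = []
    for level in range(len(labels)):
        prefix = labels[: level + 1]
        # Positives share the anchor's labels down to this level;
        # every other queued sample is a negative at this level.
        pos = [e for e, l in queue if l[: level + 1] == prefix]
        neg = [e for e, l in queue if l[: level + 1] != prefix]
        if not pos or not neg:
            continue  # this level cannot form a triplet from the queue
        # Assumption: pick the farthest positive and the closest negative.
        hardest_pos = max(pos, key=lambda e: np.linalg.norm(anchor - e))
        hardest_neg = min(neg, key=lambda e: np.linalg.norm(anchor - e))
        per_level.append(triplet_loss(anchor, hardest_pos, hardest_neg, margin))
    return float(np.mean(per_level)) if per_level else 0.0


# A bounded queue caps memory and sampling cost: old entries fall out.
queue = deque(maxlen=4)
queue.append((np.array([1.0, 0.0]), (0, 0, 0)))  # same subclass as anchor
queue.append((np.array([0.0, 1.0]), (0, 0, 1)))  # same category, other subclass
queue.append((np.array([3.0, 0.0]), (0, 1, 0)))  # same group, other category
queue.append((np.array([0.0, 5.0]), (1, 2, 0)))  # different group

anchor = np.array([0.0, 0.0])
loss = multi_level_triplet_loss(anchor, (0, 0, 0), queue)
print(loss)
```

In this toy setup only the subclass level contributes a non-zero term, because the closest sample from a different subclass is as near to the anchor as the positive. A real training loop would compute the same quantity on differentiable tensors and refresh the queue with each batch.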

Date

Oct 3, 2023 3:55 PM — 5:10 PM

Event

LatinX in Computer Vision Workshop at ICCV'23

Location

Paris Convention Centre

1 Place de la Porte de Versailles, Paris, Île-de-France 75015

This poster corresponds to work in progress (extended abstract). No published paper is available yet.

ICCV 2023 ICCV Conference 2023 Extended abstract Fine-grained Poster LatinX

Jesús M. Rodríguez-de-Vera

PhD Candidate in Computer Vision