Dining on Details (DoD) is an innovative
fine-grained food classification approach using large language models to sort dataset classes into subsets. Powered by the robust ImageBind embedding space, DoD excels in distinguishing similar classes. Universally compatible, DoD integrates seamlessly with any existing classification architecture. Extensive testing on various food datasets and backbones shows performance boosts of 0.5% to 1.61%, and even achieves
SoTA results on the Food-101 dataset.