24 January 2024 TEG: image theme recognition using text-embedding-guided few-shot adaptation
Jikai Wang, Wanglong Lu, Yu Wang, Kaijie Shi, Xianta Jiang, Hanli Zhao
Author Affiliations +
Abstract

Grouping images into different themes is a challenging task in photo book curation. Unlike image object recognition, image theme recognition focuses on the understanding of the main subject or overall meaning conveyed by an image. However, it is challenging to achieve satisfactory performance using existing general image recognition methods. In this work, we aim to solve the image theme recognition task with few-shot training samples using pre-trained contrastive language-image models. A text-prompt-guided few-shot image adaptation framework is proposed, which incorporates a text-embedding-guided classifier and an auxiliary classification loss to exploit embedded visual and text features, stabilize the network training, and enhance recognition performance. We also present an annotated dataset Theme25 for studying image theme recognition. We conducted experiments on our Theme25 dataset as well as the publicly available CIFAR100 and ImageNet datasets to demonstrate the superiority of our method over the compared state-of-the-art methods.

© 2024 SPIE and IS&T
Jikai Wang, Wanglong Lu, Yu Wang, Kaijie Shi, Xianta Jiang, and Hanli Zhao "TEG: image theme recognition using text-embedding-guided few-shot adaptation," Journal of Electronic Imaging 33(1), 013028 (24 January 2024). https://doi.org/10.1117/1.JEI.33.1.013028
Received: 3 September 2023; Accepted: 2 January 2024; Published: 24 January 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Education and training

Visualization

Object recognition

Image classification

Machine learning

Visual process modeling

Data modeling

Back to Top