1 April 2019 ASP-CNN: aligning semantic parts for fine-grained image classification
Hao Ge, Xiaoguang Tu, Mei Xie, Zheng Ma
Author Affiliations +
Abstract
Recently, numerous methods have been proposed to tackle the problem of fine-grained image classification (FGIC). Most of them follow a two-step strategy that contains detecting the object regions and classifying with the features extracted from these regions. For the feature extraction, the most popular method is directly cropping the feature maps according to the location of detected part regions. However, one challenge of such a method is that the direction of the semantic parts may vary in different images, therefore, it is necessary to capture such differences for better classification. We propose a CNN architecture by aligning semantic parts (ASP-CNN) for FGIC, aiming to increase the interclass variance and meanwhile reduce the intraclass variance in fine-grained datasets. Extensive experiments on CUB-200-2011 and CUB-200-2010 show the effectiveness of our ASP-CNN.
© 2019 SPIE and IS&T 1017-9909/2019/$25.00 © 2019 SPIE and IS&T
Hao Ge, Xiaoguang Tu, Mei Xie, and Zheng Ma "ASP-CNN: aligning semantic parts for fine-grained image classification," Journal of Electronic Imaging 28(2), 023024 (1 April 2019). https://doi.org/10.1117/1.JEI.28.2.023024
Received: 15 January 2019; Accepted: 20 March 2019; Published: 1 April 2019
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image classification

Classification systems

Head

Feature extraction

Facial recognition systems

Breast

Detection and tracking algorithms

Back to Top