Paper
15 March 2019 GAN-based data augmentation for visual finger spelling recognition
Bogdan Kwolek
Author Affiliations +
Proceedings Volume 11041, Eleventh International Conference on Machine Vision (ICMV 2018); 110411U (2019) https://doi.org/10.1117/12.2522935
Event: Eleventh International Conference on Machine Vision (ICMV 2018), 2018, Munich, Germany
Abstract
In this work we extend WGAN-GP in order to achieve better generation of synthesized images for finger spelling classification. The main difference between the ordinary WGAN-GP and the proposed algorithm is that in the training we employ both training samples and training labels. These training labels are fed to the generator, that generates the synthetic images using both the randomized latent input and the input label. In ordinary WGAN-GP, latent input variables are usually sampled from an unconditional prior. In the proposed algorithm the latent input vector is a concatenation of random part, the class labels and additional variables that are drawn from Gaussian distributions representing hand poses or gesture attributes. The JSL dataset for Hiragana sign recognition has been balanced using the rendered samples on the basis of a 3D hand model as well as the extended WGAN-GP.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Bogdan Kwolek "GAN-based data augmentation for visual finger spelling recognition", Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 110411U (15 March 2019); https://doi.org/10.1117/12.2522935
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
3D modeling

Data modeling

Gallium nitride

Gesture recognition

Image classification

3D image processing

Statistical modeling

RELATED CONTENT

Augmented reality for spectral imaging applications
Proceedings of SPIE (June 18 2024)
A novel method of target recognition based on 3D color...
Proceedings of SPIE (October 08 2015)
Three dimensional view based on RS and GIS three...
Proceedings of SPIE (October 10 2000)

Back to Top