Paper
22 October 2024 Multiscale spatio-temporal hypergraph convolutional networks for 3D human pose estimation
Xiuning Chen, Huabin Wang, Shulin Dai, Hongrui Yuan
Author Affiliations +
Proceedings Volume 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024); 132740O (2024) https://doi.org/10.1117/12.3037586
Event: Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 2024, Haikou, HI, China
Abstract
In the domain of human pose estimation, graph convolutional networks have exhibited notable performance enhancements owing to their adeptness in naturally modeling the representation of human poses through graph structures. However, prevailing methods predominantly concentrate on the local physical connections between joints, overlooking higher-order neighboring nodes. This limitation curtails their ability to effectively exploit relationships between distant joints. This article introduces a Multiscale Spatio-Temporal Hypergraph Convolutional Network (MST-HCN) designed to capture spatio-temporal information and higher-order dependencies. MST-HCN encompasses two pivotal modules: Multiscale Hypergraph Convolution (MHCN) and Multiscale Temporal Convolution (MTCN). The MHCN module represents human poses as hypergraphs in various forms, enabling the comprehensive extraction of both local and global structural information. In contrast to traditional stride convolutions, MTCN leverages multiple branches to learn important frames based on their significance, thereby filtering out redundant frames. Experimental results underscore that MST-HCN surpasses state-of-the-art methods in benchmark tests such as Human3.6M and MPI-INF-3DHP.In particular, our proposed MST-HCN method boosts performance by 1.5% and 0.9%, compared to the closest latest method, using detected 2D poses and ground truth 2D settings respectively.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Xiuning Chen, Huabin Wang, Shulin Dai, and Hongrui Yuan "Multiscale spatio-temporal hypergraph convolutional networks for 3D human pose estimation", Proc. SPIE 13274, Sixteenth International Conference on Digital Image Processing (ICDIP 2024), 132740O (22 October 2024); https://doi.org/10.1117/12.3037586
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Pose estimation

Video

Back to Top