Paper
22 May 2024 Research on unsupervised abnormal sound detection based on self-coding model
Heng Wang, Shuai Zhang, Jie Liu, Lei Yu, Chenglong Wang
Author Affiliations +
Proceedings Volume 13176, Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023); 131761U (2024) https://doi.org/10.1117/12.3029124
Event: Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023), 2023, Hangzhou, China
Abstract
In view of the difficulty of machine abnormal sound detection under the condition that abnormal sound samples are difficult to collect, this paper proposes an unsupervised abnormal sound detection model based on a self-coding model, which effectively improves the accuracy of abnormal sound detection under this condition. In this paper, location coding in Transformer is replaced with relational awareness self-attention to improve the representation capability of location coding. Secondly, the relevance scores in multi-head attention are mixed to enhance the understanding of context in the attention matrix. At the same time, Layer Normalization was replaced with Batch Normalization to speed up model training, and improved Transformer was introduced into the encoders and decoders of self-coding models. Finally, the improved self-coding model is used for unsupervised learning of the machine's normal sound to obtain the potential feature distribution of its normal sound. ToyADMOS and MIMII open data sets are used for experiments. Compared with traditional autoencoders and two improved self-coding models, The AUC score of toycar, Toycar, fan, slider and valve machines increased by 2.1%, 1.97%, 3.06%, 0.34% and 2.99%, respectively.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Heng Wang, Shuai Zhang, Jie Liu, Lei Yu, and Chenglong Wang "Research on unsupervised abnormal sound detection based on self-coding model", Proc. SPIE 13176, Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023), 131761U (22 May 2024); https://doi.org/10.1117/12.3029124
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Data modeling

Education and training

Feature extraction

Batch normalization

Fluctuations and noise

Machine learning

Back to Top