KEYWORDS: Speech recognition, Data modeling, Performance modeling, Evolutionary algorithms, Acoustics, Detection and tracking algorithms, Signal processing, Process modeling, Optical filters
In this paper, we focus on the application of the LightGBM model to audio classification. Although convolutional neural networks (CNNs) generally achieve superior performance, the LightGBM model possesses notable advantages, such as low computational cost, feasibility of parallel implementation, and comparable accuracy on many datasets. To improve the generalization ability of the model, data augmentation operations are performed on the audio clips, including pitch shifting, time stretching, dynamic range compression, and adding white noise. The accuracy of speech recognition depends heavily on the reliability of the representative features extracted from the audio signal. The audio signal is originally a one-dimensional time series, in which frequency changes are difficult to visualize. Hence it is necessary to extract the discernible components of the audio signal. To improve the representative capacity of our proposed model, we use the Mel spectrum and MFCC (Mel-Frequency Cepstral Coefficients) to select features as two-dimensional input that accurately characterizes the internal information of the signal. The techniques described in this paper are mainly trained on the Google Speech Commands dataset. The experimental results show that the method, an optimized LightGBM model based on the Mel spectrum, can achieve high word classification accuracy.
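One of the augmentation steps named in this abstract, adding white noise, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names `add_white_noise` and `measure_snr_db`, the Gaussian noise model, and the choice to control the noise level through a target signal-to-noise ratio are all assumptions.

```python
import math
import random


def add_white_noise(signal, snr_db, seed=0):
    """Return a copy of `signal` with white Gaussian noise at a target SNR (dB).

    Hypothetical helper: the abstract does not say how the noise level is chosen.
    The signal must be non-empty and not all zeros.
    """
    rng = random.Random(seed)
    signal_power = sum(x * x for x in signal) / len(signal)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    noise_power = sum(n * n for n in noise) / len(noise)
    # Rescale the noise so that signal_power / noise_power matches the target SNR.
    target_noise_power = signal_power / (10 ** (snr_db / 10))
    scale = math.sqrt(target_noise_power / noise_power)
    return [x + scale * n for x, n in zip(signal, noise)]


def measure_snr_db(clean, noisy):
    """Measure the SNR in dB of `noisy` relative to the clean reference."""
    signal_power = sum(x * x for x in clean)
    noise_power = sum((y - x) ** 2 for x, y in zip(clean, noisy))
    return 10 * math.log10(signal_power / noise_power)


# Usage: a 0.1 s, 440 Hz tone sampled at 16 kHz, degraded to 20 dB SNR.
clean = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(1600)]
noisy = add_white_noise(clean, snr_db=20.0)
```

Rescaling the sampled noise, rather than drawing it with a fixed standard deviation, makes the realized SNR match the target for each clip regardless of its loudness.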
Feature extraction and utilization are of great importance for machine fault diagnosis. In this paper, a multi-head deep learning network is proposed to classify machine health status using features of different sizes. First, statistical characteristics that reflect the state of the machine signal in the time domain and frequency domain are combined into feature vectors that serve as one-dimensional network input. Second, the Mel power spectrum and its incremental characteristics are used as three-channel two-dimensional network input. Finally, the multi-head network analyzes both the one-dimensional and two-dimensional features using two different sub-networks and classifies the machine health status according to the joint feature analysis. Experiments on the bearing working status database of Case Western Reserve University show that the proposed method has good mechanical signal classification ability and better stability. Moreover, the final test accuracy of fault diagnosis on 16 kinds of bearing working signals reaches about 99.53%.
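The one-dimensional input described in this abstract is built from statistical characteristics of the signal. The abstract does not list which statistics are used, so the sketch below assumes a few that are common in vibration analysis (mean, RMS, peak, crest factor, kurtosis); the function name `time_domain_features` is hypothetical.

```python
import math


def time_domain_features(signal):
    """Compute a few time-domain statistics of a vibration signal.

    The choice of statistics is an assumption, not taken from the paper.
    A constant signal has zero variance and would raise ZeroDivisionError.
    """
    n = len(signal)
    mean = sum(signal) / n
    centered = [x - mean for x in signal]
    variance = sum(c * c for c in centered) / n
    rms = math.sqrt(sum(x * x for x in signal) / n)
    peak = max(abs(x) for x in signal)
    fourth_moment = sum(c ** 4 for c in centered) / n
    return {
        "mean": mean,
        "rms": rms,
        "peak": peak,
        "crest_factor": peak / rms,               # how impulsive the signal is
        "kurtosis": fourth_moment / variance ** 2,  # non-excess kurtosis
    }


# Usage: a square-like signal has crest factor 1 and kurtosis 1.
features = time_domain_features([1.0, -1.0, 1.0, -1.0])
```

Concatenating such scalars into one vector per signal segment gives a fixed-length input regardless of segment length, which is what allows it to be paired with a separate two-dimensional spectral input.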
The conventional channel detector based on the maximum a posteriori (MAP) algorithm for coded multiple-input multiple-output (MIMO) multiuser systems has a computational complexity growing exponentially with the product of the number of users, the number of transmit antennas, and the symbol constellation size. In this paper, we consider the multiuser detection problem from a combinatorial optimization viewpoint and develop a low-complexity iterative receiver based on the evolutionary programming (EP) technique. Simulation results show that with the proposed receiver, the performance of coded multiuser systems approaches that of the iteratively MAP-decoded single-user (SU) MIMO system at a significantly reduced computational complexity even for unknown channel scenarios.
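The combinatorial-optimization view of multiuser detection can be illustrated with a toy evolutionary search. The sketch below is a strong simplification and not the receiver proposed in the paper: it assumes uncoded BPSK symbols, a known real-valued channel matrix, hard decisions, bit-flip mutation, and plain truncation selection over parents and offspring. The names `residual_cost` and `ep_detect` and all parameter defaults are hypothetical.

```python
import random


def residual_cost(H, y, x):
    """Squared Euclidean distance ||y - Hx||^2 for a candidate symbol vector x."""
    cost = 0.0
    for row, y_i in zip(H, y):
        estimate = sum(h * s for h, s in zip(row, x))
        cost += (y_i - estimate) ** 2
    return cost


def ep_detect(H, y, pop_size=8, generations=40, flip_prob=0.25, seed=0):
    """Search for the BPSK vector x in {-1,+1}^n that minimizes ||y - Hx||^2.

    Toy sketch: each parent produces one mutated offspring per generation,
    and the best `pop_size` candidates survive (an assumed selection rule).
    """
    rng = random.Random(seed)
    n = len(H[0])
    population = [[rng.choice((-1, 1)) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        # Mutation: flip each symbol independently with probability flip_prob.
        offspring = [
            [-s if rng.random() < flip_prob else s for s in parent]
            for parent in population
        ]
        merged = population + offspring
        merged.sort(key=lambda x: residual_cost(H, y, x))
        population = merged[:pop_size]
    best = population[0]
    return best, residual_cost(H, y, best)


# Usage: three BPSK users observed through a 4x3 channel without noise.
H = [[1.0, 0.3, -0.2], [0.1, 0.9, 0.4], [-0.3, 0.2, 1.1], [0.5, -0.4, 0.2]]
sent = [1, -1, 1]
y = [sum(h * s for h, s in zip(row, sent)) for row in H]
detected, cost = ep_detect(H, y)
```

Each generation costs a fixed number of residual evaluations, so the work grows with the population size and generation count rather than with the number of candidate symbol vectors, which is the motivation for a search of this kind over exhaustive evaluation.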
KEYWORDS: Sensors, Receivers, Signal to noise ratio, Signal attenuation, Detection and tracking algorithms, Reliability, Systems modeling, Computer programming, Sensor performance
In this paper, we propose a fully graph-based iterative detection and decoding scheme for low-density parity-check (LDPC) coded generalized two-dimensional (2D) intersymbol interference (ISI) channels. The 2D detector consists of a downtrack detector based on the symbol-level sum-product algorithm (SPA) and a bit-level SPA-based crosstrack detector. An LDPC decoder based on simplified check node operations is used to provide soft information for the 2D channel detector. Numerical results show that the proposed receiver significantly reduces the decoding complexity and also achieves better performance than the trellis-based BCJR detector over 2×2 2D channels.
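The abstract does not say which simplification of the check node operation is used. As an illustration only, the sketch below compares the exact sum-product check node update with the min-sum approximation, one widely used simplification; whether the paper uses min-sum is an assumption. Messages are log-likelihood ratios (LLRs), and the function names are hypothetical.

```python
import math


def check_node_exact(llrs):
    """Exact sum-product update: out_i = 2 * atanh(prod_{j != i} tanh(l_j / 2))."""
    out = []
    for i in range(len(llrs)):
        prod = 1.0
        for j, l in enumerate(llrs):
            if j != i:
                prod *= math.tanh(l / 2.0)
        out.append(2.0 * math.atanh(prod))
    return out


def check_node_min_sum(llrs):
    """Min-sum approximation: sign product times smallest magnitude of the others."""
    out = []
    for i in range(len(llrs)):
        others = [l for j, l in enumerate(llrs) if j != i]
        sign = 1.0
        for l in others:
            if l < 0:
                sign = -sign
        out.append(sign * min(abs(l) for l in others))
    return out


# Usage: a degree-3 check node with incoming LLRs 2.0, -0.5 and 1.5.
incoming = [2.0, -0.5, 1.5]
approx = check_node_min_sum(incoming)  # [-0.5, 1.5, -0.5]
exact = check_node_exact(incoming)
```

Min-sum replaces the hyperbolic functions with comparisons and sign flips. Its outputs keep the sign of the exact update but are never smaller in magnitude, which is why practical decoders often add a scaling or offset correction.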