This page has only limited features, please log in for full access.
Convolutional neural networks (CNNs) with log-mel audio representation and CNN-based end-to-end learning have both been used for environmental event sound recognition (ESC). However, log-mel features can be complemented by features learned from the raw audio waveform with an effective fusion method. In this paper, we first propose a novel stacked CNN model with multiple convolutional layers of decreasing filter sizes to improve the performance of CNN models with either log-mel feature input or raw waveform input. These two models are then combined using the Dempster–Shafer (DS) evidence theory to build the ensemble DS-CNN model for ESC. Our experiments over three public datasets showed that our method could achieve much higher performance in environmental sound recognition than other CNN models with the same types of input features. This is achieved by exploiting the complementarity of the model based on log-mel feature input and the model based on learning features directly from raw waveforms.
Shaobo Li; Yong Yao; Jie Hu; Guokai Liu; Xuemei Yao; Jianjun Hu. An Ensemble Stacked Convolutional Neural Network Model for Environmental Event Sound Recognition. Applied Sciences 2018, 8, 1152 .
AMA StyleShaobo Li, Yong Yao, Jie Hu, Guokai Liu, Xuemei Yao, Jianjun Hu. An Ensemble Stacked Convolutional Neural Network Model for Environmental Event Sound Recognition. Applied Sciences. 2018; 8 (7):1152.
Chicago/Turabian StyleShaobo Li; Yong Yao; Jie Hu; Guokai Liu; Xuemei Yao; Jianjun Hu. 2018. "An Ensemble Stacked Convolutional Neural Network Model for Environmental Event Sound Recognition." Applied Sciences 8, no. 7: 1152.
Intelligent machine health monitoring and fault diagnosis are becoming increasingly important for modern manufacturing industries. Current fault diagnosis approaches mostly depend on expert-designed features for building prediction models. In this paper, we proposed IDSCNN, a novel bearing fault diagnosis algorithm based on ensemble deep convolutional neural networks and an improved Dempster–Shafer theory based evidence fusion. The convolutional neural networks take the root mean square (RMS) maps from the FFT (Fast Fourier Transformation) features of the vibration signals from two sensors as inputs. The improved D-S evidence theory is implemented via distance matrix from evidences and modified Gini Index. Extensive evaluations of the IDSCNN on the Case Western Reserve Dataset showed that our IDSCNN algorithm can achieve better fault diagnosis performance than existing machine learning methods by fusing complementary or conflicting evidences from different models and sensors and adapting to different load conditions.
Shaobo Li; Guokai Liu; Xianghong Tang; Jianguang Lu; Jianjun Hu. An Ensemble Deep Convolutional Neural Network Model with Improved D-S Evidence Fusion for Bearing Fault Diagnosis. Sensors 2017, 17, 1729 .
AMA StyleShaobo Li, Guokai Liu, Xianghong Tang, Jianguang Lu, Jianjun Hu. An Ensemble Deep Convolutional Neural Network Model with Improved D-S Evidence Fusion for Bearing Fault Diagnosis. Sensors. 2017; 17 (8):1729.
Chicago/Turabian StyleShaobo Li; Guokai Liu; Xianghong Tang; Jianguang Lu; Jianjun Hu. 2017. "An Ensemble Deep Convolutional Neural Network Model with Improved D-S Evidence Fusion for Bearing Fault Diagnosis." Sensors 17, no. 8: 1729.