Wind Sounds Classification Using Different Audio Feature Extraction Techniques

Wala'a Nsaif Jasim, Saba Abdual Wahid Saddam, Esra'a Jasem Harfash

Abstract


   In this research, Different audio feature extraction technique are implemented and classification approaches are presented to classify seven types of wind. Where we applied features technique such as Zero Crossing Rate (ZCR) ,Fast Fourier Transformation (FFT), Linear predictive coding (LPC), Perceptual Linear Prediction (PLP). We know that some of these methods are good with human voices, but we tried to apply them here to characterize the wind audio content. The CNN classification method is implemented to determine the class of input wind sound signal. Experimental results show that each of these extraction feature methods are gave different results, but classification accuracy that are obtained of PLP features proven to have the best results.


Full Text:

PDF

References


G. Sharma, et al., "Trends in Audio Signal Feature Extraction Methods," Elsevie, vol. 158, pp. 1-21 .2020.

H. Purwins, et al., 2019, "Deep Learning for Audio Signal Processing," IEEE Journal of Selected Topics in Signal Processing,vol. 14, pp. 1-14.

T. Andersson , "Audio Classification and Content Desicription, " M Sc. Thesis, Department of Computer Science Electrical Engeerring , University of Techonology.

S. Liang and X. Fan, "Audio Content Classification Method Research based on Two-Step Strategy," International Journal of Advanced Computer Science and Applications,vol. 5, pp. 57-62, 2014.

D. Moffat, et al., "An Evaluation of Audio Feature Extraction Toolboxes," in Proc. of the 18th International Conference on Digital Audio Effects, 2015, pp. 1-7.

A. Krizhevsky, et al., "Imagenet Classification with Deep Convolutional Neural Networks," in the Proceedings of the 25th International Conference on Neural Information Processing Systems, 2012, pp. 84-90.

P. Zinemanas, et al., "An Interpretable Deep Learning Model for Automatic Sound Classification," Electronics, vol. 10 , pp. 1-23.2021.

Goodfellow, et al., "Deep Learning," MIT Press: Cambridge, 2016.

J. Xie, et al., "Investigation of Different CNN-Dased Models for Improved Bird Sound Classification," IEEE Access, vol. 7, pp. 175353-175361, 2019.

L. Yang, et al., "Sound Classification Based on Multihead Attention and Support," Mathematical Problems in Engineering,. ,pp. 1-11, 2021.

Y. Su, et al., "Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion," Sensors, vol. 19, pp. 1-15, 2019.

X. Valero and F. Alías, "Análisis De La Señal Acústica Mediante Coeficientes Cepstrales Bio-Inspirados y su Aplicación Al Reconocimiento De Paisajes Sonoros," VIII Congr. Ibero-americano Acústica, 2012, pp. 1-9.

C. Chang and B. Doran, "Urban Sound Classification: With Random Forest SVM DNN RNN and CNN Classifiers," in CSCI E-81 Machine Learning and Data Mining Final Project Fall 2016, Harvard University NCambridge, 2016.

M. A. Acevedo, et al.,"Automated Classification of Bird and Amphibian Calls Using Machine Learning: A Comparison of Methods," Ecological Informatics,vol. 4, pp. 206-214, 2009.

S. Adavanne, et al., "Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection," in 25th European signal processing conference (EUSIPCO), IEEE, 2017, pp. 1729-1733.

L. Nanni, et al., "Spectrogram Classification Using Dissimilarity Space," Applied Sciences, vol. 10, pp. 1-17, 2020.

S. L. Ullo, et al., "Hybrid Computerized Method for Environmental Sound Classification," IEEE Access, vol. 8, pp. 124055-124065, 2020.

M. Ahmed, et al., "Automatic Environmental Sound Recognition (AESR) Using Convolutional Neural Network," International Journal of Modern Education & Computer Science,vol. 12, pp. 42-54,2020.

I. Diez Gaspon, et al., "Deep Learning For Natural Sound Classification," in Inter-Noise and Noise-Con Congress and Conference Proceedings, Institute of Noise Control Engineering, 2019, pp. 5683-5692..

A. Khamparia, et al., "Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network," IEEE Access, vol. 7, pp. 7717-7727, 2019.

M. Malfante, et al., "Automatic Fish Sounds Classification," The Journal of the Acoustical Society of America, vol. 143, pp. 2834-2846, 2018.

S. Sivasankaran and K. Prabhu,"Robust Features for Environmental Sound Classification," in 2013 IEEE International Conference on Electronics, Computing and Communication Technologies, IEEE, 2013, pp. 1-6.

S. Sigtia, et al., "Automatic Environmental Sound Recognition: Performance Versus Computational Cost," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, pp. 2096-2107,2016.

I. Paraskevas, "Phase as a Feature Extraction Tool for Audio Classification and Signal Localisation," University of Surrey (United Kingdom),2005

J. Zhang,"Music Feature Extraction and Classification Algorithm Based on Deep Learning," Hindawi, Scientific Programming, pp. 1-9, 2021.

T. Giannakopoulos and A. Pikrakis, "Introduction to Audio Analysis: a MATLAB® Approach," Academic Press, 2014.

M. Müller, "Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications," Springer, 2015.

R. L. Herman, "An Introduction to Fourier Analysis," Chapman and Hall/CRC, 2016.

A. Terenzi, et al., "Features Extraction Applied to the Analysis of the Sounds Emitted by Honey Bees in a Beehive," in 2019 11 International Symposium on Image and Signal Processing and Analysis (ISPA), IEEE, 2019, pp. 03-08.

L. Grama and C. Rusu,"Audio Signal Classification Using Linear Predictive Coding and Random Forests," in 2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), IEEE, 2017, pp. 1-9.

N. Dave, "Feature Extraction Methods LPC, PLP and MFCC in Speech Recognition," International journal for Advance Research in Engineering and Technology,vol. 1, pp. 1-4, 2013.

[33] S. Hershey, et al., "CNN Architectures for Large-Scale Audio Classification," in 2017 IEEE International Conference on Acoustics, Speech .




DOI: https://doi.org/10.31449/inf.v45i7.3739

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.