Skip to main content
Log in

LBP-based bird sound classification using improved feature selection algorithm

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

Local binary pattern (LBP)-based features for bird sound classification were investigated in this study, including both one-dimensional (LBP-1D) and two-dimensional (LBP-2D) local binary patterns. Specifically, the discrete wavelet transform was first used as a pooling method to generate multi-level features in both time (LBP-1D-T) and frequency domain (LBP-1D-F) signals. To obtain richer time–frequency information of bird sounds, uniform patterns (LBP-2D) were extracted from the log-scaled Mel spectrogram. To fully exploit the complementarity of different LBP features, a hybrid fusion method was implemented. Next, neighborhood component analysis (NCA) was employed as a feature selection method to remove redundant information in the fused feature set. In order to reduce the running time of NCA and improve the classification accuracy, an improved feature selection method (DSNCA) was proposed. Finally, two machine learning algorithms: K-nearest neighbor and support vector machine were used for classification. Experimental results on 43 bird species of North American wood-warblers indicated that LBP-2D achieved a higher balanced-accuracy than LBP-1D-T and LBP-1D-F (86.33%, 81.05% and 70.02%, respectively). In addition, the highest classification accuracy was up to 88.70%, using hybrid fusion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Data availability

Our proposed approach is implemented in Python, and the code will be publicly available on https://github.com/jiang-rgb?tab=repositories for future comparison and development by other researchers.

References

  • Akbal, E. (2020). An automated environmental sound classification methods based on statistical and textural feature. Applied Acoustics, 167, 107413.

    Article  Google Scholar 

  • Bairlein, F. (2016). Migratory birds under threat. Science, 354(6312), 547–548.

    Article  Google Scholar 

  • Bottou, L. (2010). Large-scale machine learning with stochastic gradient descent. In: Proceedings of COMPSTAT'2010 (pp. 177–186).

  • Demir, S., Key, S., Tuncer, T., & Dogan, S. (2020). An exemplar pyramid feature extraction based humerus fracture classification method. Medical Hypotheses, 140, 109663.

    Article  Google Scholar 

  • Emlen, J. T., & DeJong, M. J. (1992). Counting birds: The problem of variable hearing abilities (contando aves: El problema de la variabilidad en la capacidad auditiva). Journal of Field Ornithology, 63, 26–31.

    Google Scholar 

  • Faris, H., Abukhurma, R., Almanaseer, W., Saadeh, M., Mora, A. M., Castillo, P. A., & Aljarah, I. (2020). Improving financial bankruptcy prediction in a highly imbalanced class distribution using oversampling and ensemble learning: A case from the Spanish market. Progress in Artificial Intelligence, 9(1), 31–53.

    Article  Google Scholar 

  • Farnsworth, A., Sheldon, D., Geevarghese, J., Irvine, J., Van Doren, B., Webb, K., et al. (2014). Reconstructing velocities of migrating birds from weather radar—A case study in computational sustainability. AI Magazine, 35(2), 31–48.

    Article  Google Scholar 

  • Han, N. C., Muniandy, S. V., & Dayou, J. (2011). Acoustic classification of Australian anurans based on hybrid spectral-entropy approach. Applied Acoustics, 72(9), 639–645.

    Article  Google Scholar 

  • Hutto, R. L., & Stutzman, R. J. (2009). Humans versus autonomous recording units: A comparison of point-count results. Journal of Field Ornithology, 80(4), 387–398.

    Article  Google Scholar 

  • Kaya, Y., Uyar, M., Tekin, R., & Yıldırım, S. (2014). 1D-local binary pattern based feature extraction for classification of epileptic EEG signals. Applied Mathematics and Computation, 243, 209–219.

    Article  MathSciNet  Google Scholar 

  • Kays, R., Crofoot, M. C., Jetz, W., & Wikelski, M. (2015). Terrestrial animal tracking as an eye on life and planet. Science, 348(6240), aaa2478.

    Article  Google Scholar 

  • Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint. arXiv:1412.6980.

  • Kuncan, M., Kaplan, K., Minaz, M. R., Kaya, Y., & Ertunc, H. M. (2020). A novel feature extraction method for bearing fault classification with one dimensional ternary patterns. ISA Transactions, 100, 346–357.

    Article  Google Scholar 

  • Lanzone, M., Deleon, E., Grove, L., & Farnsworth, A. (2009). Revealing undocumented or poorly known flight calls of warblers (Parulidae) using a novel method of recording birds in captivity. The Auk, 126(3), 511–519.

    Article  Google Scholar 

  • Logan, B. (2000). Mel frequency cepstral coefficients for music modeling. Ismir, 270, 1–11.

    Google Scholar 

  • Loss, S. R., Will, T., & Marra, P. P. (2015). Direct mortality of birds from anthropogenic causes. Annual Review of Ecology, Evolution, and Systematics, 46, 99–120.

    Article  Google Scholar 

  • Lostanlen, V., Palmer, K., Knight, E., Clark, C., Klinck, H., Farnsworth, A., et al. (2019). Long-distance detection of bioacoustic events with per-channel energy normalization. arXiv preprint. arXiv:1911.00417.

  • Marcarini, M., Williamson, G. A., & de Sisternes Garcia, L. (2008). Comparison of methods for automated recognition of avian nocturnal flight calls. In 2008 IEEE international conference on acoustics, speech and signal processing (pp. 2029–2032).

  • Nanni, L., Costa, Y. M., Aguiar, R. L., Mangolin, R. B., Brahnam, S., & Silla, C. N. (2020). Ensemble of convolutional neural networks to improve animal audio classification. EURASIP Journal on Audio, Speech, and Music Processing, 2020, 1–14.

    Article  Google Scholar 

  • Ojala, T., Pietikainen, M., & Maenpaa, T. (2002). Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(7), 971–987.

    Article  Google Scholar 

  • Rosenstock, S. S., Anderson, D. R., Giesen, K. M., Leukering, T., & Carter, M. F. (2002). Landbird counting techniques: Current practices and an alternative. The Auk, 119(1), 46–53.

    Article  Google Scholar 

  • Salamon, J., Bello, J. P., Farnsworth, A., & Kelling, S. (2017). Fusing shallow and deep learning for bioacoustic bird species classification. In 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 141–145).

  • Salamon, J., Bello, J. P., Farnsworth, A., Robbins, M., Keen, S., Klinck, H., & Kelling, S. (2016). Towards the automatic classification of avian flight calls for bioacoustic monitoring. PLoS ONE, 11(11), e0166866.

    Article  Google Scholar 

  • Schrama, T., Poot, M., Robb, M., & Slabbekoorn, H. (2007). Automated monitoring of avian flight calls during nocturnal migration. In International Expert meeting on IT-based detection of bioacoustical patterns (pp. 131–134).

  • Selin, A., Turunen, J., & Tanttu, J. T. (2006). Wavelets in recognition of bird sounds. EURASIP Journal on Advances in Signal Processing, 2007, 1–9.

    Article  Google Scholar 

  • Stanley, C. Q., McKinnon, E. A., Fraser, K. C., Macpherson, M. P., Casbourn, G., Friesen, L., et al. (2015). Connectivity of wood thrush breeding, wintering, and migration sites based on range-wide tracking. Conservation Biology, 29(1), 164–174.

    Article  Google Scholar 

  • Thakur, A., Thapar, D., Rajan, P., & Nigam, A. (2019). Multiscale CNN based deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss. arXiv preprint. arXiv:1903.10713.

  • Tuncer, T., Dogan, S., Ertam, F., & Subasi, A. (2020a). A dynamic center and multi threshold point based stable feature extraction network for driver fatigue detection utilizing EEG signals. Cognitive Neurodynamics, 15, 223–237.

    Article  Google Scholar 

  • Tuncer, T., & Ertam, F. (2020). Neighborhood component analysis and reliefF based survival recognition methods for Hepatocellular carcinoma. Physica A: Statistical Mechanics and Its Applications, 540, 123143.

    Article  Google Scholar 

  • Tuncer, T., Subasi, A., Ertam, F., & Dogan, S. (2020b). A novel spiral pattern and 2D M4 pooling based environmental sound classification method. Applied Acoustics, 170, 107508.

    Article  Google Scholar 

  • Xie, J., Hu, K., Zhu, M., Yu, J., & Zhu, Q. (2019). Investigation of different CNN-based models for improved bird sound classification. IEEE Access, 7, 175353–175361.

    Article  Google Scholar 

  • Xie, J., & Zhu, M. (2019). Handcrafted features and late fusion with deep learning for bird sound classification. Ecological Informatics, 52, 74–81.

    Article  Google Scholar 

  • Zottesso, R. H., Costa, Y. M., Bertolini, D., & Oliveira, L. E. (2018). Bird species identification using spectrogram and dissimilarity approach. Ecological Informatics, 48, 187–197.

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the 111 Project. This work is also supported by Fundamental Research Funds for the Central Universities (Grant No: JUSRP11924) and Jiangsu Key Laboratory of Advanced Food Manufacturing Equipment & Technology (Grant No: FM-2019-06). This work is partially supported by National Natural Science Foundation of China (Grant No: 61902154). This work is also partially supported by Natural Science Foundation of Jiangsu Province (Grant No: BK2019043526). This work is also partially supported by Jiangsu province key research and development project—modern agriculture (Grant No: BE2018334).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jie Xie.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ji, X., Jiang, K. & Xie, J. LBP-based bird sound classification using improved feature selection algorithm. Int J Speech Technol 24, 1033–1045 (2021). https://doi.org/10.1007/s10772-021-09866-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-021-09866-4

Keywords

Navigation