Journal of Dali University ›› 2025, Vol. 10 ›› Issue (6): 38-46.

Previous Articles     Next Articles

A Multi-Feature Fusion Bird Song Recognition Method Based on MobileViT

  

  1. (1. College of Engineering, Dali University, Dali, Yunnan 671003, China; 2. Institute of Eastern-Himalaya Biodiversity Research,
    Dali University, Dali, Yunnan 671003, China; 3. State Key Laboratory of Regional and Urban Ecology, Research Center for
    Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China)
  • Online:2025-06-15 Published:2025-06-24

Abstract: Objective: Monitoring bird diversity is a major challenge, as birds are more easily heard rather than seen. To improve the
accuracy of bird song recognition, a multi-feature fusion bird song recognition method based on MobileViT is proposed. Methods:
Using the Beijing BirdsData database as the research object, three different spectrogram sample sets were extracted from preprocessed bird song signals and used as inputs to train three single feature models based on MobileViT. Finally, adaptive weighted feature fusion on the three single feature models was performed. Results: The recognition accuracy of the multi-feature fusion model on the spectrogram test set reached 97.57%, with an improvement of 1.77%-2.24% compared to the single feature model. Conclusion: Different spectrograms extracted from the same bird song signal exhibit differences in the characteristics of bird song, and multifeature fusion models can learn more extensive information from them, thereby significantly improving recognition accuracy.

Key words: bird song recognition, MobileViT, multi-feature fusion

CLC Number: