• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Dual Task Deep Learning Framework for Automatic Recognition of Avian Species and Song Identification


Abstract

Accurate identification of bird species and their songs is essential for avian biodiversity research, ecological monitoring, and conservation. However, traditional methods often struggle with limited feature representations and fail to capture complex temporal and spectral patterns in bird vocalizations. This paper proposes a Dual-Task Deep Learning (DTDL) framework for simultaneous bird species and song identification. The system utilizes a curated dataset comprising recordings from seven bird species, each with three distinct vocalizations. Preprocessing involves noise filtering, silence removal, and signal normalization. A novel Adaptive Two-Region Piecewise Uniform Quantizer (ATR-PWUQ), driven by a Deep Neural Network (DNN), adaptively adjusts decision thresholds to enhance signal representation. Feature extraction combines handcrafted audio features (e.g., MFCCs, time-frequency, harmonic) and deep features derived from Mel-spectrograms using the SqueezeNet model. These features are then classified using a hybrid Bidirectional Long Short-Term Memory with Stacked Gated Recurrent Units (Bi-LSTM-SGRU) for species identification and an Attention-based Deep Belief Network (A-DBN) for song classification. The Bi-LSTM-SGRU model achieves a bird species identification accuracy of 98.11%, while the A-DBN model achieves a song recognition accuracy of 98.98%. The system significantly outperforms existing models across all evaluation metrics. The proposed DTDL framework demonstrates high robustness and precision in avian acoustic analysis, making it a promising tool for real-world ecological applications and automated biodiversity monitoring.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
A. L and G. V. S, "Dual Task Deep Learning Framework for Automatic Recognition of Avian Species and Song Identification," KSII Transactions on Internet and Information Systems, vol. 20, no. 1, pp. 589-608, 2026. DOI: 10.3837/tiis.2026.01.026.

[ACM Style]
Anju L and Ganesh Vaidyanathan S. 2026. Dual Task Deep Learning Framework for Automatic Recognition of Avian Species and Song Identification. KSII Transactions on Internet and Information Systems, 20, 1, (2026), 589-608. DOI: 10.3837/tiis.2026.01.026.

[BibTeX Style]
@article{tiis:105671, title="Dual Task Deep Learning Framework for Automatic Recognition of Avian Species and Song Identification", author="Anju L and Ganesh Vaidyanathan S and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2026.01.026}, volume={20}, number={1}, year="2026", month={January}, pages={589-608}}