Transformer Models improve the acoustic recognition of buzz-pollinating bee species

dc.creatorFerreira, Alef Iury Siqueira
dc.creatorSilva, Nadia Felix Felipe da
dc.creatorMesquita, Fernanda Neiva
dc.creatorRosa, Thierson Couto
dc.creatorBuchmann, Stephen L.
dc.creatorMesquita Neto, José Neiva
dc.date.accessioned2026-03-03T21:07:02Z
dc.date.available2026-03-03T21:07:02Z
dc.date.issued2025
dc.description.abstractBuzz-pollinated crops, such as tomatoes, potatoes, kiwifruit, and blueberries, are among the highest-yielding agricultural products. The flowers of these cultivated plants are characterized by having a specialized flower morphology with poricidal anthers that require vibration to achieve a full seed set. At least 446 bee species, in 82 genera, use floral sonication (buzz pollination) to collect pollen grains as food. Identifying and classifying these diverse often look-alike bee species poses a challenge for taxonomists. Automated classification systems, based upon audible bee floral buzzes, have been investigated to meet this need. Recently, convolutional neural network (CNN) models have demonstrated superior performance in recognizing and distinguishing bee-buzzing sounds compared to classical Machine-Learning (ML) classifiers. Nonetheless, the performance of CNNs remains unsatisfactory and can be improved. Therefore, we applied a novel transformer-based neural network architecture for the task of acoustic recognition of blueberry-pollinating bee species. We further compared the performance of the Audio Spectrogram Transformer (AST) model and its variants, including Self-Supervised AST (SSAST) and Masked Autoencoding AST (MAE-AST), to that of strong baseline CNN models based on previous work, at the task of bee species recognition. We also employed data augmentation techniques and evaluated these models with a data set of bee sounds recorded during visits to blueberry flowers in Chile (518 audio samples of 15 bee species). Our results revealed that Transformer-based Neural Networks combined with pre-training and data augmentation outperformed CNN models (maximum F1-score: 64.5% ± 2; Accuracy: 82.2% ± 0.8). These innovative attention-based neural network architectures have demonstrated exceptional performance in assigning bee buzzing sounds to their respective taxonomic categories, outperforming prior deep learning models. However, transformer approaches face challenges related to small dataset size and class imbalance, similar to CNNs and classical ML algorithms. Combining pre-training with data augmentation is crucial to increase the diversity and robustness of training data sets for the acoustic recognition of bee species. We document the potential of transformer architectures to improve the performance of audible bee species identification, offering promising new avenues for bioacoustic research and pollination ecology.
dc.identifier.citationFERREIRA, Alef Iury Siqueira et al. Transformer Models improve the acoustic recognition of buzz-pollinating bee species. Ecological Informatics, Amsterdam, v. 86, p. 103010, 2025. DOI: 10.1016/j.ecoinf.2025.103010. Disponível em: https://www.sciencedirect.com/science/article/pii/S1574954125000196. Acesso em: 17 fev. 2026.
dc.identifier.doi10.1016/j.ecoinf.2025.103010
dc.identifier.issn1574-9541
dc.identifier.issne- 1878-0512
dc.identifier.urihttps://repositorio.bc.ufg.br//handle/ri/29812
dc.language.isoeng
dc.publisher.countryHolanda
dc.publisher.departmentInstituto de Informática - INF (RMG)
dc.rightsAcesso Aberto
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectBuzz-pollinated crops
dc.subjectEcosystem services
dc.subjectCrop pollination
dc.subjectDeep learning
dc.titleTransformer Models improve the acoustic recognition of buzz-pollinating bee species
dc.typeArtigo

Arquivos

Pacote Original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
Artigo - Alef Iury Siqueira Ferreira - 2025.pdf
Tamanho:
2.15 MB
Formato:
Adobe Portable Document Format

Licença do Pacote

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
license.txt
Tamanho:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descrição: