Article code: 4374822
Journal code: 1303220
Publication year: 2015
English article: 12-page PDF
Full-text version: free download
English title of the ISI article
Unsupervised dictionary extraction of bird vocalisations and new tools on assessing and visualising bird activity
Persian translation of the title
نگهداری از فرهنگ لغت نویسی آواز پرنده و ابزارهای جدید برای ارزیابی و تجسم فعالیت پرندگان
Related topics
Life Sciences and Biotechnology > Agricultural and Biological Sciences > Ecology, Evolution, Behavior and Systematics
English abstract


• We report on unsupervised extraction of a dictionary of bird vocalisations.
• We examine a multiple-templates approach that jointly interprets an audio scene containing bird vocalisations.
• We classify an unknown and time-varying number of birds drawn from a known list of species.
• We show new graphical tools for assessing vocalising birds in a large number of recordings.

A broad range of organisations and individuals are collecting wildlife audio recordings. Huge amounts of audio data have been gathered in the past, and since the popularisation of automatic recording units the data have been piling up exponentially. The point of gathering them is to analyse them, evaluate insights and hypotheses, identify patterns of activity that are otherwise not apparent, and finally design policies on biodiversity issues. For massive volumes of data, even visual inspection of spectrograms is infeasible, and interesting cases that could provide valuable insight for concrete hypotheses on biodiversity status can go unnoticed. In this paper we investigate a range of techniques that work with minor human supervision. These techniques construct a dictionary of templates extracted in an unsupervised way from reference recordings and then crawl over a large number of recordings to examine the underlying bioacoustic activity. This work is general and we have applied it to many datasets of animal vocalisations (e.g. cetaceans, mice, birds). To test our tools objectively, and for the sake of reproducibility, in this work we report on the MLSP 2013 bird dataset, which was recently released publicly along with all its annotations. We are not interested in which approach scores best on this dataset. Our aim is to describe novel machine learning tools that try to refine our understanding of biodiversity by answering questions such as: Is the recording under examination void of bird vocalisations or not? If there is bird activity, how many different species are in the recording? What are the most important characteristic spectral segments for recognising a specific species? The dataset is nevertheless valuable to us for quantifying our findings.
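The idea of crawling over recordings with a dictionary of templates can be illustrated with a minimal sketch. The abstract does not state how templates are matched, so normalised cross-correlation along the time axis is assumed here purely for illustration. The function names (`ncc_scores`, `detect`), the 0.7 threshold, and the synthetic spectrogram are all hypothetical and are not taken from the paper.

```python
import numpy as np


def ncc_scores(spectrogram: np.ndarray, template: np.ndarray) -> np.ndarray:
    """Slide a template along the time axis and return one normalised
    cross-correlation score (in [-1, 1]) per time offset.

    Both arrays have shape (frequency_bins, time_frames) and must share
    the same number of frequency bins.
    """
    n_freq, t_len = template.shape
    if spectrogram.shape[0] != n_freq:
        raise ValueError("template and spectrogram must share frequency bins")
    n_offsets = spectrogram.shape[1] - t_len + 1
    if n_offsets <= 0:
        raise ValueError("template is longer than the spectrogram")

    t = template - template.mean()
    t_norm = np.linalg.norm(t)
    scores = np.zeros(n_offsets)
    for i in range(n_offsets):
        patch = spectrogram[:, i:i + t_len]
        p = patch - patch.mean()
        denom = t_norm * np.linalg.norm(p)
        # A flat patch or flat template has no defined correlation; score it 0.
        scores[i] = (t * p).sum() / denom if denom > 0 else 0.0
    return scores


def detect(spectrogram: np.ndarray, dictionary: dict, threshold: float = 0.7) -> dict:
    """Return {template_name: peak_score} for every template whose best
    match in the spectrogram reaches the (assumed) threshold."""
    hits = {}
    for name, template in dictionary.items():
        peak = float(ncc_scores(spectrogram, template).max())
        if peak >= threshold:
            hits[name] = peak
    return hits


# Synthetic demonstration: low-level noise with one embedded "call".
rng = np.random.default_rng(0)
spectrogram = rng.random((16, 200)) * 0.1
call = rng.random((16, 12))
spectrogram[:, 50:62] += call  # embed the call at time frame 50

dictionary = {
    "species_a": call,                  # template present in the recording
    "species_b": rng.random((16, 12)),  # unrelated template
}
hits = detect(spectrogram, dictionary)
best_offset = int(np.argmax(ncc_scores(spectrogram, call)))
```

Under these assumptions, an empty `hits` dictionary corresponds to a recording with no detected bird activity, and the number of distinct template names in `hits` gives a rough count of the species present.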

Publisher
Database: Elsevier - ScienceDirect
Journal: Ecological Informatics - Volume 26, Part 3, March 2015, Pages 6–17
Authors