کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
534604 870269 2013 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Overlapping sound event recognition using local spectrogram features and the generalised hough transform
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Overlapping sound event recognition using local spectrogram features and the generalised hough transform
چکیده انگلیسی


• Addressing the challenging task of simultaneous overlapping sound event recognition.
• A novel approach based on local features extracted from the spectrogram.
• Robust recognition is achieved by combining the independent local information.
• The Generalized Hough Transform is used to generate detection hypotheses.
• The method is inspired by human perception and research in image object recognition.

In this paper, we address the challenging task of simultaneous recognition of overlapping sound events from single channel audio. Conventional frame-based methods are not well suited to the problem, as each time frame contains a mixture of information from multiple sources. Missing feature masks are able to improve the recognition in such cases, but are limited by the accuracy of the mask, which is a non-trivial problem. In this paper, we propose an approach based on Local Spectrogram Features (LSFs) which represent local spectral information that is extracted from the two-dimensional region surrounding “keypoints” detected in the spectrogram. The keypoints are designed to locate the sparse, discriminative peaks in the spectrogram, such that we can model sound events through a set of representative LSF clusters and their occurrences in the spectrogram. To recognise overlapping sound events, we use a Generalised Hough Transform (GHT) voting system, which sums the information over many independent keypoints to produce onset hypotheses, that can detect any arbitrary combination of sound events in the spectrogram. Each hypothesis is then scored against the class distribution models to recognise the existence of the sound in the spectrogram. Experiments on a set of five overlapping sound events, in the presence of non-stationary background noise, demonstrate the potential of our approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 34, Issue 9, 1 July 2013, Pages 1085–1093
نویسندگان
, , ,