کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
564849 | 875649 | 2013 | 12 صفحه PDF | دانلود رایگان |

A new perceptual audio hashing algorithm based on maximum-likelihood watermarking detection is proposed in this paper. The idea is justified by the fact that the maximum-likelihood watermark detector responds similarly to perceptually close audio using a non-embedded watermark (i.e. virtual watermark). The feature vector, which is composed of the total amplitude of low-order Zernike moments of each audio frame, is modeled by the Gaussian or Rayleigh distribution. Then, the maximum-likelihood watermark detection is performed on the feature vector with the virtual watermarks generated by pseudo-random number generator to construct the hash vector. Extensive experiments over three large audio databases of different type (speech, instrumental music, and sung voice) demonstrate the efficiency of the proposed scheme in terms of discrimination, perceptual robustness and identification rate. It is also verified that the proposed scheme outperforms state-of-the-art techniques in perceptual robustness and can be applied in content-based search, successfully.
Journal: Digital Signal Processing - Volume 23, Issue 4, July 2013, Pages 1216-1227