کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558315 874897 2013 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Speaker verification in score-ageing-quality classification space
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Speaker verification in score-ageing-quality classification space
چکیده انگلیسی

A challenge in automatic speaker verification is to create a system that is robust to the effects of vocal ageing. To observe the ageing effect, a speaker's voice must be analysed over a period of time, over which, variation in the quality of the voice samples is likely to be encountered. Thus, in dealing with the ageing problem, the related issue of quality must also be addressed. We present a solution to speaker verification across ageing by using a stacked classifier framework to combine ageing and quality information with the scores of a baseline classifier. In tandem, the Trinity College Dublin Speaker Ageing database of 18 speakers, each covering a 30–60 year time range, is presented. An evaluation of a baseline Gaussian Mixture Model–Universal Background Model (GMM–UBM) system using this database demonstrates a progressive degradation in genuine speaker verification scores as ageing progresses. Consequently, applying a conventional threshold, determined using scores at the time of enrolment, results in poor long-term performance. The influence of quality on verification scores is investigated via a number of quality measures. Alongside established signal-based measures, a new model-based measure, Wnorm, is proposed, and its utility is demonstrated on the CSLU database. Combining ageing information with quality measures and the scores from the GMM–UBM system, a verification decision boundary is created in score-ageing-quality space. The best performance is achieved by using scores and ageing in conjunction with the new Wnorm quality measure, reducing verification error by 45% relative to the baseline. This work represents the first comprehensive analysis of speaker verification on a longitudinal speaker database and successfully addresses the associated variability from ageing and quality arte-facts.


► A speaker ageing database of 18 adults across a 30–60 year time lapse is presented.
► A speaker verification evaluation of this ageing data results in a high error rate.
► The dependency between verification score and ageing progression is analysed.
► Verification score is shown to be correlated with measures of recording quality.
► A score-ageing-quality decision boundary improves significantly over the baseline.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 5, August 2013, Pages 1068–1084
نویسندگان
, , ,