Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram

Article ID	Journal ID	Publication year	English article	Full-text version
533524	870124	2011	11-page PDF	Free download

Keywords

Feature extraction; Radon transform; Discrete cosine transform; Speaker recognition; Spectrogram

Related topics

Engineering and Basic Sciences; Computer Engineering; Computer Vision and Pattern Recognition

Abstract

This paper presents a new feature extraction technique for speaker recognition using the Radon transform (RT) and the discrete cosine transform (DCT). The spectrogram is a compact and efficient representation that carries information about acoustic features in the form of a pattern. In the proposed method, speaker-specific features are extracted by applying image processing techniques to the pattern available in the spectrogram. The Radon transform is used to derive effective acoustic features from the speech spectrogram: it adds up the pixel values in the given image along a straight line in a particular direction and at a specific displacement. The proposed technique computes Radon projections for seven orientations and captures the acoustic characteristics of the spectrogram. Applying the DCT to the Radon projections yields a low-dimensional feature vector. The technique is computationally efficient, text-independent, robust to session variations and insensitive to additive noise. The performance of the proposed algorithm has been evaluated using the Texas Instruments and Massachusetts Institute of Technology (TIMIT) database and the authors' own Shri Guru Gobind Singhji (SGGS) database. The recognition rate of the proposed algorithm is 96.69% on the TIMIT database (630 speakers) and 98.41% on the SGGS database (151 speakers). These results highlight the superiority of the proposed method over some of the existing algorithms.
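The pipeline described in the abstract (Radon projections of a spectrogram image, followed by a DCT of each projection) can be illustrated with a minimal sketch. This is not the authors' implementation. It rests on several assumptions: the seven orientations are taken as evenly spaced over [0°, 180°) because the abstract does not list them, the number of retained DCT coefficients (4) is arbitrary, and each line integral is approximated by assigning every pixel to its nearest displacement bin. The function names `radon_projection`, `dct2` and `radon_dct_features` are hypothetical.

```python
import math


def radon_projection(image, theta_deg):
    """Approximate one Radon projection of a 2-D list of pixel values.

    Each pixel is added to the bin whose displacement from the image
    centre is nearest to x*cos(theta) + y*sin(theta). This nearest-bin
    accumulation is a simplification of the true line integral.
    """
    rows, cols = len(image), len(image[0])
    cy, cx = (rows - 1) / 2.0, (cols - 1) / 2.0
    # Enough bins to cover the image diagonal on both sides of the centre.
    half = int(math.ceil(math.hypot(rows, cols) / 2.0))
    proj = [0.0] * (2 * half + 1)
    t = math.radians(theta_deg)
    c, s = math.cos(t), math.sin(t)
    for r in range(rows):
        for q in range(cols):
            d = (q - cx) * c + (r - cy) * s
            proj[int(round(d)) + half] += image[r][q]
    return proj


def dct2(signal):
    """Unnormalised DCT-II of a 1-D sequence."""
    n = len(signal)
    return [
        sum(signal[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        for k in range(n)
    ]


def radon_dct_features(image, n_angles=7, n_coeffs=4):
    """Concatenate the first n_coeffs DCT coefficients of each projection.

    Assumption: angles are evenly spaced over [0, 180) degrees; the
    source does not state which seven orientations are used.
    """
    features = []
    for a in range(n_angles):
        theta = 180.0 * a / n_angles
        features.extend(dct2(radon_projection(image, theta))[:n_coeffs])
    return features


# Toy 3x3 "spectrogram" standing in for a real one.
toy = [[1.0, 2.0, 0.0],
       [0.0, 3.0, 1.0],
       [4.0, 0.0, 2.0]]
features = radon_dct_features(toy)  # 7 angles x 4 coefficients
```

Keeping only the leading DCT coefficients of each projection is what makes the resulting feature vector low-dimensional. In practice a library routine for the Radon transform with proper interpolation would likely replace the nearest-bin loop used here.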

► Speaker recognition based on a pattern recognition approach.
► Text-independent.
► Channel and session invariant technique.
► Insensitive to additive noise.

Publisher

Database: Elsevier - ScienceDirect
Journal: Pattern Recognition - Volume 44, Issues 10–11, October–November 2011, Pages 2749–2759

Authors

Pawan K. Ajmera, Dattatray V. Jadhav, Raghunath S. Holambe
