دانلود رایگان مقاله: اتمام نمونه برداری مبتنی بر یادگیری ماشین برای تشخیص گفتار خودکار با استفاده از گفتار زبان آشامی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6863220	677610	2016	26 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech

ترجمه فارسی عنوان

اتمام نمونه برداری مبتنی بر یادگیری ماشین برای تشخیص گفتار خودکار با استفاده از گفتار زبان آشامی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Deep neural network (DNN)Recurrent neural network (RNN)Automatic speech recognition (ASR)Multi Layer Perceptron (MLP)Artificial neural network (ANN) - شبکه عصبی مصنوعی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

اتمام نمونه برداری مبتنی بر یادگیری ماشین برای تشخیص گفتار خودکار با استفاده از گفتار زبان آشامی

چکیده انگلیسی

Automatic Speaker Recognition (ASR) and related issues are continuously evolving as inseparable elements of Human Computer Interaction (HCI). With assimilation of emerging concepts like big data and Internet of Things (IoT) as extended elements of HCI, ASR techniques are found to be passing through a paradigm shift. Oflate, learning based techniques have started to receive greater attention from research communities related to ASR owing to the fact that former possess natural ability to mimic biological behavior and that way aids ASR modeling and processing. The current learning based ASR techniques are found to be evolving further with incorporation of big data, IoT like concepts. Here, in this paper, we report certain approaches based on machine learning (ML) used for extraction of relevant samples from big data space and apply them for ASR using certain soft computing techniques for Assamese speech with dialectal variations. A class of ML techniques comprising of the basic Artificial Neural Network (ANN) in feedforward (FF) and Deep Neural Network (DNN) forms using raw speech, extracted features and frequency domain forms are considered. The Multi Layer Perceptron (MLP) is configured with inputs in several forms to learn class information obtained using clustering and manual labeling. DNNs are also used to extract specific sentence types. Initially, from a large storage, relevant samples are selected and assimilated. Next, a few conventional methods are used for feature extraction of a few selected types. The features comprise of both spectral and prosodic types. These are applied to Recurrent Neural Network (RNN) and Fully Focused Time Delay Neural Network (FFTDNN) structures to evaluate their performance in recognizing mood, dialect, speaker and gender variations in dialectal Assamese speech. The system is tested under several background noise conditions by considering the recognition rates (obtained using confusion matrices and manually) and computation time. It is found that the proposed ML based sentence extraction techniques and the composite feature set used with RNN as classifier outperform all other approaches. By using ANN in FF form as feature extractor, the performance of the system is evaluated and a comparison is made. Experimental results show that the application of big data samples has enhanced the learning of the ASR system. Further, the ANN based sample and feature extraction techniques are found to be efficient enough to enable application of ML techniques in big data aspects as part of ASR systems.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neural Networks - Volume 78, June 2016, Pages 97-111

نویسندگان

Swapna Agarwalla, Kandarpa Kumar Sarma,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : اتمام نمونه برداری مبتنی بر یادگیری ماشین برای تشخیص گفتار خودکار با استفاده از گفتار زبان آشامی

دسترسی سریع

ارتباط

English Website