کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
559008 875029 2015 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Emotion transplantation through adaptation in HMM-based speech synthesis
ترجمه فارسی عنوان
پیوند عاطفی از طریق انطباق در سنتز سخنرانی مبتنی بر HMM
کلمات کلیدی
سنتز گفتاری پارامتریک گفتار؛سنتز سخنرانی فوری؛سازگاری با شکل آبشاری؛پیوند عاطفی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• We propose an emotion transplantation method based on adaptation techniques.
• Emotions can be imbued into neutral synthetic speech models regardless of gender.
• Five perceptual evaluations, including one with a robot, were carried out.
• Emotion transplantation clearly improves emotional performance over neutral voices.
• High quality source models provide high quality transplanted models.

This paper proposes an emotion transplantation method capable of modifying a synthetic speech model through the use of CSMAPLR adaptation in order to incorporate emotional information learned from a different speaker model while maintaining the identity of the original speaker as much as possible. The proposed method relies on learning both emotional and speaker identity information by means of their adaptation function from an average voice model, and combining them into a single cascade transform capable of imbuing the desired emotion into the target speaker. This method is then applied to the task of transplanting four emotions (anger, happiness, sadness and surprise) into 3 male speakers and 3 female speakers and evaluated in a number of perceptual tests. The results of the evaluations show how the perceived naturalness for emotional text significantly favors the use of the proposed transplanted emotional speech synthesis when compared to traditional neutral speech synthesis, evidenced by a big increase in the perceived emotional strength of the synthesized utterances at a slight cost in speech quality. A final evaluation with a robotic laboratory assistant application shows how by using emotional speech we can significantly increase the students’ satisfaction with the dialog system, proving how the proposed emotion transplantation system provides benefits in real applications.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 34, Issue 1, November 2015, Pages 292–307
نویسندگان
, , , , , ,