Article code: 557937
Journal code: 874817
Publication year: 2011
English article: 34-page PDF
Full-text version: free download
English title of the ISI article
Turn-taking cues in task-oriented dialogue
Related topics
Engineering and Basic Sciences, Computer Engineering, Signal Processing
English abstract

As interactive voice response systems become more prevalent and provide increasingly complex functionality, it becomes clear that the challenges facing such systems are not solely in their synthesis and recognition capabilities. Issues such as the coordination of turn exchanges between system and user also play an important role in system usability. In particular, both systems and users have difficulty determining when the other is taking or relinquishing the turn. In this paper, we seek to identify turn-taking cues correlated with human–human turn exchanges which are automatically computable. We compare the presence of potential prosodic, acoustic, and lexico-syntactic turn-yielding cues in prosodic phrases preceding turn changes (smooth switches) vs. turn retentions (holds) vs. backchannels in the Columbia Games Corpus, a large corpus of task-oriented dialogues, to determine which features reliably distinguish between these three. We identify seven turn-yielding cues, all of which can be extracted automatically, for future use in turn generation and recognition in interactive voice response (IVR) systems. Testing Duncan’s (1972) hypothesis that these turn-yielding cues are linearly correlated with the occurrence of turn-taking attempts, we further demonstrate that the greater the number of turn-yielding cues that are present, the greater the likelihood that a turn change will occur. We also identify six cues that precede backchannels, which will also be useful for IVR backchannel generation and recognition; these cues correlate with backchannel occurrence in a quadratic manner. We find similar results for overlapping and for non-overlapping speech.

Research highlights
▶ Seven turn-yielding cues precede turn changes in spontaneous task-oriented dialogue.
▶ Cues are prosodic, acoustic, and lexico-syntactic events.
▶ Cues are linearly correlated with the occurrence of turn-taking attempts.
▶ Six backchannel-inviting cues precede the occurrence of a backchannel.
▶ Results will be useful for turn management in future IVR systems.
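
The linear relationship between the number of cues present and the likelihood of a turn change can be illustrated with a minimal sketch. This is not the authors' procedure: the function names, the data layout (one pair of cue count and outcome per phrase), and the toy observations are all assumptions made for illustration, and the numbers are invented rather than taken from the Columbia Games Corpus. The sketch groups phrases by how many cues they carry, computes the fraction followed by a turn change, and fits a least-squares line to those fractions.

```python
from collections import defaultdict


def rate_by_cue_count(observations):
    """Map each cue count to the fraction of phrases followed by a turn change.

    observations: iterable of (num_cues, turn_changed) pairs, one per phrase.
    This layout is a hypothetical choice, not the paper's data format.
    """
    totals = defaultdict(int)
    changes = defaultdict(int)
    for num_cues, turn_changed in observations:
        totals[num_cues] += 1
        if turn_changed:
            changes[num_cues] += 1
    return {k: changes[k] / totals[k] for k in sorted(totals)}


def linear_fit(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept, plus r squared."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    syy = sum((y - mean_y) ** 2 for y in ys)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # If all rates are identical, the line fits trivially.
    r_squared = (sxy * sxy) / (sxx * syy) if syy else 1.0
    return slope, intercept, r_squared


# Invented toy observations (NOT corpus data):
# (number of cues present, whether a turn change followed)
toy = [(0, False), (0, False), (1, False), (1, True), (2, True), (2, True)]

rates = rate_by_cue_count(toy)
slope, intercept, r2 = linear_fit(list(rates), list(rates.values()))
print(rates, slope, intercept, r2)
```

A positive slope with a high r squared on real data would be consistent with the linear trend reported in the abstract; for the quadratic relationship reported for backchannel-inviting cues, a second-degree fit would take the place of `linear_fit`.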

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 25, Issue 3, July 2011, Pages 601–634