Article code | Journal code | Publication year | English article | Full-text version |
---|---|---|---|---|
453497 | 694941 | 2013 | 10-page PDF | Free download |

In this work, we propose the use of the Internet as a rich source of knowledge to generate learning corpora for the task of semantic tagging of videos using their Automatic Speech Recognition (ASR) transcripts. We have applied supervised and non-supervised strategies using two recent frameworks related to this task. The obtained results show that, on the one hand, the integration of knowledge from web resources can be useful to generate learning corpora for this task and, on the other hand, the size of the learning corpora should be taken into account in deciding which approach to apply.
► Using web resources is a good strategy to generate corpora for video categorization.
► Google worked better than blogosphere or Wikipedia for the two frameworks tested.
► The corpora size should be taken into account in deciding which approach to apply.
Journal: Computer Standards & Interfaces - Volume 35, Issue 5, September 2013, Pages 519–528