کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6950547 1451616 2014 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The BBC World Service Archive prototype
ترجمه فارسی عنوان
نمونه اولیه بایگانی سرویس جهانی بی سی
کلمات کلیدی
برون سپاری، وب معنایی، برچسب زدن اتوماتیک، شناسایی بلندگو، ارتباط بین، بایگانی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر سیستم های اطلاعاتی
چکیده انگلیسی
Most broadcasters have accumulated large audio and video archives stretching back over many decades. For example the BBC World Service radio archive includes around 70,000 English-language programmes from over 45 years. This amounts to about three years of continuous audio and around 15 TB of data. The metadata around this archive is sparse and sometimes wrong, but the full audio content is available in digital form. We have built a system to process the existing audio and text and automatically annotate programmes within the archive with Linked Data web identifiers. The resulting interlinks are used to bootstrap search and navigation within this archive and expose it to users. Automated data will never be entirely accurate so we built crowdsourcing mechanisms for users to correct and add data. The resulting crowdsourced data is then used to improve search and navigation within the archive, as well as evaluate and improve our algorithms. As a result of this feedback cycle, the interlinks between our archive and the Semantic Web are continuously improving. This unique combination of Semantic Web technologies, automation and crowdsourcing has dramatically reduced the amount of time and effort required to publish this rich archive online. The BBC World Service archive prototype is available online at http://worldservice.prototyping.bbc.co.uk, last accessed March 2014.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Web Semantics: Science, Services and Agents on the World Wide Web - Volumes 27–28, August–October 2014, Pages 2-9
نویسندگان
, , , ,