Article code: 1109632
Journal code: 1488347
Publication year: 2015
English article: 7-page PDF
Full-text version: free download
English title of the ISI article
Stylistic Authorship Comparison and Attribution of Spanish News Forum Messages Based on the TreeTagger POS Tagger
Related topics
Social Sciences and Humanities > Arts and Humanities > Arts and Humanities (General)
English abstract

Electronic texts from emails, social networks, or mobile phones are currently of interest in forensic linguistics. Many of the texts analyzed are well under 200 words long. This work aims to identify text authorship by using part-of-speech tags over short texts. Our corpus consists of 28 texts taken from forum messages. The tokens of our corpus were annotated with part-of-speech (POS) tags provided by TreeTagger. A frequency vector based on POS features was created, and the Euclidean distance between texts was calculated. Results show that 10 out of the 14 test texts (71.4%) were correctly assigned to their author.
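
The approach summarized in the abstract (POS frequency vectors compared by Euclidean distance) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which features were used, whether frequencies were normalized, or how a test text was matched to an author, so the relative-frequency normalization and the nearest-text assignment rule below are assumptions. The function names, the tag labels, and the toy data are hypothetical, and the tags are assumed to have been produced beforehand by a tagger such as TreeTagger.

```python
import math
from collections import Counter


def pos_frequency_vector(tags, tagset):
    """Relative frequency of each tag in `tagset` for one text (assumed normalization)."""
    counts = Counter(tags)
    total = len(tags)
    if total == 0:
        return [0.0] * len(tagset)
    return [counts[tag] / total for tag in tagset]


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def attribute(test_tags, known_texts, tagset):
    """Return the author whose known text lies closest to the test text (assumed rule)."""
    test_vec = pos_frequency_vector(test_tags, tagset)
    return min(
        known_texts,
        key=lambda author: euclidean_distance(
            test_vec, pos_frequency_vector(known_texts[author], tagset)
        ),
    )


# Hypothetical toy data: tag labels are placeholders, not the TreeTagger Spanish tagset.
TAGSET = ["NOUN", "VERB", "ADJ", "DET"]
known = {
    "author_a": ["NOUN", "NOUN", "DET", "VERB"],
    "author_b": ["ADJ", "ADJ", "VERB", "VERB"],
}
test = ["NOUN", "DET", "NOUN", "NOUN"]

predicted = attribute(test, known, TAGSET)
print(predicted)
```

With very short texts, relative frequencies keep texts of different lengths comparable, which is why the sketch normalizes by token count. Raw counts or a different distance measure would be equally plausible readings of the abstract.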

Publisher
Database: Elsevier - ScienceDirect
Journal: Procedia - Social and Behavioral Sciences - Volume 212, 2 December 2015, Pages 198-204