کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4960435 1446499 2017 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Twitter Pornography Multilingual Content Identification Based on Machine Learning
ترجمه فارسی عنوان
توییتر پورنوگرافی چند زبانه شناسایی محتوا بر اساس آموزش ماشین
کلمات کلیدی
پورنوگرافی، توییتر، فراگیری ماشین،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی

Pornography on social media raises a lot of negative impact and affect the moral of children and teenagers. Social media used to spread pornography can have a negative impact. Thus, the spread of pornography on social media must be prevented. One of the social media which is often used as a medium pornography is Twitter. Pornography used on Twitter in the form of text and image. Among the two types of media, the text is very interesting to study because of the use of a variety of languages. In this study, the classification process will be conducted in Indonesian and English tweet and a combination of both languages. This classification uses three methods of machine learning, Decision Tree, Naive Bayes and Support Vector Machines for the purpose of comparing which method is the best in the classification process. In this study also conducted additional experiment was carried out with the aim of improving the performance in classification. The results showed that the level of accuracy is quite high. However, different grammar is a constraint that affects the accuracy of the results in the classification.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 116, 2017, Pages 129-136
نویسندگان
, , ,