کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515476 867023 2015 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Active learning for sentiment analysis on data streams: Methodology and workflow implementation in the ClowdFlows platform
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Active learning for sentiment analysis on data streams: Methodology and workflow implementation in the ClowdFlows platform
چکیده انگلیسی


• We present a cloud based platform for data stream processing with workflows.
• The ClowdFlows platform enables processing of multiple concurrent data streams.
• We implement an active learning scenario for sentiment analysis on data streams.
• Machine learning methods are shown to be suitable for sentiment analysis.
• Active learning improves the accuracy of sentiment classification.

Sentiment analysis from data streams is aimed at detecting authors’ attitude, emotions and opinions from texts in real-time. To reduce the labeling effort needed in the data collection phase, active learning is often applied in streaming scenarios, where a learning algorithm is allowed to select new examples to be manually labeled in order to improve the learner’s performance. Even though there are many on-line platforms which perform sentiment analysis, there is no publicly available interactive on-line platform for dynamic adaptive sentiment analysis, which would be able to handle changes in data streams and adapt its behavior over time. This paper describes ClowdFlows, a cloud-based scientific workflow platform, and its extensions enabling the analysis of data streams and active learning. Moreover, by utilizing the data and workflow sharing in ClowdFlows, the labeling of examples can be distributed through crowdsourcing. The advanced features of ClowdFlows are demonstrated on a sentiment analysis use case, using active learning with a linear Support Vector Machine for learning sentiment classification models to be applied to microblogging data streams.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 51, Issue 2, March 2015, Pages 187–203
نویسندگان
, , , , , ,