کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
382892 660796 2014 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Improving data partition schemes in Smart Grids via clustering data streams
ترجمه فارسی عنوان
بهبود برنامه های پارتیشن داده در شبکه های هوشمند از طریق خوشه بندی جریان داده ها
کلمات کلیدی
شبکه های هوشمند، پارتیشن داده یادگیری آنلاین، خوشه بندی جریان داده ها، سیستم طبقه بندی یادگیری
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• An online unsupervised LCS algorithm suitable for clustering problems is proposed.
• The problem of partitioning the data storage layer on a Smart Grid is explored.
• Data partitioning using online clustering boosts the storage layer scalability.
• The competence of this approach is assessed using synthetic and real data streams.

Data mining techniques are traditionally divided into two distinct disciplines depending on the task to be performed by the algorithm: supervised learning and unsupervised learning. While the former aims at making accurate predictions after deeming an underlying structure in data—which requires the presence of a teacher during the learning phase—the latter aims at discovering regular-occurring patterns beneath the data without making any a priori assumptions concerning their underlying structure. The pure supervised model can construct a very accurate predictive model from data streams. However, in many real-world problems this paradigm may be ill-suited due to (1) the dearth of training examples and (2) the costs of labeling the required information to train the system. A sound use case of this concern is found when defining data replication and partitioning policies to store data emerged in the Smart Grids domain in order to adapt electric networks to current application demands (e.g., real time consumption, network self adapting). As opposed to classic electrical architectures, Smart Grids encompass a fully distributed scheme with several diverse data generation sources. Current data storage and replication systems fail at both coping with such overwhelming amount of heterogeneous data and at satisfying the stringent requirements posed by this technology (i.e., dynamic nature of the physical resources, continuous flow of information and autonomous behavior demands). The purpose of this paper is to apply unsupervised learning techniques to enhance the performance of data storage in Smart Grids. More specifically we have improved the eXtended Classifier System for Clustering (XCSc) algorithm to present a hybrid system that mixes data replication and partitioning policies by means of an online clustering approach. Conducted experiments show that the proposed system outperforms previous proposals and truly fits with the Smart Grid premises.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 41, Issue 13, 1 October 2014, Pages 5832–5842
نویسندگان
, , , , , , ,