کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
395765 666012 2016 26 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Streaming data reduction using low-memory factored representations
ترجمه فارسی عنوان
کاهش داده های جریان داده با استفاده از بازنمودهای محاسباتی کم حافظه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Many special purpose algorithms exist for extracting information from streaming data. Constraints are imposed on the total memory and on the average processing time per data item. These constraints are usually satisfied by deciding in advance the kind of information one wishes to extract, and then extracting only the data relevant for that goal. Here, we propose a general data representation that can be computed using modest memory requirements with limited processing power per data item, and yet permits the application of an arbitrary data mining algorithm chosen and/or adjusted after the data collection process has begun. The new representation allows for the at-once analysis of a significantly larger number of data items than would be possible using the original representation of the data. The method depends on a rapid computation of a factored form of the original data set. The method is illustrated with two real datasets, one with dense and one with sparse attribute values.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 176, Issue 14, 22 July 2006, Pages 2016–2041
نویسندگان
, ,