کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
554712 1451072 2015 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Multidimensional analysis model for a document warehouse that includes textual measures
ترجمه فارسی عنوان
یک مدل تحلیل چند بعدی برای یک انبار سند که شامل ابعاد متنی است
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر سیستم های اطلاعاتی
چکیده انگلیسی


• A new multidimensional model that integrates text based on three textual measures.
• The granularity of proposed model is at document level.
• The model allows getting topics according to dimensions implied in the query.
• The model allows getting documents according to dimensions implied in the query.
• The model allows getting the words or terms for each topic.

Data warehouses and On-Line Analytical Processing tools, OLAP, together permit a multi-dimensional analysis of structured data information. However, as business systems are increasingly required to handle substantial quantities of unstructured textual information, the need arises for an effective and similar means of analysis. To manage unstructured text data stored in data warehouses, a new multi-dimensional analysis model is proposed that includes textual measures as well as a topic hierarchy. In this model, the textual measures that associate the topics with the text documents are generated by Probabilistic Latent Semantic Analysis, while the hierarchy is created automatically using a clustering algorithm. Documents are then able to be queried using OLAP tools. The model was evaluated from two viewpoints — query execution time and user satisfaction. Evaluation of execution time was carried out on scientific articles using two query types and user satisfaction (with query time and ease of use) using statistical frequency and multivariate analyses. Encouraging observations included that as the number of documents increases, query time increases as a lineal, rather than exponential tendency. In addition, the model gained an increasing acceptance with use, while the visualization of the model was also well received by users.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Decision Support Systems - Volume 72, April 2015, Pages 44–59
نویسندگان
, , , , ,