Article ID: 493032
Journal: Procedia Technology
Published Year: 2013
Pages: 8
File Type: PDF
Abstract

Many organisations have kept their data in separate, dispersed sets for many years, and this separation and dispersal has negative impacts on the organisation. The concept of data warehousing emerged in response: with a data warehouse, data from many different, disconnected systems can be modelled and integrated. Medical data are a case in point. They are typically not well integrated within the organisation, which makes analysis difficult. This research therefore has three objectives: 1) to identify the data warehouse design specifications for medical data, 2) to implement a data warehouse on two types of database using cardiovascular disease data, and 3) to develop a dashboard application for medical data analysis and modelling. During development, the best features of the data are selected and classified for the data warehouse. A cardiovascular disease dataset from the National Heart Institute (IJN) serves as the case study. The original dataset is stored in a Microsoft Excel spreadsheet, in its original form and without any processing. Pentaho, an ETL tool, is used to combine the various databases in the data integration process that creates the data warehouse. The suitability of the data warehouse development model is validated using two different databases. The work adopts Bill Inmon's approach to data warehousing and leads, through the ETL software, to a dashboard application.
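
The extract-transform-load flow described in the abstract can be illustrated with a minimal sketch. It is not the Pentaho workflow used in the study. It uses Python's standard `sqlite3` module, and the column names, the table name `patient_diagnosis`, and the sample rows are all hypothetical, synthetic stand-ins for the unprocessed spreadsheet export. The abstract does not name the two database types, so two separate in-memory SQLite databases stand in for them here.

```python
import sqlite3

# Synthetic rows standing in for the raw spreadsheet export.
# These are invented for illustration and are not real patient data.
RAW_ROWS = [
    {"patient_id": " P001 ", "sex": "m", "age": "54", "diagnosis": "Angina"},
    {"patient_id": "P002", "sex": "F", "age": "", "diagnosis": "myocardial infarction"},
    {"patient_id": "P001", "sex": "M", "age": "54", "diagnosis": "angina"},  # duplicate after cleaning
]


def transform(rows):
    """Trim whitespace, normalise case, convert types, and drop duplicates."""
    seen, clean = set(), []
    for row in rows:
        record = (
            row["patient_id"].strip(),
            row["sex"].strip().upper(),
            int(row["age"]) if row["age"].strip() else None,  # blank age becomes NULL
            row["diagnosis"].strip().lower(),
        )
        if record not in seen:
            seen.add(record)
            clean.append(record)
    return clean


def load(conn, records):
    """Create the (hypothetical) target table if needed and insert the records."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS patient_diagnosis "
        "(patient_id TEXT, sex TEXT, age INTEGER, diagnosis TEXT)"
    )
    conn.executemany("INSERT INTO patient_diagnosis VALUES (?, ?, ?, ?)", records)
    conn.commit()


def run_etl(rows, targets):
    """Transform once, then load the same cleaned records into every target."""
    records = transform(rows)
    for conn in targets:
        load(conn, records)
    return records


if __name__ == "__main__":
    # Two independent databases, mirroring the two-database validation idea.
    db_a = sqlite3.connect(":memory:")
    db_b = sqlite3.connect(":memory:")
    run_etl(RAW_ROWS, [db_a, db_b])
    print(db_a.execute("SELECT * FROM patient_diagnosis").fetchall())
```

Loading the same cleaned records into both targets makes it straightforward to compare the two stores with identical queries, which is one plausible way to check that a warehouse design behaves the same on different database back ends.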

Related Topics
Physical Sciences and Engineering > Computer Science > Computer Science (General)