Article ID Journal Published Year Pages File Type
484933 Procedia Computer Science 2015 7 Pages PDF
Abstract

Big Data is the buzz word doing rounds in all areas of human existence be medical, social networks, research, it has also made inroads to education. The large size and complexity of datasets in Big Data need specialized statistical tools for analysis where R can come handy. The Categorical component of any data set can be quantified using limited representations, but evaluating it with respect to the quantitative variables return a larger set of statistical inferences. This paper explores the analysis of categorical and quantitative variables scalable to Big Data in education using a contemporary statistical tool R. R provides multiple dimensions to statistical analysis of dataset, this paper however explores the statistical inference rendered using the Box Plot feature through summary measures of the dataset. These statistical inferences can be used to train a Machine for predictions and classification under a certain category.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science (General)