Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
484933 | Procedia Computer Science | 2015 | 7 Pages |
Big Data is the buzz word doing rounds in all areas of human existence be medical, social networks, research, it has also made inroads to education. The large size and complexity of datasets in Big Data need specialized statistical tools for analysis where R can come handy. The Categorical component of any data set can be quantified using limited representations, but evaluating it with respect to the quantitative variables return a larger set of statistical inferences. This paper explores the analysis of categorical and quantitative variables scalable to Big Data in education using a contemporary statistical tool R. R provides multiple dimensions to statistical analysis of dataset, this paper however explores the statistical inference rendered using the Box Plot feature through summary measures of the dataset. These statistical inferences can be used to train a Machine for predictions and classification under a certain category.