Article ID Journal Published Year Pages File Type
10343169 Journal of Systems and Software 2012 10 Pages PDF
Abstract
► We evaluate Pig's ability to prepare data in a modular way by performing three large-scale MSR studies in detail. Our implementation can be reused by other MSR researchers. ► We compare the use of Pig and Hadoop for preparing data for MSR studies. ► We report the lessons learnt with Pig in order to assist other researchers who want to use Pig as a data preparation language in their MSR studies.
Related Topics
Physical Sciences and Engineering Computer Science Computer Networks and Communications
Authors
, , ,