Article ID Journal Published Year Pages File Type
382452 Expert Systems with Applications 2016 10 Pages PDF
Abstract

•We propose an approach to predicting co-changing code volume.•The success rate of co-changing methods identification is 82%.•Co-changing code volume is predicted using a derived regression line.•MAE of predictions is 95.3% less than the one of a naive approach.

Code smells are symptoms in the source code that indicate possible deeper problems and may serve as drivers for code refactoring. Although effort has been made on identifying divergent changes and shotgun surgeries, little emphasis has been put on predicting the volume of co-changing code that appears in the code smells. More specifically, when a software developer intends to perform a particular modification task on a method, a predicted volume of code that will potentially be co-changed with the method could be considered as significant information for estimating the modification effort. In this paper, we propose an approach to predicting volume of co-changing code affected by a method to be modified. The approach has the following key features: co-changing methods can be identified for detecting divergent changes and shotgun surgeries based on association rules mined from change histories; and volume of co-changing code affected by a method to be modified can be predicted through a derived fitted regression line with t-test based on the co-changing methods identification results. The experimental results show that the success rate of co-changing methods identification is 82% with a suggested threshold, and the numbers of correct identifications would not be influenced by the increasing number of commits as a project continuously evolves. Additionally, the mean absolute error of co-changing code volume predictions is 133 lines of code which is 95.3% less than the one of a naive approach.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,