Article ID Journal Published Year Pages File Type
459249 Journal of Systems and Software 2016 18 Pages PDF
Abstract

•Study on cost-effective strategies for regression testing of database applications.•Pair-wise combinatorial tests are cost-effective for database regression testing.•Omitting tests on production data could see important regression faults slip through.•Tests on production data are complementary to combinatorial tests on synthesized data.

Testing and, more specifically, the regression testing of database applications is highly challenging and costly. One can rely on production data or generate synthetic data, for example based on combinatorial techniques or operational profiles. Both approaches have drawbacks and advantages. Automating testing with production data is impractical and combinatorial test suites might not be representative of system operations.In this paper, based on a large scale case study in a representative development environment, we explore the cost and effectiveness of various approaches and their combination for the regression testing of database applications, based on production data and synthetic data generated through classification tree models of the input domain.The results confirm that combinatorial test suite specifications bear little relation to test suite specifications derived from the system operational profile. Nevertheless, combinatorial testing strategies are effective, both in terms of the number of regression faults discovered but also, more surprisingly, in terms of the importance of these faults. However, our study also shows that relying solely on synthesized test data derived from test models could lead to important faults slipping to production. Thus, we recommend that testing on production data and combinatorial testing be combined to achieve optimal results.

Related Topics
Physical Sciences and Engineering Computer Science Computer Networks and Communications
Authors
, ,