Research Article

Predicting Co-Changed Files: An External, Conceptual Replication

Volume: 15 Number: 2 June 30, 2019
EN

Predicting Co-Changed Files: An External, Conceptual Replication

Abstract

A software project can be comprised of several, highly connected files. A software developer may not know the files that are connected to which are developed or that are changed by another developer. This may induce faults by missing necessary edits on all related files. We build a prediction model for identifying files that should be edited together during a code change, and evaluate the performance of our model on two Apache projects’ development history over more than 10 years. We conduct an external, conceptual replication study based on Wiese et al.'s prior work on predicting co-changed files. Our study shares the same goal but differentiates the experimental design in terms of data set construction, selection of file pairs, feature selection and the model output. Our prediction model’s results, although the same performance measures are used, are much lower than what is reported in Wiese et al.’s study, mainly due to the differences in calculating these measures. The models evaluated at commit granularity could achieve 20% and 45% lower recall and precision rates, respectively, than those aggregated over all file-pairs.  Although it is practically more useful, predicting all files that will be co-changed together during a commit is more challenging than predicting whether a particular file will be changed in that commit. More information about the context of a co-change, the degree of centrality of a file in the project, or project characteristics could reveal more insights in building such predictors in the future.

Keywords

References

  1. 1. Borg, M., Wnuk, K., Regnell, B., Runeson, P. 2017. Supporting change impact analysis using a recommendation system: an industrial case study in a safety- critical context. IEEE Transaction on Software Engineering, 43(3): 675-700.
  2. 2. Kagdi, H., Maletic, J.I. Combining Single-version and Evolutionary Dependencies for Software-change Prediction, proceedings of the Fourth International Workshop on Mining Software Repositories (MSR), Minneapolis, USA, 2007, pp 17.
  3. 3. Wiese, I.S., Ré, R., Steinmacher, I., Kuroda, R.T, Oliva, G.A., Treude C., Gerosa, M.A. 2017. Using contextual information to predict co-changes. Journal of Systems and Software, 128: 220-235.
  4. 4. Ball, T., Kim, J., Porter, A.A., Siy, H.P. If Your Version Control System Could Talk, proceedings of the ICSE Workshop on Process Modeling and Empirical Studies of Software Engineering, 1997.
  5. 5. Gall, H., Hajek K., Jazayeri, M. Detection of Logical Coupling Based on Product Release History, proceedings of the International Conference on Software Maintenance (ICSM), Washington, DC, USA, 1998, pp. 190-198.
  6. 6. Zimmermann, T., Weisgerber, P., Diehl, S., Zeller, A. Mining Version Histories to Guide Software Changes, proceedings of the 26th International Conference on Software Engineering (ICSE), Washington, DC, USA, 2004, pp. 563-572.
  7. 7. Canfora, G., Cerulo, L., Cimitile, M., Penta, M.D. 2014. How changes affect software entropy: an empirical study. Empirical Software Engineering, 19(1): 1-38.
  8. 8. Hassan, A.E., Holt, R.C. Predicting Change Propagation in Software Systems, proceedings of the 20th IEEE International Conference on Software Maintenance, Chicago, IL, USA, 2004, pp. 284-293.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Betül Romero This is me
Türkiye

Publication Date

June 30, 2019

Submission Date

November 28, 2018

Acceptance Date

May 13, 2019

Published in Issue

Year 2019 Volume: 15 Number: 2

APA
Tosun, A., & Romero, B. (2019). Predicting Co-Changed Files: An External, Conceptual Replication. Celal Bayar University Journal of Science, 15(2), 161-169. https://doi.org/10.18466/cbayarfbe.489291
AMA
1.Tosun A, Romero B. Predicting Co-Changed Files: An External, Conceptual Replication. CBUJOS. 2019;15(2):161-169. doi:10.18466/cbayarfbe.489291
Chicago
Tosun, Ayşe, and Betül Romero. 2019. “Predicting Co-Changed Files: An External, Conceptual Replication”. Celal Bayar University Journal of Science 15 (2): 161-69. https://doi.org/10.18466/cbayarfbe.489291.
EndNote
Tosun A, Romero B (June 1, 2019) Predicting Co-Changed Files: An External, Conceptual Replication. Celal Bayar University Journal of Science 15 2 161–169.
IEEE
[1]A. Tosun and B. Romero, “Predicting Co-Changed Files: An External, Conceptual Replication”, CBUJOS, vol. 15, no. 2, pp. 161–169, June 2019, doi: 10.18466/cbayarfbe.489291.
ISNAD
Tosun, Ayşe - Romero, Betül. “Predicting Co-Changed Files: An External, Conceptual Replication”. Celal Bayar University Journal of Science 15/2 (June 1, 2019): 161-169. https://doi.org/10.18466/cbayarfbe.489291.
JAMA
1.Tosun A, Romero B. Predicting Co-Changed Files: An External, Conceptual Replication. CBUJOS. 2019;15:161–169.
MLA
Tosun, Ayşe, and Betül Romero. “Predicting Co-Changed Files: An External, Conceptual Replication”. Celal Bayar University Journal of Science, vol. 15, no. 2, June 2019, pp. 161-9, doi:10.18466/cbayarfbe.489291.
Vancouver
1.Ayşe Tosun, Betül Romero. Predicting Co-Changed Files: An External, Conceptual Replication. CBUJOS. 2019 Jun. 1;15(2):161-9. doi:10.18466/cbayarfbe.489291