Building an Ensemble for Software Defect Prediction Based on Diversity Selection

Associated organisational units

Electronic data

ESEM2016_paper_157
Rights statement: © ACM, 2016. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement http://dx.doi.org/10.1145/2961111.2962610
Accepted author manuscript, 297 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.1145/2961111.2962610
Final published version

Keywords

diversity, ensembles of learning machines, Software defect prediction, software faults, stacking

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

Building an Ensemble for Software Defect Prediction Based on Diversity Selection. / Petrić, Jean ; Bowes, David ; Hall, Tracy et al.
ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. New York: Association for Computing Machinery, Inc, 2016. 46.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Petrić, J , Bowes, D , Hall, T, Christianson, B & Baddoo, N 2016, Building an Ensemble for Software Defect Prediction Based on Diversity Selection. in ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement., 46, Association for Computing Machinery, Inc, New York, 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2016, Ciudad Real, Spain, 8/09/16. https://doi.org/10.1145/2961111.2962610

APA

Petrić, J., Bowes, D., Hall, T., Christianson, B., & Baddoo, N. (2016). Building an Ensemble for Software Defect Prediction Based on Diversity Selection. In ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement Article 46 Association for Computing Machinery, Inc. https://doi.org/10.1145/2961111.2962610

Vancouver

Petrić J , Bowes D , Hall T, Christianson B, Baddoo N. Building an Ensemble for Software Defect Prediction Based on Diversity Selection. In ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. New York: Association for Computing Machinery, Inc. 2016. 46 doi: 10.1145/2961111.2962610

Author

Petrić, Jean ; Bowes, David ; Hall, Tracy et al. / Building an Ensemble for Software Defect Prediction Based on Diversity Selection. ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. New York : Association for Computing Machinery, Inc, 2016.

Bibtex

@inproceedings{ae9bc1da5f934423ba077a96c34b03d3,

title = "Building an Ensemble for Software Defect Prediction Based on Diversity Selection",

abstract = "Background: Ensemble techniques have gained attention in various scientific fields. Defect prediction researchers have investigated many state-of-the-art ensemble models and concluded that in many cases these outperform standard single classifier techniques. Almost all previous work using ensemble techniques in defect prediction rely on the majority voting scheme for combining prediction outputs, and on the implicit diversity among single classifiers. Aim: Investigate whether defect prediction can be improved using an explicit diversity technique with stacking ensemble, given the fact that different classifiers identify different sets of defects. Method: We used classifiers from four different families and the weighted accuracy diversity (WAD) technique to exploit diversity amongst classifiers. To combine individual predictions, we used the stacking ensemble technique. We used state-of-the-art knowledge in software defect prediction to build our ensemble models, and tested their prediction abilities against 8 publicly available data sets. Conclusion: The results show performance improvement using stacking ensembles compared to other defect prediction models. Diversity amongst classifiers used for building ensembles is essential to achieving these performance improvements.",

keywords = "diversity, ensembles of learning machines, Software defect prediction, software faults, stacking",

author = "Jean Petri{\'c} and David Bowes and Tracy Hall and Bruce Christianson and Nathan Baddoo",

note = "{\textcopyright} ACM, 2016. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement http://dx.doi.org/10.1145/2961111.2962610; 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2016 ; Conference date: 08-09-2016 Through 09-09-2016",

year = "2016",

month = sep,

day = "8",

doi = "10.1145/2961111.2962610",

language = "English",

booktitle = "ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement",

publisher = "Association for Computing Machinery, Inc",

}

RIS

TY - GEN

T1 - Building an Ensemble for Software Defect Prediction Based on Diversity Selection

AU - Petrić, Jean

AU - Bowes, David

AU - Hall, Tracy

AU - Christianson, Bruce

AU - Baddoo, Nathan

N1 - © ACM, 2016. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement http://dx.doi.org/10.1145/2961111.2962610

PY - 2016/9/8

Y1 - 2016/9/8

N2 - Background: Ensemble techniques have gained attention in various scientific fields. Defect prediction researchers have investigated many state-of-the-art ensemble models and concluded that in many cases these outperform standard single classifier techniques. Almost all previous work using ensemble techniques in defect prediction rely on the majority voting scheme for combining prediction outputs, and on the implicit diversity among single classifiers. Aim: Investigate whether defect prediction can be improved using an explicit diversity technique with stacking ensemble, given the fact that different classifiers identify different sets of defects. Method: We used classifiers from four different families and the weighted accuracy diversity (WAD) technique to exploit diversity amongst classifiers. To combine individual predictions, we used the stacking ensemble technique. We used state-of-the-art knowledge in software defect prediction to build our ensemble models, and tested their prediction abilities against 8 publicly available data sets. Conclusion: The results show performance improvement using stacking ensembles compared to other defect prediction models. Diversity amongst classifiers used for building ensembles is essential to achieving these performance improvements.

AB - Background: Ensemble techniques have gained attention in various scientific fields. Defect prediction researchers have investigated many state-of-the-art ensemble models and concluded that in many cases these outperform standard single classifier techniques. Almost all previous work using ensemble techniques in defect prediction rely on the majority voting scheme for combining prediction outputs, and on the implicit diversity among single classifiers. Aim: Investigate whether defect prediction can be improved using an explicit diversity technique with stacking ensemble, given the fact that different classifiers identify different sets of defects. Method: We used classifiers from four different families and the weighted accuracy diversity (WAD) technique to exploit diversity amongst classifiers. To combine individual predictions, we used the stacking ensemble technique. We used state-of-the-art knowledge in software defect prediction to build our ensemble models, and tested their prediction abilities against 8 publicly available data sets. Conclusion: The results show performance improvement using stacking ensembles compared to other defect prediction models. Diversity amongst classifiers used for building ensembles is essential to achieving these performance improvements.

KW - diversity

KW - ensembles of learning machines

KW - Software defect prediction

KW - software faults

KW - stacking

U2 - 10.1145/2961111.2962610

DO - 10.1145/2961111.2962610

M3 - Conference contribution/Paper

AN - SCOPUS:84991666877

BT - ESEM '16 Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement

PB - Association for Computing Machinery, Inc

CY - New York

T2 - 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2016

Y2 - 8 September 2016 through 9 September 2016

ER -

Research

Associated organisational units

Electronic data

Links

Text available via DOI:

Keywords