A new Ensemble approach for Software Fault Prediction

Ehsan Elahi, Saima Kanwal, Ali Nouman Asif

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Software Fault prediction is about prediction of faulty modules in the software systems. This prediction is done using different metrics such as lines of code, depth of inheritance tree etc. The fault prediction is carried out during early stages of development, as it decreases the maintenance costs and improve the software quality. Though software engineering research community have employed certain approaches to build an effective fault prediction model by using machine learning techniques, but the resampling techniques are used as an explicit activity in their studies. There is no study which reports the impact of discriminative power of resampling technique with classifiers. The main objective is to utilize the best combination of classifier with resampling technique in the model averaging method. In this study, we have investigated and compared the effect of different ensemble methods in improving the performance of prediction model. We have benchmarked the model averaging method using existing ensemble methods such as voting and stacking. The class imbalance problem is tackled with the use of different resampling techniques such as SMOTE, random under sampling and random over sampling. Five classifiers have been employed in this study due to their wide acceptance in fault prediction context which includes decision trees, logistic regression, Naive bayes, multinomial naive bayes and K nearest neighbor. The model is tested on 4 publicly available datasets from the PROMISE repository. In order to evaluate the performance of fault prediction model, F-measure is used. In order to avoid the sample randomness and biasness, the results are cross validated using k-fold ( k=10) cross validation. The experimental results showed that model averaging method outperformed when compared with other ensembles.

Original languageEnglish
Title of host publicationProceedings of 2020 17th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages407-412
Number of pages6
ISBN (Electronic)9781728146768
DOIs
Publication statusPublished - Jan 2020
Externally publishedYes
Event17th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2020 - Islamabad, Pakistan
Duration: 14 Jan 202018 Jan 2020

Publication series

NameProceedings of 2020 17th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2020

Conference

Conference17th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2020
Country/TerritoryPakistan
CityIslamabad
Period14/01/202018/01/2020

Keywords

  • class imbalance
  • classifier
  • ensemble
  • software fault prediction

Fingerprint

Dive into the research topics of 'A new Ensemble approach for Software Fault Prediction'. Together they form a unique fingerprint.

Cite this