Classifying bug reports to bugs and other requests: an approach using topic modelling and fuzzy set theory
Mohammed D. Ali and Ahmed A. Abusnaina
Abstract
Stakeholders of a software system usually deliver bug reports to a software bug tracking system to report problems they encounter during the use of that system. After that, those incoming requests are assigned to the proper technicians to be analysed and fixed. However, issue reporters frequently misclassify bug requests as non-bug and vice versa. This problem is called bug report misclassification. In fact, it is very costly in terms of time and effort as developers have to manually reclassify those requests to then be able to take the appropriate action. Accordingly, the automatic classification of incoming new reports would be of a valuable demanded feature in bug tracking systems. Careful analysis for literature related to this problem was carried out. It has been found that a hybrid approach combining between topic modelling as a feature extraction technique and the fuzzy logic as a classification technique is promising but rarely utilized approach. In this work, a combined approach of topic modelling and fuzzy logic was introduced to classify bug reports to bugs and non-bugs. The proposed approach was validated using three open-source projects. Finally, the conducted experiments have shown that proposed classifier achieves significant and competitive predictive accuracy.
Keyword
Software maintenance, Bug reports, Fuzzy sets, Topic modelling, Classification, Bug tracking systems.
Cite this article
Ali MD, Abusnaina AA.Classifying bug reports to bugs and other requests: an approach using topic modelling and fuzzy set theory. International Journal of Advanced Computer Research. 2021;11(56):103-115. DOI:10.19101/IJACR.2021.1152031
Refference
[1]Pandey N, Sanyal DK, Hudait A, Sen A. Automated classification of software issue reports using machine learning techniques: an empirical study. Innovations in Systems and Software Engineering. 2017; 13(4):279-97.
[2]Xia X, Lo D, Ding Y, Al-Kofahi JM, Nguyen TN, Wang X. Improving automated bug triaging with specialized topic model. IEEE Transactions on Software Engineering. 2016; 43(3):272-97.
[3]Pingclasai N, Hata H, Matsumoto KI. Classifying bug reports to bugs and other requests using topic modeling. In Asia-pacific software engineering conference 2013(pp. 13-8). IEEE.
[4]Otoom AF, Al-jdaeh S, Hammad M. Automated classification of software bug reports. In proceedings of the 9th international conference on information communication and management 2019 (pp. 17-21).
[5]Chawla I, Singh SK. An automated approach for bug categorization using fuzzy logic. In proceedings of the 8th india software engineering conference 2015 (pp. 90-9).
[6]Zhou Y, Tong Y, Gu R, Gall H. Combining text mining and data mining for bug report classification. Journal of Software: Evolution and Process. 2016; 28(3):150-76.
[7]Chawla I, Singh SK. Automated labeling of issue reports using semi supervised approach. Journal of Computational Methods in Sciences and Engineering. 2018; 18(1):177-91.
[8]Jangra M, Singh SK. Evaluating topic modeling as pre-processing for component prediction in bug reports. In advanced computing and communication technologies 2016 (pp. 465-73). Springer, Singapore.
[9]Terdchanakul P, Hata H, Phannachitta P, Matsumoto K. Bug or not? bug report classification using n-gram idf. In international conference on software maintenance and evolution 2017 (pp. 534-8). IEEE.
[10]Qin H, Sun X. Classifying bug reports into bugs and non-bugs using lstm. In proceedings of the tenth Asia-Pacific symposium on internetware 2018 (pp. 1-4).
[11]Kukkar A, Mohana R. A supervised bug report classification with incorporate and textual field knowledge. Procedia Computer Science. 2018; 132:352-61.
[12]Herzig K, Just S, Zeller A. Its not a bug, its a feature: how misclassification impacts bug prediction. In 2013 international conference on software engineering 2013(pp. 392-401). IEEE.
[13]Hammad M, Alzyoudi R, Otoom AF. Automatic clustering of bug reports. International Journal of Advanced Computer Research. 2018; 8(39):313-23.
[14]Behl D, Handa S, Arora A. A bug mining tool to identify and analyze security bugs using naive bayes and tf-idf. In international conference on reliability optimization and information technology 2014 (pp. 294-9). IEEE.
[15]Antoniol G, Ayari K, Di Penta M, Khomh F, Guéhéneuc YG. Is it a bug or an enhancement? a text-based approach to classify change requests. In proceedings of the conference of the center for advanced studies on collaborative research: meeting of minds 2008 (pp. 304-18).
[16]Chen TH, Thomas SW, Hassan AE. A survey on the use of topic models when mining software repositories. Empirical Software Engineering. 2016; 21(5):1843-919.
[17]Zhang T, Chen J, Yang G, Lee B, Luo X. Towards more accurate severity prediction and fixer recommendation of software bugs. Journal of Systems and Software. 2016; 17:166-84.
[18]Zolkeply MS, Shao J. Classifying software issue reports through association mining. In proceedings of the 34th ACM/SIGAPP symposium on applied computing 2019 (pp. 1860-3).
[19]Panda RR, Nagwani NK. Software bug categorization technique based on fuzzy similarity. In 9th international conference on advanced computing 2019 (pp. 1-6). IEEE.
[20]http://nlp.stanford.edu/software/tmt/tmt- 0.4/. Accessed: 10-September-2021.
[21]Zadeh LA. Fuzzy sets. In fuzzy sets, fuzzy logic, and fuzzy systems: selected papers by Lotfi A Zadeh 1996 (pp. 394-432).
[22]http://jackrabbit.apache.org/. Accessed: 10-September-2021.
[23]http://lucene.apache.org/. Accessed: 10-September-2021.
[24]http://hc.apache.org/. Accessed: 10-September-2021.
[25]https://www.st.cs.uni-saarland.de//softevo/. Accessed: 10-September-2021.
[26]Brownlee J. Machine learning mastery with python. Machine Learning Mastery Pty Ltd. 2016; 527:100-20.
[27]Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.
[28]Brownlee J. Imbalanced classification with Python: better metrics, balance skewed classes, cost-sensitive learning. Machine Learning Mastery; 2020.