Home > Research > Publications & Outputs > Duplicate Bug Report Detection and Classificati...

Text available via DOI:

View graph of relations

Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique. / Kukkar, Ashima; Mohana, Rajni; Kumar, Yugal et al.
In: IEEE Access, Vol. 8, 9235309, 2020, p. 200749-200763.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Kukkar, A, Mohana, R, Kumar, Y, Nayyar, A, Bilal, M & Kwak, KS 2020, 'Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique', IEEE Access, vol. 8, 9235309, pp. 200749-200763. https://doi.org/10.1109/ACCESS.2020.3033045

APA

Kukkar, A., Mohana, R., Kumar, Y., Nayyar, A., Bilal, M., & Kwak, K. S. (2020). Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique. IEEE Access, 8, 200749-200763. Article 9235309. https://doi.org/10.1109/ACCESS.2020.3033045

Vancouver

Kukkar A, Mohana R, Kumar Y, Nayyar A, Bilal M, Kwak KS. Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique. IEEE Access. 2020;8:200749-200763. 9235309. doi: 10.1109/ACCESS.2020.3033045

Author

Kukkar, Ashima ; Mohana, Rajni ; Kumar, Yugal et al. / Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique. In: IEEE Access. 2020 ; Vol. 8. pp. 200749-200763.

Bibtex

@article{a03a955be59541148306621ca7e8f8fc,
title = "Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique",
abstract = "Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.",
keywords = "bug tracking system, convolutional neural network, deep learning, Duplicate bug report detection, natural language processing, Siamese networks, software development, software engineering, software maintenance",
author = "Ashima Kukkar and Rajni Mohana and Yugal Kumar and Anand Nayyar and Muhammad Bilal and Kwak, {Kyung Sup}",
note = "Funding Information: This work was supported by the National Research Foundation of Korea funded by the Korean Government, Ministry of Science and ICT, under Grant NRF-2020R1A2B5B02002478. Publisher Copyright: {\textcopyright} 2013 IEEE.",
year = "2020",
doi = "10.1109/ACCESS.2020.3033045",
language = "English",
volume = "8",
pages = "200749--200763",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

RIS

TY - JOUR

T1 - Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique

AU - Kukkar, Ashima

AU - Mohana, Rajni

AU - Kumar, Yugal

AU - Nayyar, Anand

AU - Bilal, Muhammad

AU - Kwak, Kyung Sup

N1 - Funding Information: This work was supported by the National Research Foundation of Korea funded by the Korean Government, Ministry of Science and ICT, under Grant NRF-2020R1A2B5B02002478. Publisher Copyright: © 2013 IEEE.

PY - 2020

Y1 - 2020

N2 - Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.

AB - Duplicate bug report detection is a process of finding a duplicate bug report in the bug tracking system. This process is essential to avoid unnecessary work and rediscovery. In typical bug tracking systems, more than thousands of duplicate bug reports are reported every day. In turn, human cost, effort and time are increased. This makes it an important problem in the software management process. The solution is to automate the duplicate bug report detection system for reducing the manual effort, thus the productivity of triager's and developer's is increased. It also speeds up the process of software management as a result software maintenance cost is also reduced. However, existing systems are not quite accurate yet, in spite of these systems used various machine learning approaches. In this work, an automatic bug report detection and classification model is proposed using deep learning technique. The proposed system has three modules i.e. Preprocessing, Deep Learning Model and Duplicate Bug report Detection and Classification. Further, the proposed model used Convolutional Neural Network based deep learning model to extract relevant feature. These relevant features are used to determine the similar features of bug reports. Hence, the bug reports similarity is computers through these similar features. The performance of the proposed system is evaluated on six publicly available datasets using six performance metrics. It is noticed that the proposed system outperforms the existing systems by achieving an accuracy rate in the range of 85% to 99 % and recall@k rate in between 79%-94%. Moreover, the effectiveness of the proposed system is also measured on the cross training datasets of same and different domain. The proposed system achieves a good high accuracy rate for same domain data sets and low accuracy rate for different domain datasets.

KW - bug tracking system

KW - convolutional neural network

KW - deep learning

KW - Duplicate bug report detection

KW - natural language processing

KW - Siamese networks

KW - software development

KW - software engineering

KW - software maintenance

UR - http://www.scopus.com/inward/record.url?scp=85096232168&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2020.3033045

DO - 10.1109/ACCESS.2020.3033045

M3 - Journal article

AN - SCOPUS:85096232168

VL - 8

SP - 200749

EP - 200763

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

M1 - 9235309

ER -