liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Automated bug assignment: Ensemble-based machine learning in large scale industrial contexts
Linköping University, Department of Computer and Information Science. Linköping University, Faculty of Science & Engineering. Ericsson AB, Sweden.
Lund University, Sweden.
KTH Royal Institute Technology, Sweden; University of Calif Berkeley, CA 94720 USA.
Linköping University, Department of Computer and Information Science, Software and Systems. Linköping University, Faculty of Science & Engineering.
Show others and affiliations
2016 (English)In: Journal of Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 21, no 4, 1533-1578 p.Article in journal (Refereed) PublishedText
Abstract [en]

Bug report assignment is an important part of software maintenance. In particular, incorrect assignments of bug reports to development teams can be very expensive in large software development projects. Several studies propose automating bug assignment techniques using machine learning in open source software contexts, but no study exists for large-scale proprietary projects in industry. The goal of this study is to evaluate automated bug assignment techniques that are based on machine learning classification. In particular, we study the state-of-the-art ensemble learner Stacked Generalization (SG) that combines several classifiers. We collect more than 50,000 bug reports from five development projects from two companies in different domains. We implement automated bug assignment and evaluate the performance in a set of controlled experiments. We show that SG scales to large scale industrial application and that it outperforms the use of individual classifiers for bug assignment, reaching prediction accuracies from 50 % to 89 % when large training sets are used. In addition, we show how old training data can decrease the prediction accuracy of bug assignment. We advice industry to use SG for bug assignment in proprietary contexts, using at least 2,000 bug reports for training. Finally, we highlight the importance of not solely relying on results from cross-validation when evaluating automated bug assignment.

Place, publisher, year, edition, pages
SPRINGER , 2016. Vol. 21, no 4, 1533-1578 p.
Keyword [en]
Machine learning; Ensemble learning; Classification; Bug reports; Bug assignment; Industrial scale; Large scale
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:liu:diva-130374DOI: 10.1007/s10664-015-9401-9ISI: 000379060700004OAI: oai:DiVA.org:liu-130374DiVA: diva2:952818
Note

Funding Agencies|Industrial Excellence Center EASE Embedded Applications Software Engineering

Available from: 2016-08-15 Created: 2016-08-05 Last updated: 2016-08-15

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Jonsson, LeifSandahl, Kristian
By organisation
Department of Computer and Information ScienceFaculty of Science & EngineeringSoftware and Systems
In the same journal
Journal of Empirical Software Engineering
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 23 hits
ReferencesLink to record
Permanent link

Direct link