Support Vector Machine and Random Forest led software fault prediction

Research area:Software engineeringSoftwareSoftware Engineering Research

What the study found

The review found that Support Vector Machine and Random Forest were the strongest methods in the studies it examined, based on accuracy, precision, recall, and F1-score. It also found that public datasets, especially from the PROMISE and NASA Metric Data Program repositories, are widely used in software fault prediction.

Why the authors say this matters

The authors say the review is meant to advance research in software fault prediction and support the development of high-quality software products by improving defect predictability. The study also suggests it will be useful to researchers by providing an updated overview of the literature.

What the researchers tested

The paper reviewed 45 articles published between 2023 and 2025 from the IEEE, Springer, ACM, and ScienceDirect digital libraries. It covered factors influencing software fault prediction, prediction techniques, datasets and software metrics, evaluation metrics, model selection criteria, and challenges in current solutions.

What worked and what didn't

Across the reviewed articles, Support Vector Machine and Random Forest had the best reported performance on the metrics named in the abstract. The use of public datasets, particularly PROMISE and NASA Metric Data Program product-metric datasets, was associated with improved model performance. The abstract does not identify which approaches performed poorly overall.

What to keep in mind

The summary is based only on a review of 45 articles and the findings reported in their abstracts and selected results. The abstract does not provide detailed limitations, and it does not specify the full range of models or datasets beyond those named.

Key points

The review covered 45 articles on software fault prediction published from 2023 to 2025.
Support Vector Machine and Random Forest were reported as the best-performing methods in the reviewed studies.
Public datasets from PROMISE and NASA Metric Data Program were widely used and linked to improved model performance.
The review examined prediction techniques, datasets, software metrics, evaluation metrics, model selection criteria, and challenges.
The authors say the review may help advance research and improve defect predictability.

Disclosure

Research title:: Support Vector Machine and Random Forest led software fault prediction
Authors:: Ruchika Aggarwal, Kamaljit Kaur
Publication date:: 2026-01-28
DOI:: 10.7759/s44389-026-00021-7
OpenAlex record:: View

AI provenance: This post was generated by OpenAI. The original authors did not write or review this post.

Support Vector Machine and Random Forest led software fault prediction

What the study found

Why the authors say this matters

What the researchers tested

What worked and what didn't

What to keep in mind

Disclosure

More posts

Photonic chip synthesizes flexible millimeter-wave signals

Spin-dependent absorptive scattering shows universal patterns

Pancakes in 2D cosmic structure are mostly C-shaped

Toponium masses estimated from QCD sum rules