Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique

  • Share this course:

Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique

Reviews:

0 (0)

183 178
  • Volume : 2 Issue : 2 2024
  • Page Number : 19-26
  • Publication : ISRDO

Published Manuscript

Title

Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique

Author

1. Sadiq Waziri, Student, Abubakar Tafawa Balewa University, Nigeria

Abstract

This paper presents the results of an experiment on the classification of extracted features from Software Requirements Specification (SRS) documents using various Machine Learning techniques. The primary focus was on the linear Support Vector Machine (SVM) technique, with comparative analysis involving three additional techniques, namely Decision Tree (DT), Naïve Bayes (NB), and K-Nearest Neighbors (KNN). During the experimentation, features, which are fundamental building blocks of Software Product Lines [1], were classified into optional and mandatory. This differentiation facilitates both variability and similarity within a product family [1]. While previous research has explored similar classifications using diverse techniques, this study specifically identifies the most effective method for binary classification of features for feature modeling. The experiment was conducted on nine selected documents from the PURE dataset. The performance of each model was evaluated rigorously based on accuracy, precision, recall (sensitivity), and F1-score. The findings provide valuable insights into the optimal classification technique, enhancing the development and management of software product lines.

Keywords

Requirements Feature Feature Extraction Feature Classification Feature Modeling Support Vector Machine

Conclusion

This study highlighted the potential of automating Software Product Line Engineering (SPLE) processes through feature extraction and classification. Automating these tasks can streamline SPLE, saving time and reducing errors. A key aspect of SPLE involves classifying features as mandatory (present in all products) or optional (included only in specific variants). The study demonstrated that SVM effectively performed binary classification. The advantage of a simple classification model, like the one used in the study, lies in its efficiency and ease of interpretation. However, for very complex product lines with numerous features and intricate relationships, more advanced models are necessary.

Author Contrubution

The experiment, implementation, and manuscript writing were carried out by S. M. Waziri. F. U. Zambuk and B. I. Ya’u supervised the project and contributed from the beginning to the end of the manuscript writing. Dr. M. A. Lawal is the departmental project coordinator who searched for and recommended the best organizations for publishing papers in journals. He also guided in formatting.

Funding

The Department of Computer Science, Abubakar Tafawa Balewa University, Nigeria approved this study. It also supported research; however, the funding was by the main author.

Conflict of Interest

There was no conflict of interest in financial, commercial, legal, or professional relationships with organizations or individuals.

Data Sharing Statement

The dataset generated in this study is available in the PURE (Public Requirements) data pool at http://nlreqdataset.isti.cnr.it/.

Software And Tools Use

Acknowledgements

I acknowledge the Department of Computer Science, Faculty of Science, Abubakar Tafawa Balewa University, Bauchi, Nigeria, for support and guidance.

Corresponding Author

Sadiq Waziri

Abubakar Tafawa Balewa University, Student, Nigeria

Copyright

Copyright: ©2024 Corresponding Author. This is an open access article distributed under the terms of the Creative Commons Attribution License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Waziri, Sadiq. “Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique.” Scientific Research Journal of Science, Engineering and Technology, vol. 2, no. 2, 2024, pp. 19-26, https://isrdo.org/journal/SRJSET/currentissue/classification-of-extracted-features-from-software-requirements-specification-documents-using-support-vector-machine-learning-technique

Waziri, S. (2024). Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique. Scientific Research Journal of Science, Engineering and Technology, 2(2), 19-26. https://isrdo.org/journal/SRJSET/currentissue/classification-of-extracted-features-from-software-requirements-specification-documents-using-support-vector-machine-learning-technique

Waziri Sadiq, Classification of Extracted Features from Software Requirements Specification Documents using Support Vector Machine Learning Technique, Scientific Research Journal of Science, Engineering and Technology 2, no. 2(2024): 19-26, https://isrdo.org/journal/SRJSET/currentissue/classification-of-extracted-features-from-software-requirements-specification-documents-using-support-vector-machine-learning-technique

2578

Total words

1076

Unique Words

123

Sentence

19.967479674797

Avg Sentence Length

0.23009407259407

Subjectivity

0.065954221929832

Polarity

Text Statistics

  • Flesch Reading Ease : 41.06
  • Smog Index : 13
  • Flesch Kincaid Grade : 10.8
  • Coleman Liau Index : 14.66
  • Automated Readability Index : 13.4
  • Dale Chall Readability Score : 8.74
  • Difficult Words : 476
  • Linsear Write Formula : 25.333333333333
  • Gunning Fog : 9.46
  • Text Standard : 10th and 11th grade

Viewed / Downloads

Total article views: 162 (including HTML, PDF, and XML)
HTML PDF XML Total
92 35 35 162

Viewed (geographical distribution)

Total article views: 162 (including HTML, PDF, and XML)
Thereof 156 with geography defined and 6 with unknown origin.

No records found.