Retrieval of Interactive requirements for Data Intensive Applications using Random Forest Classifier

Renita Raymond, Margret Anouncia Savarimuthu

Abstract


Classifying requirements in data-intensive systems based on their interactions can assist the requirements engineering process in becoming more systematic and transparent, resulting in higher requirement compliance and software project completion. However, understanding the requirements centred on interactions with the system is particularly tough due to the increased complexity of big data. In most cases, awareness of interaction-based requirements is critical in moving forward with prediction and decision-making. As a result, the classification of interactive requirements plays a critical role in removing the difficulties from unclear requirements. Various approaches to effective requirement classification are being devised. However, due to inadequate requirement management reflecting the fast-changing organizational change, classification accuracy does not achieve its maximum potential. The best approach for reducing misclassification rate and retrieving interactive requirements for data-intensive systems would be to use Word Embedding and a Fast Similarity Search (k-NN) retrieval mechanism, as none of the studies to date have emphasized it. It also assessed the impact by comparing the results to metrics derived from the Random Forest classifier's training on word count characteristics. The data set used to experiment with the classification, particularly for interaction-based needs, is unique to our work and has not been covered by any other studies to date. The researchers will benefit from this study as they will better understand the requirement classification process. With an F1 score of 0.91, precision of 0.89, and recall of 0.93, statistical analysis showed that Word Embedding followed by k-NN similarity search produced a relatively high classification result to differentiate interactive requirements for data-intensive systems.


Full Text:

PDF

References


Qader WA, Ameen MM, Ahmed BI. An Overview of Bag of Words; Importance, Implementation, Applications, and Challenges. In: 2019 International Engineering Conference (IEC); 2019. p. 200–204.

Wang P. Eliciting big data requirement from big data itself: A task-directed ap- proach. 6th International Workshop on Software Mining (SoftwareMining). 2017;.

Palomares C, Quer C, Franch X. Requirements reuse and requirement patterns: a state of the practice survey. Empirical Software Engineering. 2017;22:2719–2762.

Robinson WN, Pawlowski SD, Volkov V. Requirements Interaction Management. ACM Computing Surveys (CSUR). 2003;35(2):132–190.

Meth H, Brhel M, Maedche A. The state of the art in automated requirements elici- tation. Information and Software Technology. 2013;55:1695–1709.

Pohl K. Rocky Nook, Inc; 2016.

Li C. Automatically classifying user requests in crowdsourcing requirements engi- neering. Journal of Systems and Software. 2018;138:108–123.

Madhavji NH, Miranskyy A, Kontogiannis K. Big picture of big data software engi- neering: with example research challenges. IEEE/ACM 1st International Workshop on Big Data Software Engineering. 2015;.

Sodagari E, Keyvanpour M. Challenges Classification of Software Requirements In- teraction Management Using Search-Based Methods. 5th International Conference on Web Research (ICWR). 2019;.




DOI: https://doi.org/10.31449/inf.v47i9.3772

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.