ARTÍCULO
TITULO

Applying machine learning algorithms to provide quality requirements specification

A.D. Belonogova    
P.A. Ognyanovich    
K.I. Gaydamaka    

Resumen

This article is devoted to the problem of ensuring the quality of requirements specifications for complex technical systems. The purpose of this article is to use neural networks, classification and clustering algorithms to check requirements specifications for consistency and atomicity. It is believed that the use of neural networks will provide a vector representation of textual requirements formulations in order to identify inconsistencies in requirements specifications and to check the atomicity of individual requirements. This article demonstrates the use of such natural language processing techniques as fasttext, doc2vec, and BERT. K-means clustering is used to find inconsistencies in requirements specifications based on the assumption that the requirements of one cluster are potentially conflicting. Requirements are checked for atomicity by using gradient boosting over decision trees. The study showed that using the pretrained BERT neural network gives the best vector representations of requirements for solving clustering and classification problems using k-Means and gradient boosting, respectively. In addition, training the doc2vec model on requirements specifications is impractical, because the number of requirements in the specifications is usually limited and not enough for training, and FastText does not consider the semantics of the full requirement statement. In conclusion, a comparison of the results of the natural language processing methods considered in the article is given.

 Artículos similares

       
 
Dwaipayan Chakraborty and Subhashis Mallick    
Ocean-water temperature and salinity are two vital properties that are required for weather-, climate-, and marine biology-related research. These properties are usually measured using disposable instruments at sparse locations, typically from tens to hu... ver más

 
Sta?a Pu?karic, Mateo Sokac, ?ivana Nincevic, Danijela ?antic, Sanda Skejic, Tomislav D?oic, Heliodor Prelesnik and Knut Yngve Børsheim    
In this communication, we present an innovative approach leveraging advanced Machine Learning (ML) and Artificial Intelligence (AI) techniques, specifically the Non-Negative Matrix Factorization (NMF) method, to analyze downward and upward light spectra ... ver más

 
Huang Feng and Yu Zhang    
Extensive research in predicting annual passenger throughput has been conducted, aiming at providing decision support for airport construction, aircraft procurement, resource management, flight scheduling, etc. However, how airport operational throughput... ver más
Revista: Aerospace

 
Marko Ðurasevic, Domagoj Jakobovic, Stjepan Picek and Luca Mariot    
The automated design of dispatching rules (DRs) with genetic programming (GP) has become an important research direction in recent years. One of the most important decisions in applying GP to generate DRs is determining the features of the scheduling pro... ver más
Revista: Algorithms

 
Myoung-Su Choi, Dong-Hun Han, Jun-Woo Choi and Min-Soo Kang    
Sleep apnea has emerged as a significant health issue in modern society, with self-diagnosis and effective management becoming increasingly important. Among the most renowned methods for self-diagnosis, the STOP-BANG questionnaire is widely recognized as... ver más
Revista: Applied Sciences