Research Article

[Retracted] Software Systems Security Vulnerabilities Management by Exploring the Capabilities of Language Models Using NLP

Algorithm 1

Text classification using CNN.
Input: security- and nonsecurity-related text with labeling
Process:
(1)Tokenization of text to create vocabulary:
 t = tf.keras.preprocessing.text
 Tokenizer (oov_token = “<UNK>”)
(2)Conversion of text to sequence of words further to sequence of numeric IDs: train_sequences = t.texts_to_sequences (normalized training text)
(3)Sentence length distribution visualization
(4)Text sequence padding:
 tf.keras.preprocessing
 sequence.pad_sequences ()
(5)FastText-based embedding matrix construction
(6)Model architecture construction:
 tf.keras.models.Sequential ()
(7)Training and validation
(8)Model performance evaluation on test data
Output:
Accuracy: 71.89%
Precision: 0.88
Recall: 0.72
F1-score: 0.77