Research Article
[Retracted] Software Systems Security Vulnerabilities Management by Exploring the Capabilities of Language Models Using NLP
Algorithm 1
Text classification using CNN.
| Input: security- and nonsecurity-related text with labeling | | Process: | (1) | Tokenization of text to create vocabulary: | | t = tf.keras.preprocessing.text | | Tokenizer (oov_token = “<UNK>”) | (2) | Conversion of text to sequence of words further to sequence of numeric IDs: train_sequences = t.texts_to_sequences (normalized training text) | (3) | Sentence length distribution visualization | (4) | Text sequence padding: | | tf.keras.preprocessing | | sequence.pad_sequences () | (5) | FastText-based embedding matrix construction | (6) | Model architecture construction: | | tf.keras.models.Sequential () | (7) | Training and validation | (8) | Model performance evaluation on test data | | Output: | | Accuracy: 71.89% | | Precision: 0.88 | | Recall: 0.72 | | F1-score: 0.77 |
|