Deep Learning Approaches for Cyberbullying Detection and Classification on Social Media

S, Neelakandan; M, Sridevi; Chandrasekaran, Saravanan; K, Murugeswari; Singh Pundir, Aditya Kumar; R, Sridevi; Lingaiah, T.Bheema

doi:https://doi.org/10.1155/2022/2163458

Computational Intelligence and Neuroscience

Special Issue

Advances in Computational Intelligence Techniques for Next Generation Internet of Things

Research Article | Open Access

Volume 2022 | Article ID 2163458 | https://doi.org/10.1155/2022/2163458

Neelakandan S,¹ Sridevi M,² Saravanan Chandrasekaran,³ Murugeswari K,⁴ Aditya Kumar Singh Pundir,⁵ Sridevi R,⁶ and T.Bheema Lingaiah⁷

Academic Editor: Akshi Kumar

Received 23 Mar 2022

Revised 23 Apr 2022

Accepted 25 May 2022

Published 11 Jun 2022

Abstract

The ease of access to the internet and cell phones has driven a significant increase in the popularity of online social networks (OSN) and social media in recent years. Security and privacy, however, remain key concerns on these platforms, and cyberbullying (CB) in particular is a serious problem that needs to be addressed. CB is defined as a repetitive, purposeful, and aggressive act performed by individuals through information and communication technology (ICT) platforms such as social media, the internet, and cell phones. It includes hate messages sent by e-mail, chat rooms, and social media platforms, which are accessed through computers and mobile phones. The detection and categorization of CB in social networks using deep learning (DL) models are therefore crucial to combat this trend. This paper presents feature subset selection with deep learning-based CB detection and categorization (FSSDL-CBDC), a novel approach for social networks that combines deep learning with feature subset selection. The proposed FSSDL-CBDC technique consists of several phases, including preprocessing, feature selection, and classification. A binary coyote optimization (BCO)-based feature subset selection (BCO-FSS) technique is employed to select a subset of features that increases classification performance. In addition, the salp swarm algorithm (SSA) is used in conjunction with a deep belief network (DBN), referred to as the SSA-DBN model, to detect and classify cyberbullying in social media networks and other online environments. The design of the BCO-FSS and SSA-DBN models for the detection and classification of cyberbullying constitutes the novelty of the research. A large number of simulations were carried out to illustrate the classification performance of the proposed FSSDL-CBDC technique. The SSA-DBN model exhibited superior accuracy to the other algorithms, with a 99.983% accuracy rate. Overall, the experimental results revealed that the FSSDL-CBDC technique outperforms the other strategies in several respects.

1. Introduction

In recent years, people have come to rely on global online social networks (OSN) as virtual meeting platforms that facilitate communication in their daily lives. These networks help users find new friends and extend their connections around the world. Furthermore, the sharing of data and opinions is a significant feature of OSN [1–3], and the rate of OSN use has increased rapidly. OSNs such as Facebook, Google+, LinkedIn, Twitter, VKontakte, Mixi (a Japanese social network), and Sina Weibo have become the preferred means of communication for billions of daily active users. Users spend most of their time updating their content, interacting with other users, and browsing others' accounts to find particular data, which are the main uses of a social network website. OSN can remove the economic and geographical barriers among users for communication and information sharing. In addition, OSN is highly beneficial for purposes such as amusement, education, and job search. The popularity of OSN, however, leads to high risks of attacks on OSN users. Many OSN users expose their private data, which acts as an invitation for attackers to perform malicious activities [4, 5].

The current pervasiveness of cyberbullying (CB) has increased the importance of its detection. According to one survey, nearly 43% of teenagers in the US alone reported having been targets of CB at some point. CB is regarded as a new, electronic form of conventional bullying. It is defined as an aggressive, repeated, and intentional act carried out by an individual or group against other individuals or groups using information and communication technology (ICT) such as the internet, mobile phones, and social media [6]. CB events take place entirely over online media rather than in a physical setting. CB includes hate messages sent by e-mail, social networks, and so on, via public or private computers or private mobile phones. It has emerged as a severe threat across states. More recent research puts the rate at about 59% in the US. CB has a similar, if not greater, negative impact on the victim compared with conventional bullying, since perpetrators generally attack a victim on aspects that an individual cannot alter (viz., ethnicity, physical appearance, skin color, and religion), which leaves a deep and long-lasting effect on the victim. Occasionally, the related humiliation is sufficient to push the victim to self-harm or suicide.

A study in [7] showed that suicidal thoughts tend to rise among teenagers exposed to various types of CB. Even when precautions are taken, the recovery of victims of CB is challenging for society and families. Self-hate, dominance, isolation, and resistance to socialization lead to troubled and unhappy adults. Furthermore, this psychological imbalance can itself produce future bullies. Among the many problems that make the detection of CB in OSN highly complicated, current state-of-the-art solutions do not account for the different types of bullying in their detection methods [8]. Different kinds of CB can arise on the web, and it cannot be assumed that a single detection method would be effective in finding all types of bullying.

The main limitation of current CB detection studies is the lack of input data. Such studies are traditionally conducted on an available dataset or on survey data, in which victims or perpetrators are allowed to report their impressions. Another issue with automatic CB identification is determining the most appropriate operation on CB material, taking into account the available studies in the CB detection domain, so that automatic detection accurately recognizes CB actions. This makes the actions harder to determine, and well-developed tools for combining the information via an autonomous decision technique are necessary [9]. Automated decision-making is the process of making decisions without human intervention. Inferred data or digitally developed profiles can also be used to make these decisions. For example, a preprogrammed algorithm and criteria may be used in an online loan decision or a recruitment aptitude examination. Administration relies heavily on automation. In the relevant areas and with suitable supervision, automated systems can increase the consistency, correctness, and transparency of administrative decision-making and enable new service delivery choices. In one study, machine learning was used to automatically detect CB material based on a number of psychological and common characteristics. The CB detection rate of this intelligent system was reported to be low, and it was found to be mostly confined to a person writing a comment in the text [10]. Other research has stated that using the user context of an event, including the history and features of user comments, improves the efficiency of CB classification or detection [11].

In this work, the SSA is used with the DBN algorithm. The salp swarm algorithm (SSA) combined with a deep belief network (DBN) is called the SSA-DBN model, and it is employed to detect and classify cyberbullying in social networks. For identifying suspicious attacks in a social network, a salp swarm algorithm-based deep belief network is presented. The suggested chronological salp swarm algorithm-based deep belief network is constructed by fusing the chronological and salp swarm concepts. The fitness function, which accepts the minimal error value as the optimal solution, reveals the optimal solution for detecting the incursion. The suggested approach tunes the weights appropriately to produce an effective and optimal solution for identifying intruders.

This study presents a novel feature subset selection with DL-based CB detection and classification (FSSDL-CBDC) model for social networks. A binary coyote optimization-based feature subset selection (BCO-FSS) technique is applied to choose a set of features for enhanced classification efficiency. Moreover, the salp swarm algorithm (SSA) with a deep belief network (DBN), called the SSA-DBN model, is used to detect and classify CB in social networks. DBNs were created in response to the issues that classic neural networks have when training deep layered networks, such as slow learning, becoming stuck in local minima owing to poor parameter selection, and requiring large training datasets. A greedy algorithm is used to pretrain DBNs. The design of the BCO-FSS and SSA-DBN models for the CB detection and classification process constitutes the novelty of this work. Furthermore, using the SSA to fine-tune the hyperparameters of the DBN model yields better outcomes than the traditional DBN model. To examine the detection performance of the proposed FSSDL-CBDC method, a comprehensive range of simulations was performed on a benchmark dataset.

2. Related Works

This section reviews recently developed automated CB classification models for social networks. Yuvaraj et al. [12] integrate a classification engine and a feature extraction engine. The classification engine uses an ANN to categorize the result, and it is evaluated by a scheme that either penalizes or rewards the categorized output. DRL performs this evaluation, which increases the efficiency of classification. Mahbub et al. [13] investigate the impact of predatory approach words on CB detection and present a method for generating a vocabulary of predatory approach phrases. Their study brings together findings from investigations of convicted criminals' chat logs to develop a lexicon of sexual approach terms. Through the examination of data from a variety of social networks, the research establishes the relevance of this dictionary of approach terms in detecting online predatory behavior via machine learning methodologies, and it illustrates the variety of content available on different social media sites.

Talpur and O'Sullivan [14] created a supervised machine learning strategy for detecting CB and categorizing its severity on Twitter. Yuvaraj et al. [15] describe a text classification engine that preprocesses tweets, eliminates noisy data and other background information, extracts the desired features, and classifies without overfitting the data. Their research proposes a novel DDT strategy that processes input components by using the hidden layers of a DNN as tree nodes. Chia et al. [16] use feature engineering and machine learning approaches to explore the use of irony and sarcasm on social media platforms. They first define and assess sarcasm and irony by reviewing a large number of research studies focused on the contexts in which they are used. They then compare numerous classification approaches with a few widely used classification schemes for text classification, and they examine and compare a variety of data preprocessing methods.

Murnion et al. [17] propose an automated data collection scheme that continuously gathers game chat data from a popular online multiplayer game. The data are collected and combined with other information about the players from available online data services. The work presents a scoring scheme that enables the detection of CB based on current research. The collected data were classified using simple feature detection with SQL database queries, and the results were compared with classifications from AI-based sentiment text analysis services that have recently become available and with data classified using a custom-built classification client.

Bu and Cho [18] proposed an ensemble of two DL methods. The first is a character-level CNN that captures lower-level syntactic information from the sequence of characters and is made robust to noise by the TL method. The second is a word-level LRCN that captures higher-level semantic information from the sequence of words, complementing the CNN module. Kumari and Singh [19] extract combined features of text and images to identify distinct events of CB. They use a pretrained VGG-16 network and a CNN to extract the features from images and text, respectively. These features are further optimized by a GA to increase the performance of the overall system. Al-Garadi et al. conducted an in-depth study of cyberbullying prediction models on social media and identified several unresolved issues, including the prediction of cyberbullying intensity, human data features, and language dynamics. Numerous studies examine various machine learning options for detecting cyberbullying. Hosseinmardi et al. investigate the detection of cyberbullying episodes on the social media platform Instagram. They employ naive Bayes and SVM classifiers, the latter of which achieves the highest performance by combining multimodal text and image information with media session data. Several other studies concentrate on characteristics believed to be associated with cyberbullying, such as the social network structure of users, combined text and image analysis, profanity features, sentiment analysis, and geographical features.

3. The Proposed Model

Figure 1 illustrates the working principle of the FSSDL-CBDC approach. The proposed FSSDL-CBDC technique comprises several steps: preprocessing, feature selection using BCO-FSS, and classification using SSA-DBN. Cyberbullying is a pernicious form of online abuse of power with harmful consequences. It takes a variety of forms, and on most social media platforms, it is textual. Intelligent systems are required to detect such situations automatically. Deep learning-based models have made their way into the detection of cyberbullying occurrences, with the claim that they overcome the limits of conventional models and significantly enhance detection performance. These steps are described in greater detail in the following sections.

3.1. Preprocessing

During preprocessing, a lexical normalization technique is employed, which uses several components to clean the input data. It converts numerical values into their corresponding textual form. In addition, a spell corrector tool is used to remove out-of-vocabulary words. Finally, repetitive or missing entries are removed, along with spelling mistakes, incorrect punctuation marks, and so on.
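The cleaning steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `normalize`, the `DIGIT_WORDS` table, and the optional `vocabulary` filter are hypothetical, and a real system would use a full number-to-words routine and a proper spell corrector.

```python
import re

# Hypothetical digit lookup; only single digits are spelled out in this sketch.
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def normalize(text, vocabulary=None):
    """Return a list of cleaned tokens for one social media message."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)        # drop URLs
    text = re.sub(r"(.)\1{2,}", r"\1\1", text)       # "sooooo" -> "soo"
    # Convert each ASCII digit to its textual form.
    text = re.sub(r"[0-9]", lambda m: " " + DIGIT_WORDS[m.group()] + " ", text)
    text = re.sub(r"[^a-z\s']", " ", text)           # strip punctuation and symbols
    tokens = text.split()
    if vocabulary is not None:
        # Assumption: out-of-vocabulary tokens are simply dropped.
        tokens = [t for t in tokens if t in vocabulary]
    return tokens


tokens = normalize("U are sooooo DUMB!!! 2 bad http://x.co")
```

Dropping out-of-vocabulary tokens is the simplest reading of the text; replacing them with the nearest dictionary word would be an equally plausible design.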

3.2. Design of the BCO-FSS Technique

Once the input social network data are preprocessed, they are fed into the BCO-FSS technique. The COA is a robust population-based method proposed recently by Pierezan and Coelho [20]. This method draws inspiration from the social behavior of the Canis latrans species, which lives mostly in North America. Because of its particular structure, the COA can be categorized as both an evolutionary heuristic and a swarm intelligence (SI) method. The coyote population is divided into $N_p$ packs with $N_c$ coyotes each. The number of coyotes in each pack is assumed to be constant and equal, so the population size is obtained by multiplying $N_p$ and $N_c$. Each coyote's social condition represents a candidate solution to the optimization problem. The social condition of the $c$th coyote of the $p$th pack at time $t$ is described as follows:

$$\mathrm{soc}_c^{p,t} = \vec{x} = (x_1, x_2, \ldots, x_D),$$

where $D$ represents the number of dimensions of the search space. Initially, the coyote population is randomly initialized within the predetermined search space as follows:

$$\mathrm{soc}_{c,j}^{p,t} = lb_j + r_j \cdot (ub_j - lb_j),$$

where $lb_j$ and $ub_j$ denote the lower and upper bounds of the $j$th decision variable, respectively, and $r_j$ is a real number drawn uniformly at random from $[0, 1]$. Then, the adaptation of each coyote to its social condition is evaluated by the following:

$$\mathrm{fit}_c^{p,t} = f\left(\mathrm{soc}_c^{p,t}\right).$$
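The initialization step can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the pack and coyote counts, and the `sphere` objective are hypothetical stand-ins. In the FSSDL-CBDC setting, the objective would instead score a candidate feature subset.

```python
import random


def init_population(n_packs, n_coyotes, lb, ub, rng):
    """Return packs[p][c], the social condition of coyote c in pack p.

    Each component j is drawn uniformly from [lb[j], ub[j]].
    """
    dim = len(lb)
    return [
        [
            [lb[j] + rng.random() * (ub[j] - lb[j]) for j in range(dim)]
            for _ in range(n_coyotes)
        ]
        for _ in range(n_packs)
    ]


def sphere(x):
    """Hypothetical objective used only for illustration."""
    return sum(v * v for v in x)


rng = random.Random(0)
lb, ub = [-5.0] * 3, [5.0] * 3
packs = init_population(n_packs=4, n_coyotes=5, lb=lb, ub=ub, rng=rng)

# Adaptation (fitness) of every coyote to its social condition.
fitness = [[sphere(soc) for soc in pack] for pack in packs]

# Population size is the number of packs times the coyotes per pack.
population_size = len(packs) * len(packs[0])
```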

Coyotes tend to leave their current pack either to lead a solitary life or to join another pack. The eviction of a coyote from its pack occurs with probability $P_e$, which depends on the pack size as follows:

$$P_e = 0.005 \cdot N_c^2.$$

This mechanism improves population diversity by promoting a global exchange of information between the coyote packs. Because $P_e$ exceeds one for $N_c > \sqrt{200}$, the COA limits the maximum number of coyotes per pack to fourteen [21]. In every pack, the coyote best adapted to the environment is designated as the alpha. For a minimization problem, the alpha is given as

$$\mathrm{alpha}^{p,t} = \left\{ \mathrm{soc}_c^{p,t} \;\middle|\; \arg\min_{c \in \{1, \ldots, N_c\}} f\left(\mathrm{soc}_c^{p,t}\right) \right\}.$$
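As a small worked check, the sketch below evaluates the pack-size-dependent eviction probability and picks the alpha of a pack. It assumes the standard COA form of the eviction probability, 0.005 times the squared pack size, as given in [20]; the function names are hypothetical.

```python
def eviction_probability(n_coyotes):
    """Probability that a coyote leaves its pack (standard COA form, assumed)."""
    return 0.005 * n_coyotes ** 2


def select_alpha(pack, fitness):
    """Return the social condition with the lowest fitness (minimization)."""
    best = min(range(len(pack)), key=fitness.__getitem__)
    return pack[best]


# With 14 coyotes the probability stays just below one; with 15 it would exceed one,
# which is why the pack size is capped at fourteen.
p14 = eviction_probability(14)
p15 = eviction_probability(15)

alpha = select_alpha([[1.0, 1.0], [0.0, 0.0], [2.0, 2.0]], [2.0, 0.0, 8.0])
```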

Given the clear signs of SI in coyotes, the COA assumes that all coyotes share their social conditions with the rest of the pack to improve the pack's survivability. Accordingly, the cultural tendency of a pack is determined from the information supplied by its members, viz.,

$$\mathrm{cult}_j^{p,t} = \begin{cases} O_{\frac{N_c+1}{2},\,j}^{p,t}, & N_c \text{ is odd}, \\[4pt] \dfrac{O_{\frac{N_c}{2},\,j}^{p,t} + O_{\frac{N_c}{2}+1,\,j}^{p,t}}{2}, & \text{otherwise}, \end{cases}$$

where $O^{p,t}$ represents the ranked social conditions of all coyotes of pack $p$ at time $t$, for every $j \in [1, D]$. In other words, the cultural tendency of each pack is computed as the median of the social conditions of all coyotes within the pack. To model the two main biological events of a coyote, namely, birth and death, the age of every coyote is considered, $\mathrm{age}_c^{p,t} \in \mathbb{N}$. The birth of a new coyote is defined as a combination of the social conditions of two randomly chosen parents from the same pack plus the effect of an environmental factor:

$$\mathrm{pup}_j^{p,t} = \begin{cases} \mathrm{soc}_{r_1,j}^{p,t}, & rnd_j < P_s \text{ or } j = j_1, \\ \mathrm{soc}_{r_2,j}^{p,t}, & rnd_j \geq P_s + P_a \text{ or } j = j_2, \\ R_j, & \text{otherwise}, \end{cases}$$

where $r_1$ and $r_2$ denote random coyotes from the pack, and $j_1$ and $j_2$ denote two random dimensions of the search space. $P_s$ and $P_a$ denote the scatter and association probabilities, respectively. $R_j$ denotes a random number within the bounds of the $j$th dimension, and $rnd_j$ denotes a uniformly random number in $[0, 1]$. The scatter and association probabilities have a substantial impact on the composition and diversity of the coyote pack. They are given by the following:

$$P_s = \frac{1}{D}, \qquad P_a = \frac{1 - P_s}{2}.$$
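The cultural tendency and the birth of a pup can be sketched in Python as follows. This is an illustrative sketch that assumes the standard COA rules from [20] (per-dimension median, scatter probability equal to one over the number of dimensions, and association probability equal to half of the remainder); the function names are hypothetical.

```python
import random
from statistics import median


def cultural_tendency(pack):
    """Per-dimension median of the social conditions in one pack."""
    dim = len(pack[0])
    return [median(soc[j] for soc in pack) for j in range(dim)]


def make_pup(pack, lb, ub, rng):
    """Combine two random parents with a random environmental influence.

    Requires at least two coyotes in the pack and at least two dimensions.
    """
    dim = len(lb)
    p_s = 1.0 / dim              # scatter probability
    p_a = (1.0 - p_s) / 2.0      # association probability
    r1, r2 = rng.sample(range(len(pack)), 2)   # two distinct parents
    j1, j2 = rng.sample(range(dim), 2)         # two distinct guaranteed dimensions
    pup = []
    for j in range(dim):
        rnd = rng.random()
        if rnd < p_s or j == j1:
            pup.append(pack[r1][j])            # gene from the first parent
        elif rnd >= p_s + p_a or j == j2:
            pup.append(pack[r2][j])            # gene from the second parent
        else:
            pup.append(lb[j] + rng.random() * (ub[j] - lb[j]))  # environment
    return pup


rng = random.Random(3)
pack = [[1.0, 10.0, 0.5], [3.0, 30.0, 1.5], [2.0, 20.0, 2.5]]
cult = cultural_tendency(pack)
pup = make_pup(pack, lb=[0.0, 0.0, 0.0], ub=[5.0, 40.0, 5.0], rng=rng)
```

Guaranteeing one gene from each parent through the two fixed dimensions keeps every pup related to both parents even when the random draws would otherwise favor the environment.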

Based on this, a coyote pup has around a 10% chance of dying at birth. Moreover, the death risk of every coyote increases with age. Thus, the COA models the coyotes' survival with a simple rule, in which $\omega$ and $\varphi$ denote the group of coyotes less adapted to the environment (viz., with worse fitness values) than the pup and the size of this group, respectively.
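The survival rule is not spelled out in the text above. The sketch below follows the birth-and-death procedure of the standard COA in [20], which is an assumption about what is meant here: the pup survives only if at least one coyote in the pack is less adapted than it, in which case it replaces the oldest of those coyotes. The function name is hypothetical.

```python
def birth_and_death(pack, fitness, ages, pup, pup_fit):
    """Insert the pup into the pack if it survives; return True on survival.

    pack, fitness, and ages are parallel lists and are modified in place.
    """
    # omega: indices of coyotes that are worse adapted than the pup.
    worse = [c for c in range(len(pack)) if fitness[c] > pup_fit]
    if not worse:                 # phi == 0: the pup dies
        return False
    # phi >= 1: the oldest coyote in omega dies and the pup takes its place.
    victim = max(worse, key=lambda c: ages[c])
    pack[victim], fitness[victim], ages[victim] = pup, pup_fit, 0
    return True


pack = [[0.0], [1.0], [2.0]]
fitness = [0.0, 1.0, 4.0]
ages = [5, 3, 7]
survived = birth_and_death(pack, fitness, ages, pup=[0.5], pup_fit=0.25)
rejected = birth_and_death(pack, fitness, ages, pup=[9.0], pup_fit=81.0)
```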

Additionally, the COA represents the cultural interaction among the coyotes in a pack using $\delta_1$ and $\delta_2$. The former denotes the influence of the alpha on a random coyote $cr_1$, whereas the latter indicates the influence of the cultural tendency of the pack on another random coyote $cr_2$. Both $cr_1$ and $cr_2$ are chosen with uniform probability. Therefore, $\delta_1$ and $\delta_2$ are given by

$$\delta_1 = \mathrm{alpha}^{p,t} - \mathrm{soc}_{cr_1}^{p,t}, \qquad \delta_2 = \mathrm{cult}^{p,t} - \mathrm{soc}_{cr_2}^{p,t}.$$

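The social condition of a coyote, influenced by the alpha and the other members of the pack, is updated using

$$\mathrm{new\_soc}_c^{p,t} = \mathrm{soc}_c^{p,t} + r_1 \cdot \delta_1 + r_2 \cdot \delta_2,$$

where $r_1$ and $r_2$ are random values. The new social condition of each coyote is assessed using

$$\mathrm{new\_fit}_c^{p,t} = f\left(\mathrm{new\_soc}_c^{p,t}\right).$$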

Each coyote keeps its new social condition only if it is better than the current one, so a coyote's fitness either improves or remains the same but never deteriorates.
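The update of a single coyote, including the keep-if-better rule, can be sketched as follows. This is an illustrative sketch with hypothetical names. Clamping the new position to the search bounds is an assumption that the text does not state, and the `sphere` objective is a stand-in.

```python
import random


def update_coyote(soc, fit, alpha, cult, soc_cr1, soc_cr2, objective, lb, ub, rng):
    """Move one coyote toward the alpha and the cultural tendency.

    Returns the (social condition, fitness) pair that survives the greedy selection.
    """
    r1, r2 = rng.random(), rng.random()   # random weights of the two influences
    new_soc = []
    for j in range(len(soc)):
        delta1 = alpha[j] - soc_cr1[j]    # influence of the alpha
        delta2 = cult[j] - soc_cr2[j]     # influence of the pack's tendency
        value = soc[j] + r1 * delta1 + r2 * delta2
        new_soc.append(min(max(value, lb[j]), ub[j]))  # clamp (assumption)
    new_fit = objective(new_soc)
    if new_fit < fit:                     # keep the new condition only if better
        return new_soc, new_fit
    return soc, fit


def sphere(x):
    """Hypothetical objective used only for illustration."""
    return sum(v * v for v in x)


rng = random.Random(1)
soc = [2.0, -2.0]
result_soc, result_fit = update_coyote(
    soc, sphere(soc), alpha=[0.0, 0.0], cult=[0.5, -0.5],
    soc_cr1=[2.0, -2.0], soc_cr2=[1.0, -1.0],
    objective=sphere, lb=[-5.0, -5.0], ub=[5.0, 5.0], rng=rng,
)
```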

Irrelevant features can hinder the learning procedure. FS assesses the significance of the features that make up a dataset and removes those that do not contribute positively. The features selected by an FS method can be represented as a vector of size N, where N denotes the total number of features in the dataset and every position of the vector takes the value zero or one: zero denotes a feature that was not chosen, and one denotes a feature that was chosen. A transfer function determines the probability of changing a position vector component from zero to one and vice versa in an efficient and simple manner; hence, it is the most widely used binarization method, particularly for FS problems [22]. Accordingly, the transfer function considerably affects the efficiency of an FS method in searching for an optimal set of features, with respect to avoiding local optima and balancing exploitation and exploration, and thus, it plays a significant role in binary versions of metaheuristics. In the BCOA, the social conditions of coyotes are constrained to binary values by a V-shaped transfer function, defined by equation (14), which is applied to the updated social condition vector with continuous values.
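The exact form of the V-shaped transfer function is not recoverable from the text, so the sketch below uses the absolute value of the hyperbolic tangent, a common V-shaped choice, purely as an assumption. The rule that sets a bit to one when a uniform random draw falls below the transfer value is likewise an assumed binarization scheme, and the function names are hypothetical.

```python
import math
import random


def v_transfer(x):
    """A common V-shaped transfer function: |tanh(x)|, with values in [0, 1]."""
    return abs(math.tanh(x))


def binarize(new_soc, rng):
    """Map a continuous social condition to a 0/1 feature mask."""
    return [1 if rng.random() < v_transfer(x) else 0 for x in new_soc]


def selected_features(mask):
    """Indices of the features kept by the mask."""
    return [j for j, bit in enumerate(mask) if bit == 1]


rng = random.Random(0)
mask = binarize([0.0, 50.0, -50.0, 0.0], rng)
kept = selected_features(mask)
```

Components far from zero in either direction are selected with high probability, while components near zero are almost never selected, which gives the function its V shape.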

3.3. Algorithmic Design of the SSA-DBN Technique

Once the feature subsets have been selected, they are fed into the SSA-DBN model, which performs the classification task. The DBN model generates feature vectors, which are then classified using the softmax layer. The SSA is used to improve the detection performance of the DBN model by selecting its hyperparameter values optimally.

3.3.1. Architecture of DBN

Since a DBN contains multiple hidden layers with many hidden units in each of those layers, it is considered a member of the DNN family. A standard DBN is a stack of RBMs with an output layer. The DBN achieves its results by using a fast, greedy unsupervised learning technique to train the RBMs, followed by a supervised fine-tuning mechanism that adjusts the network using labeled data. An RBM is composed of two types of layers, visible and hidden, coupled by undirected weights. When RBMs are stacked in a DBN, the hidden layer of one RBM serves as the visible layer of the subsequent RBM, because it provides the input information for that RBM. The RBM's parameter set is defined as $(w, b, a)$, where $w_{ij}$ is the weight between visible unit $v_i$ and hidden unit $h_j$, and $b_i$ and $a_j$ are the biases of the visible and hidden layers, respectively.

Figure 2 displays the framework of the DBN. The RBM defines the energy as follows:

$$E(v, h) = -\sum_{i} b_i v_i - \sum_{j} a_j h_j - \sum_{i} \sum_{j} v_i w_{ij} h_j.$$

The joint probability distribution of $v$ and $h$ is defined as follows:

$$P(v, h) = \frac{e^{-E(v, h)}}{\sum_{v, h} e^{-E(v, h)}}.$$

Here, the marginal probability distribution of $v$ is given by

$$P(v) = \frac{\sum_{h} e^{-E(v, h)}}{\sum_{v, h} e^{-E(v, h)}}.$$
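For a very small RBM, the energy and the marginal probability of a visible vector can be computed by brute-force enumeration, which makes the definitions concrete. The sketch below uses hypothetical function names and toy parameters, with `b` as the visible biases and `a` as the hidden biases. Enumeration is only feasible for a handful of units and is not how a DBN is trained in practice.

```python
import math
from itertools import product


def energy(v, h, w, b, a):
    """RBM energy for binary visible vector v and hidden vector h."""
    e = -sum(b[i] * v[i] for i in range(len(v)))
    e -= sum(a[j] * h[j] for j in range(len(h)))
    e -= sum(v[i] * w[i][j] * h[j] for i in range(len(v)) for j in range(len(h)))
    return e


def marginal(v, w, b, a):
    """P(v): sum over hidden states divided by the partition function."""
    n_v, n_h = len(b), len(a)
    partition = sum(
        math.exp(-energy(vv, hh, w, b, a))
        for vv in product((0, 1), repeat=n_v)
        for hh in product((0, 1), repeat=n_h)
    )
    numerator = sum(
        math.exp(-energy(v, hh, w, b, a)) for hh in product((0, 1), repeat=n_h)
    )
    return numerator / partition


# Toy RBM with two visible units and one hidden unit.
w = [[0.5], [-1.0]]
b = [0.1, 0.2]
a = [0.3]

e_example = energy((1, 0), (1,), w, b, a)
total = sum(marginal(v, w, b, a) for v in product((0, 1), repeat=2))
```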

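To obtain optimal parameter values for a single data vector $v$, the gradient of the log-likelihood estimate is evaluated as follows [23]:

$$\frac{\partial \log P(v)}{\partial w_{ij}} = \langle v_i h_j \rangle_{data} - \langle v_i h_j \rangle_{model}, \qquad \frac{\partial \log P(v)}{\partial b_i} = \langle v_i \rangle_{data} - \langle v_i \rangle_{model}, \qquad \frac{\partial \log P(v)}{\partial a_j} = \langle h_j \rangle_{data} - \langle h_j \rangle_{model}.$$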

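Here, $\langle \cdot \rangle$ denotes the expectation under the distribution specified by the subscript. Because there are no connections between units within the same layer, $\langle \cdot \rangle_{data}$ can be obtained from the conditional probability distributions

$$P(h_j = 1 \mid v) = \sigma\Big(a_j + \sum_{i} v_i w_{ij}\Big), \qquad P(v_i = 1 \mid h) = \sigma\Big(b_i + \sum_{j} w_{ij} h_j\Big),$$

where $\sigma$ is the activation function.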

The activation function $\sigma(x) = 1/(1 + e^{-x})$ is referred to as a sigmoid function because of its shape. Contrastive divergence (CD) can be regarded as a learning technique that approximates maximum likelihood. It computes the difference between the positive phase (the energy of the first encoding) and the negative phase (the energy of the last encoding). For $\langle \cdot \rangle_{model}$, CD learning is used, which minimizes the difference between two Kullback-Leibler (KL) divergences through reconstruction. CD learning is more efficient than Gibbs sampling in real-world applications and requires less processing time. Thus, the weights of the DBN layers are trained on unlabeled data by a fast, greedy unsupervised algorithm. For prediction, the DBN uses a supervised layer to fine-tune the learned features using labeled data from the training set. The top layer is a fully connected (FC) layer, and the layers beneath it use the sigmoid activation function.
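A single contrastive divergence update with one reconstruction step (often called CD-1) can be sketched as follows. This is a minimal pure-Python illustration with hypothetical names, using `b` for the visible biases and `a` for the hidden biases. The learning rate and layer sizes are arbitrary, and the source does not state how many reconstruction steps are used.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sample_hidden(v, w, a, rng):
    """Return P(h_j = 1 | v) for every hidden unit and one binary sample."""
    probs = [sigmoid(a[j] + sum(v[i] * w[i][j] for i in range(len(v))))
             for j in range(len(a))]
    return probs, [1 if rng.random() < p else 0 for p in probs]


def sample_visible(h, w, b, rng):
    """Return P(v_i = 1 | h) for every visible unit and one binary sample."""
    probs = [sigmoid(b[i] + sum(w[i][j] * h[j] for j in range(len(h))))
             for i in range(len(b))]
    return probs, [1 if rng.random() < p else 0 for p in probs]


def cd1_step(v0, w, b, a, lr, rng):
    """Update w, b, a in place from one training vector v0."""
    ph0, h0 = sample_hidden(v0, w, a, rng)   # positive phase (data)
    _, v1 = sample_visible(h0, w, b, rng)    # reconstruction
    ph1, _ = sample_hidden(v1, w, a, rng)    # negative phase (reconstruction)
    for i in range(len(b)):
        for j in range(len(a)):
            w[i][j] += lr * (v0[i] * ph0[j] - v1[i] * ph1[j])
        b[i] += lr * (v0[i] - v1[i])
    for j in range(len(a)):
        a[j] += lr * (ph0[j] - ph1[j])


rng = random.Random(0)
w = [[0.0, 0.0] for _ in range(3)]   # 3 visible units, 2 hidden units
b = [0.0, 0.0, 0.0]
a = [0.0, 0.0]
initial_probs, _ = sample_hidden([1, 0, 1], w, a, rng)
for _ in range(20):
    cd1_step([1, 0, 1], w, b, a, lr=0.1, rng=rng)
```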

3.3.2. Overview of SSA

The SSA is a recent SI method proposed in [24]. The main idea behind the SSA is to simulate the swarming behavior of salps in deep oceans. Salps belong to the family Salpidae and have transparent, barrel-shaped bodies. They resemble jellyfish in their tissue and motion: they move forward by pumping water through their bodies. While navigating the ocean, salps form a swarm called a salp chain. The salp chain behavior is modeled mathematically by dividing the population into a leader and followers. The salp at the front of the chain is regarded as the leader, and the remaining salps are called followers. The leader's role is to guide the swarm, and each follower follows the salp ahead of it. Like other SI techniques, the SSA starts by initializing a random population of salps and then evaluates the fitness of every salp [25]. The salp with the best fitness value is designated the leader, whereas the other salps are followers. The SSA is a stochastic algorithm inspired by salps' navigational and foraging abilities. However, the classical SSA shows a poor convergence rate on higher-dimensional problems: it lacks sufficient exploration and exploitation, resulting in inefficient convergence. A salp's best fitness is found by exploring and exploiting the search space. The leader's location is updated based on the distance between the salp and the food source. The best-performing salp position is treated as the food source to be chased by the salp chain. Updating the location of the salp chain involves two major stages: the leader and follower phases.

The location of the leader is updated by equation (20) as follows:

$$x_j^1 = \begin{cases} F_j + c_1\big((ub_j - lb_j)\,c_2 + lb_j\big), & c_3 \geq 0.5, \\ F_j - c_1\big((ub_j - lb_j)\,c_2 + lb_j\big), & c_3 < 0.5, \end{cases} \tag{20}$$

where $x_j^1$ and $F_j$ represent the new location of the leader and the location of the food source in the $j$th dimension, and $ub_j$ and $lb_j$ represent the upper and lower bounds of the $j$th dimension, respectively. $c_2$ and $c_3$ denote random numbers in the range $[0, 1]$. The variable $c_1$ is an important parameter of the SSA that controls the balance between exploration and exploitation. It gradually decreases over the iterations, as shown in equation (21):

$$c_1 = 2 e^{-\left(\frac{4l}{L}\right)^2}, \tag{21}$$

where $l$ specifies the current iteration and $L$ denotes the maximum number of iterations. Figure 3 shows the flowchart of the SSA. The location of the followers is updated based on Newton's law of motion, as in equation (22):

$$x_j^i = \frac{1}{2} a t^2 + v_0 t, \tag{22}$$

where $i \geq 2$ and $x_j^i$ denotes the location of the $i$th follower salp in the $j$th dimension. In the optimization procedure, the time $t$ corresponds to the current iteration, whereas $v_0$ and $a$ indicate the initial velocity and the acceleration, respectively. In equation (22), the initial velocity is set to zero and the discrepancy between iterations is one; thus, the update of the followers reduces to equation (23):

$$x_j^i = \frac{1}{2}\left(x_j^i + x_j^{i-1}\right). \tag{23}$$
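One iteration of the leader and follower updates can be sketched in Python as follows. This is an illustrative sketch with hypothetical names. The 0.5 threshold on the random switch and the clamping of the leader to the search bounds are assumptions, and fitness evaluation and food source replacement are omitted.

```python
import math
import random


def c1_schedule(iteration, max_iter):
    """Exploration coefficient that decays from 2 toward 0 over the run."""
    return 2.0 * math.exp(-((4.0 * iteration / max_iter) ** 2))


def ssa_step(salps, food, lb, ub, iteration, max_iter, rng):
    """Return the new salp positions; salps[0] is the leader."""
    c1 = c1_schedule(iteration, max_iter)
    new = [list(s) for s in salps]        # copy so the input is not modified
    for j in range(len(food)):
        c2, c3 = rng.random(), rng.random()
        step = c1 * ((ub[j] - lb[j]) * c2 + lb[j])
        # Leader moves around the food source; the sign is chosen at random.
        value = food[j] + step if c3 >= 0.5 else food[j] - step
        new[0][j] = min(max(value, lb[j]), ub[j])   # clamp (assumption)
    for i in range(1, len(new)):
        for j in range(len(food)):
            # Each follower moves halfway toward the salp ahead of it.
            new[i][j] = 0.5 * (new[i][j] + new[i - 1][j])
    return new


rng = random.Random(1)
salps = [[0.0, 0.0], [4.0, 4.0], [8.0, 8.0]]
moved = ssa_step(salps, food=[1.0, 1.0], lb=[-10.0, -10.0], ub=[10.0, 10.0],
                 iteration=1, max_iter=10, rng=rng)
```

Because the coefficient decays with the iteration count, early iterations move the leader widely around the food source, while late iterations keep it close to the best solution found so far.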

3.3.3. Parameter Optimization of DBN Using SSA

The SSA is used to optimally tune the hyperparameters of the DBN model, as detailed below. The training process of the DBN model is guided by a fitness function (FF) [26–30]. A 10-fold cross-validation (CV) process is used to evaluate the FF. Under 10-fold CV, the training dataset is randomly divided into ten mutually exclusive subsets of nearly equal size, where nine subsets are used to train the model and the remaining one is used to test it. This process is repeated ten times so that each subset is used once for testing. The FF is defined as $1 - CA_{validation}$ of the 10-fold CV on the training data, as in equation (24), so that a solution with maximum $CA_{validation}$ yields the minimal fitness value [31–33]:

$$\mathrm{Fitness} = 1 - CA_{validation}, \tag{24}$$

where $CA_{validation}$ is the classification accuracy over the ten folds, computed from $y_c$ and $y_f$, the counts of correct and false classifications, respectively. Finally, the hyperparameters of the DBN model are optimally selected by the SSA, which improves the classification performance.
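The cross-validated fitness can be sketched in Python as follows. This is a minimal illustration with hypothetical names. The `train_and_predict` callback stands in for training a DBN with one candidate hyperparameter setting, and averaging the per-fold accuracy is an assumption about how the validation accuracy is aggregated.

```python
import random


def k_fold_indices(n_samples, k, rng):
    """Shuffle the indices and split them into k disjoint folds of similar size."""
    idx = list(range(n_samples))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cv_fitness(samples, labels, train_and_predict, k=10, seed=0):
    """Return 1 - mean validation accuracy; lower is better.

    Requires at least k samples so that no fold is empty.
    """
    folds = k_fold_indices(len(samples), k, random.Random(seed))
    accuracies = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(samples)) if i not in held_out]
        predict = train_and_predict(
            [samples[i] for i in train_idx], [labels[i] for i in train_idx]
        )
        y_c = sum(1 for i in test_idx if predict(samples[i]) == labels[i])
        y_f = len(test_idx) - y_c          # false classifications in this fold
        accuracies.append(y_c / (y_c + y_f))
    return 1.0 - sum(accuracies) / k


# Toy data and toy "models" used only to exercise the function.
samples = list(range(-10, 10))
labels = [int(x >= 0) for x in samples]
perfect = lambda X, y: (lambda x: int(x >= 0))
always_wrong = lambda X, y: (lambda x: int(x < 0))

best_fitness = cv_fitness(samples, labels, perfect)
worst_fitness = cv_fitness(samples, labels, always_wrong)
```

An optimizer such as the SSA would call a function like this once per candidate hyperparameter vector and keep the candidate with the smallest returned value.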

4. Performance Evaluation

In this section, we validate the performance of the proposed model from several aspects. Table 1 reports the performance of the feature selection techniques in terms of classification accuracy under different proportions of training data and varying numbers of residuals [34, 35]. Figure 4 shows the classification accuracy of the different feature selection techniques on 60% of training data. The figure shows that the BCO-FSS model is an effective method that yields the maximum classification accuracy. For instance, under 200 residuals, the proposed BCO-FSS technique achieved a classification accuracy of 28.61%, whereas the Pearson's correlation, chi-squared, and information gain techniques resulted in classification accuracies of 28.61%, 27.10%, and 25.18%, respectively.

Moreover, under 1000 residuals, the BCO-FSS technique obtained a higher classification accuracy of 37.80%, whereas the Pearson's correlation, chi-squared, and information gain techniques attained lower classification accuracies of 34.78%, 32.68%, and 29.46%, respectively.

Figure 5 shows the classification accuracy of the different feature selection approaches on 75% of training data. The figure shows that the BCO-FSS method is an effective technique that yields the maximum classification accuracy. For instance, under 200 residuals, the presented BCO-FSS method achieved a higher classification accuracy of 43.24%, whereas the Pearson's correlation, chi-squared, and information gain methods resulted in lower classification accuracies of 40.39%, 36.26%, and 34.65%, respectively. Furthermore, under 1000 residuals, the BCO-FSS technique achieved a higher classification accuracy of 66.26%, whereas the Pearson's correlation, chi-squared, and information gain techniques achieved lower classification accuracies of 60.07%, 55.47%, and 52.19%, respectively.

Figure 6 shows the classification accuracy of the different feature selection methods on 90% of the training data. The figure indicates that the BCO-FSS technique is again the most effective approach and reaches the highest classification accuracy. For instance, under 200 residuals, the proposed BCO-FSS method achieved a classification accuracy of 47.44%, while Pearson’s correlation, chi-squared, and information gain reached lower accuracies of 43.88%, 39.55%, and 33.29%, respectively. Also, under 1000 residuals, the BCO-FSS technique attained a maximum classification accuracy of 74.18%, whereas Pearson’s correlation, chi-squared, and information gain reached 69.60%, 64.51%, and 59.61%, respectively.
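The size of the BCO-FSS advantage at 1000 residuals can be read directly from the accuracies quoted above. The following Python sketch only does that arithmetic; the dictionary layout and the helper name `margin_over_best_baseline` are illustrative choices, not part of the original study.

```python
# Classification accuracy (%) at 1000 residuals, copied from the text above.
# Keys are the percentage of data used for training.
reported = {
    60: {"BCO-FSS": 37.80, "Pearson": 34.78, "Chi-squared": 32.68, "Information gain": 29.46},
    75: {"BCO-FSS": 66.26, "Pearson": 60.07, "Chi-squared": 55.47, "Information gain": 52.19},
    90: {"BCO-FSS": 74.18, "Pearson": 69.60, "Chi-squared": 64.51, "Information gain": 59.61},
}


def margin_over_best_baseline(scores, proposed="BCO-FSS"):
    """Return (best baseline name, proposed accuracy minus that baseline's accuracy)."""
    baselines = {name: acc for name, acc in scores.items() if name != proposed}
    best = max(baselines, key=baselines.get)
    return best, round(scores[proposed] - baselines[best], 2)


margins = {split: margin_over_best_baseline(scores) for split, scores in reported.items()}
for split, (best, gap) in margins.items():
    print(f"{split}% training data: +{gap:.2f} points over {best}")
```

In all three splits the strongest baseline is Pearson’s correlation, so the margin is measured against it.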

Table 2 compares the proposed SSA-DBN model with existing techniques on 60% of the training data. Figure 7 shows the sensitivity and specificity of the SSA-DBN model and the compared methods. As seen in the figure, the LR model showed the lowest sensitivity, 74.335%, with a specificity of 75.025%. The RF model produced a slightly better result, with a sensitivity of 74.825% and a specificity of 78.196%. The SVM model showed further improvement, with a sensitivity of 76.506% and a specificity of 83.708%. The ANN model generated moderate results, with a sensitivity of 76.976% and a specificity of 84.719%. The NB model reached a sensitivity of 80.657% but a specificity of only 73.475%. The ANN-DRL model yielded competitive outcomes, with 83.598% sensitivity and 85.129% specificity. However, the proposed SSA-DBN model outperformed all of these methods, with sensitivity and specificity values of 85.728% and 88.728%, respectively.

Figure 8 offers a comparative analysis of the SSA-DBN with other classifiers in terms of accuracy, F-measure, and G-mean. The results show that the NB and LR models attained the lowest accuracies, 61.821% and 69.023%, respectively. Next, the RF and SVM models showed moderate performance, with accuracies of 71.814% and 77.236%, respectively. In addition, the ANN and ANN-DRL techniques showed reasonable results, with accuracies of 80.897% and 85.369%, respectively. However, the SSA-DBN model outperformed the other methods with a maximum accuracy of 88.473%.
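For readers who want to reproduce these measures, the sketch below computes sensitivity, specificity, accuracy, F-measure, and G-mean from binary confusion-matrix counts. It assumes the standard textbook definitions (F-measure as the harmonic mean of precision and sensitivity, G-mean as the geometric mean of sensitivity and specificity); the excerpt does not restate the formulas used in the study, and the counts passed in are hypothetical.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Compute five scores from confusion-matrix counts (positive class = cyberbullying).

    Assumes every denominator is non-zero; no guarding is done in this sketch.
    """
    sensitivity = tp / (tp + fn)                 # true-positive rate (recall)
    specificity = tn / (tn + fp)                 # true-negative rate
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    g_mean = math.sqrt(sensitivity * specificity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "f_measure": f_measure,
        "g_mean": g_mean,
    }


# Hypothetical counts, chosen only to exercise the function.
scores = binary_metrics(tp=80, fp=10, tn=90, fn=20)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

G-mean is useful alongside accuracy because it drops sharply when either class is poorly recognized, which accuracy can hide on imbalanced data.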

Table 3 gives a brief comparison of the SSA-DBN approach with the alternative methods on 75% of the training data. Figure 9 shows the sensitivity and specificity of the SSA-DBN model and the compared methods. The NB approach had the lowest sensitivity (68.593%), with a specificity of 98.093%. The LR approach produced somewhat better outcomes, with a sensitivity of 69.583% and a specificity of 98.583%. The RF algorithm followed with a sensitivity of 70.074% and a specificity of 98.513%, and the SVM model scored 70.394% and 98.663%, respectively. The ANN model achieved a sensitivity of 72.614% and a specificity of 98.553%. The ANN-DRL model showed competitive outcomes, with 73.315% sensitivity and 98.353% specificity. However, the proposed SSA-DBN technique obtained the best performance, with a sensitivity of 79.063% and a specificity of 98.976%.

Figure 10 provides a comparative analysis of the SSA-DBN with other classifiers with respect to accuracy, F-measure, and G-mean. The results show that the NB and RF methods attained the lowest accuracies, 96.743% and 96.993%, respectively. Next, the SVM and LR techniques showed similar performance, with accuracies of 97.003% and 97.063%, respectively. The ANN and ANN-DRL techniques showed reasonable results, with accuracies of 97.553% and 97.313%, respectively. However, the SSA-DBN model outperformed the other techniques with a maximum accuracy of 98.362%.

Table 4 compares the performance of the proposed SSA-DBN model with that of the other techniques on 90% of the training data, and Figure 11 shows the corresponding sensitivity and specificity. As the figure shows, the LR technique performed worst, with a sensitivity of 92.766% and a specificity of 99.583%. The NB technique produced slightly better results, with a sensitivity of 92.796% and a specificity of 99.673%. The RF model improved further, with a sensitivity of 93.537% and a specificity of 99.773%. The SVM model showed moderate results, with a sensitivity of 93.697% and a specificity of 99.803%. The ANN model achieved a sensitivity of 94.387% and a specificity of 99.733%. The ANN-DRL approach showed competitive outcomes; its sensitivity and specificity are both reported as 99.873%. The proposed SSA-DBN model, however, outperformed the other models, with sensitivity and specificity values of 96.037% and 99.997%, respectively.

Figure 12 gives a comparative analysis of the SSA-DBN with other classifiers in terms of accuracy, F-measure, and G-mean. The results show that the NB and RF approaches attained accuracies of 99.503% and 99.623%, respectively. The LR and SVM models showed similar performance, with accuracies of 99.633% and 99.673%, respectively. The ANN and ANN-DRL techniques showed reasonable results, with accuracies of 99.563% and 99.703%, respectively. However, the SSA-DBN model outperformed the other algorithms with the highest accuracy, 99.983%.
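As a worked example, G-mean can be recomputed from the SSA-DBN sensitivity and specificity values quoted in the text for each training split. This sketch assumes the standard definition of G-mean as the geometric mean of the two rates; the values in Tables 2 to 4 may differ slightly because of rounding or a different definition.

```python
import math

# SSA-DBN (sensitivity %, specificity %) as quoted in the text for each training split.
ssa_dbn = {60: (85.728, 88.728), 75: (79.063, 98.976), 90: (96.037, 99.997)}


def g_mean(sensitivity, specificity):
    """Geometric mean of two percentages; the result is also a percentage."""
    return math.sqrt(sensitivity * specificity)


g_means = {split: g_mean(*pair) for split, pair in ssa_dbn.items()}
for split, value in g_means.items():
    print(f"{split}% training data: G-mean = {value:.3f}%")
```

On the 75% and 90% splits the specificity is close to 99% while the sensitivity is noticeably lower, so G-mean is pulled down mainly by the sensitivity term.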

5. Conclusions

In this paper, we presented a novel FSSDL-CBDC technique for detecting and classifying cyberbullying in social media. The proposed FSSDL-CBDC technique consists of several phases, including preprocessing, feature selection, and classification. The BCO-FSS approach chooses an optimal subset of features from the preprocessed data, which significantly enhances the overall classification results. Figure 1 shows the design of the BCO-FSS technique. The SSA-DBN model then receives the feature-reduced subset and classifies it. Using the SSA to fine-tune the hyperparameters of the DBN improved the outcomes compared with the classic DBN model. A large number of simulations on a benchmark dataset were carried out to assess the detection performance of the proposed FSSDL-CBDC technique. The simulation results showed that the FSSDL-CBDC technique classified significantly better than other state-of-the-art approaches. In the future, the performance of the FSSDL-CBDC technique may be enhanced by including outlier identification and feature reduction techniques. A further direction is unsupervised feature selection (FS) for outlier detection (OD) in streaming data (SD), for fields such as intrusion detection and network security, which are increasingly challenged with large amounts of high-dimensional data that must be analyzed in near real time.
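The feature-selection-then-classification pipeline summarized above can be illustrated with a small wrapper-style sketch in which a binary mask selects features and a classifier scores each mask. Everything here is a stand-in chosen for brevity: random bit-flip hill climbing replaces the BCO search, a nearest-centroid classifier replaces the SSA-DBN, and the `fitness` weighting (`alpha`) and the toy data are hypothetical, not taken from the study.

```python
import random


def nearest_centroid_accuracy(rows, labels, mask):
    """Training-set accuracy of a nearest-centroid classifier on the masked features."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty subset cannot classify anything
    centroids = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        centroids[cls] = [sum(r[j] for r in members) / len(members) for j in cols]
    correct = 0
    for r, y in zip(rows, labels):
        pred = min(
            centroids,
            key=lambda c: sum((r[j] - m) ** 2 for j, m in zip(cols, centroids[c])),
        )
        correct += pred == y
    return correct / len(rows)


def fitness(rows, labels, mask, alpha=0.9):
    """Hypothetical objective: reward accuracy and lightly penalize large subsets."""
    acc = nearest_centroid_accuracy(rows, labels, mask)
    return alpha * acc + (1 - alpha) * (1 - sum(mask) / len(mask))


def bit_flip_search(rows, labels, n_features, iters=200, seed=0):
    """Stand-in optimizer (not BCO): flip one random bit, keep the mask if fitness improves."""
    rng = random.Random(seed)
    best = [1] * n_features
    best_fit = fitness(rows, labels, best)
    for _ in range(iters):
        cand = best[:]
        cand[rng.randrange(n_features)] ^= 1
        cand_fit = fitness(rows, labels, cand)
        if cand_fit > best_fit:
            best, best_fit = cand, cand_fit
    return best, best_fit


# Toy data: feature 0 separates the two classes, features 1 to 3 are pure noise.
data_rng = random.Random(1)
labels = [i % 2 for i in range(40)]
rows = [
    [y * 4.0 + data_rng.gauss(0, 0.5)] + [data_rng.gauss(0, 1) for _ in range(3)]
    for y in labels
]
mask, score = bit_flip_search(rows, labels, n_features=4)
print("selected mask:", mask, "fitness:", round(score, 3))
```

The design point carried over from the paper is only the division of labor: a search procedure proposes binary feature masks, and classification quality on the reduced feature set decides which mask is kept.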

Data Availability

All of the data are included in this article.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

References

[1] S. R. Sahoo and B. B. Gupta, “Classification of various attacks and their defence mechanism in online social networks: a survey,” Enterprise Information Systems, vol. 13, no. 6, pp. 832–864, 2019.
[2] R. Annamalai, S. J. Rayen, and J. Arunajsmine, “Social media networks owing to disruptions for effective learning,” Procedia Computer Science, vol. 172, pp. 145–151, 2020.
[3] M. Fire, G. Katz, and Y. Elovici, “Strangers intrusion detection detecting spammers and fake profiles in social networks based on topology anomalies,” Human Journal, vol. 1, no. 1, pp. 26–39, 2012.
[4] D. Paulraj, “A gradient boosted decision tree based sentiment classification of twitter data,” International Journal of Wavelets Multiresolution and Information Processing, vol. 18, no. 4, pp. 2050027–2050121, 2020.
[5] H. Kefi and C. Perez, “Dark Side of Online Social Networks: Technical, Managerial, and Behavioral Perspectives,” Encyclopedia of Social Network Analysis and Mining, vol. 143, pp. 535–556, 2018.
[6] D. Vinotha and M. Vasanthi, “Classification rule discovery using ant-miner algorithm: an application of network intrusion detection,” International Journal of Modern Engineering Research (IJMER), vol. 4, no. 8, 2014.
[7] S. Hinduja and J. W. Patchin, “Bullying, cyberbullying, and suicide,” Archives of Suicide Research, vol. 14, no. 3, pp. 206–221, 2010.
[8] S. Tripathi, V. B. Devi, I. Bhardwaj, and N. Arulkumar, “IoT-based traffic prediction and traffic signal control system for smart city,” Soft Computing, vol. 25, no. 18, pp. 12241–12248, 2021.
[9] M. W. Savage and R. S. Tokunaga, “Moving toward a theory: testing an integrated model of cyberbullying perpetration, aggression, social skills, and Internet self-efficacy,” Computers in Human Behavior, vol. 71, pp. 353–361, 2017.
[10] P. Subbulakshmi and V. Ramalakshmi, “Honest Auction Based Spectrum Assignment,” Wireless Personal Communications, vol. 102, no. 1, 2008.
[11] T. Vaillancourt, R. Faris, and F. Mishna, “Cyberbullying in children and youth: implications for health and clinical practice,” Canadian Journal of Psychiatry, vol. 62, no. 6, pp. 368–373, 2017.
[12] N. Yuvaraj, K. Srihari, G. Dhiman et al., “Nature-inspired-based approach for automated cyberbullying classification on multimedia social networking,” Mathematical Problems in Engineering, vol. 2021, Article ID 6644652, 12 pages, 2021.
[13] S. Mahbub, E. Pardede, and A. S. M. Kayes, “Detection of Harassment Type of Cyberbullying: A Dictionary of Approach Words and its Impact,” Security and Communication Networks, vol. 2021, Article ID 5594175, 12 pages, 2021.
[14] B. A. Talpur and D. O’Sullivan, “Cyberbullying severity detection: a machine learning approach,” PLoS One, vol. 15, no. 10, Article ID e0240924, 2020.
[15] N. Yuvaraj, V. Chang, B. Gobinathan et al., “Automatic detection of cyberbullying using multi-feature based artificial intelligence with deep decision tree classification,” Computers & Electrical Engineering, vol. 92, Article ID 107186, 2021.
[16] Z. L. Chia, M. Ptaszynski, F. Masui, G. Leliwa, and M. Wroczynski, “Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection,” Information Processing & Management, vol. 58, no. 4, Article ID 102600, 2021.
[17] S. Murnion, W. J. Buchanan, A. Smales, and G. Russell, “Machine learning and semantic analysis of in-game chat for cyberbullying,” Computers & Security, vol. 76, pp. 197–213, 2018.
[18] S. J. Bu and S. B. Cho, “A hybrid deep learning system of CNN and LRCN to detect cyberbullying from SNS comments,” Hybrid Artificial Intelligent Systems. HAIS 2018. Lecture Notes in Computer Science, Springer, vol. 10870, pp. 561–572, 2018.
[19] K. Kumari and J. P. Singh, “Identification of cyberbullying on multi modal social media posts using genetic algorithm,” Transactions on Emerging Telecommunications Technologies, vol. 32, no. 2, Article ID e3907, 2021.
[20] J. P. L. d. S. Coelho, “Coyote Optimization Algorithm: a new metaheuristic for global optimization,” IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro, Brazil, pp. 2633–2640, 2018.
[21] S. Saravanan, M. Hailu, G. M. Gouse, M. Lavanya, and R. Vijaysai, “Optimized secure scan flip flop to thwart side channel attack in crypto-chip,” Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol. 274, pp. 410–417, 2019.
[22] R. C. T. de Souza, C. A. de Macedo, L. dos Santos Coelho, J. Pierezan, and V. C. Mariani, “Binary coyote optimization algorithm for feature selection,” Pattern Recognition, vol. 107, Article ID 107470, 2020.
[23] J. Yu and G. Liu, “RETRACTED: knowledge-based deep belief network for machining roughness prediction and knowledge discovery,” Computers in Industry, vol. 121, Article ID 103262, 2020.
[24] S. Mirjalili, A. H. Gandomi, S. Z. Mirjalili, S. Saremi, H. Faris, and S. M. Mirjalili, “Salp Swarm Algorithm a bio inspired optimizer for engineering design problems,” Advances in Engineering Software, vol. 114, pp. 163–191, 2017.
[25] S. Neelakandan and D. Paulraj, “An automated exploring and learning model for data prediction using balanced CA svm,” Journal of Ambient Intelligence and Humanized Computing, vol. 12, no. 5, 4990 pages, 2020.
[26] L. Abualigah, M. Shehab, M. Alshinwan, and H. Alabool, “Salp Swarm Algorithm: A Comprehensive Survey,” Neural Computing and Applications, vol. 32, pp. 11195–11215, 2019.
[27] V. J. Chin and Z. Salam, “Coyote optimization algorithm for the parameter extraction of photovoltaic cells,” Solar Energy, vol. 194, pp. 656–670, 2019.
[28] R. F. Mansour, N. M. Alfar, S. Abdel-Khalek, M. Abdelhaq, R. A. Saeed, and R. Alsaqour, “Optimal deep learning based fusion model for biomedical image classification,” Expert Systems, vol. 39, no. 3, Article ID e12764, 2021.
[29] J. R. Beulah, L. Prathiba, G. L. N Murthy, E. Fantin Irudaya Raj, and N. Arulkumar, “Blockchain with Deep Learning Enabled Secure Healthcare Data Transmission and Diagnostic Model,” International Journal of Modeling, Simulation, and Scientific Computing, Article ID 2241006, 2022.
[30] T. Kavitha, C. Karthikeyan, M. Ashok, R. Kohar, J. Avanija, and S. Neelakandan, “Deep learning based capsule neural network model for breast cancer diagnosis using mammogram images,” Interdisciplinary Sciences: Computational Life Sciences, vol. 14, no. 1, pp. 113–129, 2021.
[31] D. Venu, A. V. R. Mayuri, S. Neelakandan, G. Murthy, N. Arulkumar, and N. Shelke, “An efficient low complexity compression based optimal homomorphic encryption for secure fiber optic communication,” Optik, vol. 252, Article ID 168545, 2022.
[32] S. Neelakandan, M. Prakash, S. Bhargava, and K. Mohan, “Optimal stacked sparse autoencoder based traffic flow prediction in intelligent transportation systems,” Virtual and Augmented Reality for Automobile Industry: Innovation Vision and Applications, vol. 412, pp. 111–127, 2022.
[33] A. Muneer and S. M. Fati, “A comparative analysis of machine learning techniques for cyberbullying detection on twitter,” Future Internet, vol. 12, no. 11, p. 187, 2020.
[34] R. Ram Bhukya, B. Hardas, T. Anil Kumar, and M. Ashok, “An automated word embedding with parameter tuned model for web crawling,” Intelligent Automation & Soft Computing, vol. 32, no. 3, pp. 1617–1632, 2022.
[35] H. Singh, D. Ramya, R. Saravanakumar et al., “Artificial intelligence based quality of transmission predictive model for cognitive optical networks,” Optik, vol. 257, Article ID 168789, 2022.

Copyright

Copyright © 2022 Neelakandan S et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
