Abstract

Image material is used in ever more industries, so collecting useful images from large collections has become an urgent task. Convolutional neural networks (CNNs) have achieved good results on certain image classification tasks, but problems such as limited classification ability, low accuracy, and slow convergence remain. This article introduces research on an image classification algorithm (ICA) based on multilabel learning with an improved convolutional neural network and discusses ideas for improving such algorithms. The proposed method covers the image classification process, the convolutional network algorithm, and the multilabel learning algorithm. The results show that the improved CNN in this paper reaches an average highest classification accuracy of 90.63%, which benefits the efficiency of image classification, and it achieves a highest accuracy of 91.47% on the CIFAR-10 data set, considerably higher than the traditional CNN algorithm.

1. Introduction

A convolutional neural network can learn features directly from data. The advantage of network learning is that the user does not need to pay much attention to the specific features being trained, only to train the weights. The disadvantages are that a large amount of training data is required, the amount of computation is large, and the parameters need to be tuned. By increasing the size of data sets, improving the computing power of hardware, and proposing better network structures, the number of layers in effective convolutional neural networks keeps growing. Figure 1 is a schematic diagram of the structure of a convolutional neural network.

In real life, many systems can be described by complex networks [1]. A complex network is defined as a network that possesses some or all of the properties of self-organization, self-similarity, attractors, small-world behavior, and scale-freeness. Many concrete networks exist in the environment where we live, such as friendship networks, the Internet, public transportation networks, and mobile communication networks [2, 3]. Other phenomena can also be described as networks, such as food chain networks and language vocabulary networks, and many networks exist in the microscopic world, such as protein networks and cellular chemical reaction networks. Human development and social progress have given some networks a scale beyond ordinary cognition, and complex network research allows us to understand them further. Object recognition based on convolutional neural networks has already been applied to autonomous driving and real-time traffic monitoring systems.

Multilabel learning originated from research on text classification problems and has become an important research hotspot in international machine learning. Many learning problems in real life can be regarded as problems of assigning multiple labels [4]. In practical applications, large amounts of multilabeled data must be processed manually, which has attracted many experts and scholars in data mining and machine learning to the research of multilabel learning. For example, retrieving information from the large amount of data on the Internet requires filtering documents, images, multimedia, and so on to obtain materials that meet specific requirements.

Yang regards target recognition as a well-known and important problem in computer vision and proposed a new method of target recognition using an improved convolutional neural network; however, that research is relatively narrow and impractical [5]. Cao et al. regard data collection as an important way to reduce the power consumption of wireless sensor networks (WSN); they combined rough set theory with an improved convolutional neural network and proposed a new information collection algorithm for wireless sensor networks, but the method lacks experimental data support and is not very convincing [6]. Xu et al. analyzed the local Rademacher complexity of multilabel learning algorithms based on empirical risk minimization (ERM) and proposed a new multilabel learning algorithm [7], but such algorithms have not been used in real life, and their practical behavior remains to be seen.

The innovations of this paper are the following: (1) propose an ICA based on multilabel learning with an improved convolutional neural network, (2) propose model compression technology, and (3) propose dense convolutional neural network (DenseNet) technology.

2. Method of ICA Based on Multilabel Learning of Improved Convolutional Neural Network Based on Big Data

2.1. Image Classification Process

Image data has exploded. The traditional approach to image classification relies on hand-crafted feature description and detection. Manual classification consumes a great deal of time and manpower and the results are not ideal, so image classification technology has gradually developed. Image classification technology uses a computer to analyze an image and assign it, or part of it, to a specific category without relying on human visual judgment. The image classification process is divided into four steps: image preprocessing, image feature extraction, algorithm design, and classification result analysis [8].

2.1.1. Image Preprocessing

Image preprocessing is the process of reducing unnecessary image information and restoring or marking effective information. Commonly used methods include normalization and spatial transformation. Image preprocessing can improve image quality and reduce noise, thereby improving the reliability of image classification results [9]. Preprocessing is a prerequisite for feature extraction. The input image is generally smoothed in scale space with a Gaussian blur kernel, after which one or more features of the image are computed by local derivative operations.
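As a concrete illustration of this step, the following sketch (assuming a grayscale image stored as a NumPy array; the function name and parameters are illustrative, not taken from the paper) normalizes an image, smooths it with a Gaussian kernel, and computes simple first-derivative features:

```python
# Minimal preprocessing sketch: normalization, Gaussian smoothing in scale
# space, and local derivative (gradient) features.
import numpy as np
from scipy import ndimage

def preprocess(image: np.ndarray, sigma: float = 1.0):
    img = image.astype(np.float32)
    img = (img - img.min()) / (img.max() - img.min() + 1e-8)   # normalization to [0, 1]
    smoothed = ndimage.gaussian_filter(img, sigma=sigma)        # Gaussian blur kernel
    gy, gx = np.gradient(smoothed)                              # local first derivatives
    magnitude = np.hypot(gx, gy)                                # simple edge-strength feature
    return smoothed, magnitude

smoothed, edges = preprocess(np.random.rand(64, 64))            # toy usage example
```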

2.1.2. Image Feature Extraction

Image feature extraction uses a computer to extract relevant features and to determine whether the points in an image represent them. This article expresses the characteristics of images in the form of feature vectors. The feature extraction method determines the classification result: high-quality image features should describe the information expressed by the image as accurately as possible and also have a certain degree of stability and flexibility in response to environmental changes [10].

2.1.3. ICA Design

The design and application of algorithms have a great influence on classification results. According to the correlation between images and the related measurement methods, the mapping relationship between image features and categories is determined, and the images are finally assigned to specific categories [11]. However, image classification has many difficulties, so it is hard for ICAs to establish an accurate mapping between image features and image categories, and different algorithms produce different classification results. Therefore, designing a suitable classification algorithm has important research significance [12].

2.1.4. Analysis of Classification Results

After the image classification results are obtained, specific evaluation metrics are used to assess the algorithm objectively, such as the number of correctly and incorrectly classified images in each category. At the same time, different classification algorithms are compared, and the advantages and disadvantages of each algorithm and future directions for improvement are analyzed and summarized [13].

2.2. Convolutional Neural Network

Traditional machine learning algorithms need a feature extraction process to collect useful information from the initial data samples and obtain a set of feature vectors, and then use a trained classifier for final classification and recognition [14]. A convolutional neural network imitates the biological visual perception mechanism and can perform both supervised and unsupervised learning. Convolution kernel parameter sharing in the hidden layers and the sparsity of interlayer connections enable the convolutional neural network to learn grid-like features, such as pixels, with a smaller amount of computation. The distinguishing feature of the CNN model is that it combines feature extraction and classification recognition, with the feature extraction process hidden inside the network [15]. Figure 2 shows several common convolutional network models.

This process is called the feedforward operation [16]. In the last layer, the target task is formalized; that is, classification or regression is performed through the objective function. The use of local receptive fields, weight sharing, and pooling in a CNN greatly reduces the number of network parameters and improves network performance [17]. The principle of the local receptive field rests on the correlation between adjacent pixels of an image [18, 19], weight sharing helps reduce the number of network parameters [20, 21], and the pooling operation reduces the feature dimension while removing part of the redundant information [22].
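As an illustration of these three mechanisms, the following minimal sketch (written in PyTorch, which the paper does not specify, and not the paper's actual architecture) shows a small CNN whose convolution layers share weights over local receptive fields and whose pooling layers shrink the feature maps:

```python
# Minimal CNN sketch: local receptive fields + weight sharing (Conv2d) and
# pooling keep the parameter count far below a fully connected network.
import torch
import torch.nn as nn

class TinyCNN(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            # each output unit "sees" only a local 3x3 patch, and the same
            # kernel weights are shared across all spatial positions
            nn.Conv2d(3, 16, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),                      # pooling halves the spatial size
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 8 * 8, num_classes)  # for 32x32 inputs

    def forward(self, x):
        x = self.features(x)                       # feedforward feature extraction
        return self.classifier(torch.flatten(x, 1))

model = TinyCNN()
print(sum(p.numel() for p in model.parameters()))  # far fewer parameters than a dense net on raw pixels
```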

Neural network construction [23]: the input of a single neuron is given by the following formulas.
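In a standard formulation consistent with this description (not reproduced from the paper's own equations), the neuron first forms a weighted sum of its inputs plus a bias and then applies an activation function:

$$z = \sum_{j=1}^{n} w_j x_j + b$$

$$a = f(z)$$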

In the above two formulas, the weight parameters are constants that can take any value but cannot be 0. The learning rate is multiplied into the gradient to update each weight parameter and bias vector until the error value of the loss function is small enough:
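A gradient-descent update of this kind, written in its standard form with learning rate $\eta$ and loss $E$, is

$$w \leftarrow w - \eta \frac{\partial E}{\partial w}, \qquad b \leftarrow b - \eta \frac{\partial E}{\partial b}.$$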

In actual research and application, the probability distribution function is defined as follows:

The tanh function expression is
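Assuming the conventional definition,

$$\tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}.$$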

The ReLU function expression is
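In its standard form,

$$\mathrm{ReLU}(x) = \max(0, x).$$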

The offset (bias) of the visible-layer unit is expressed as [24]:

Design space degradation [25]:

When evolution reaches a certain generation, if the top individuals ranked by fitness value, from large to small, all fall within the same subset of strings:

2.3. Multilabel Learning Classification Algorithm

Suppose $\mathcal{X}$ denotes a $d$-dimensional example space and $\mathcal{Y} = \{y_1, y_2, \ldots, y_q\}$ denotes a label space containing $q$ categories. For a given multilabel training set $D = \{(x_i, Y_i) \mid 1 \le i \le m\}$, where $x_i \in \mathcal{X}$ is a $d$-dimensional feature vector and $Y_i \subseteq \mathcal{Y}$ is the set of labels associated with $x_i$, the goal of multilabel learning is to learn a function $h : \mathcal{X} \rightarrow 2^{\mathcal{Y}}$ from the training set that maps a suitable set of category labels to each unseen example. Based on this, for any given example $x$, the class label subset predicted by the classifier is $h(x) \subseteq \mathcal{Y}$; the classifier is usually induced from a real-valued function $f : \mathcal{X} \times \mathcal{Y} \rightarrow \mathbb{R}$, with $\operatorname{rank}_f(x, y)$ denoting the rank of label $y$ when all labels are sorted in descending order of $f(x, \cdot)$. In addition, a multilabel classifier $h(\cdot)$ and a multilabel test set $S = \{(x_i, Y_i) \mid 1 \le i \le p\}$ are defined, where $Y_i \subseteq \mathcal{Y}$ is the set of relevant labels belonging to the example $x_i$.

2.3.1. Hamming Loss

This indicator evaluates the error rate between the true labels of a sample and the labels predicted by the system, that is, the chance that a sample carries a label that is not recognized or is assigned a label it does not have:
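In its conventional form, for a test set of $p$ examples and $q$ possible labels, the Hamming loss can be written as

$$\mathrm{hloss}(h) = \frac{1}{p} \sum_{i=1}^{p} \frac{1}{q} \left| h(x_i) \,\Delta\, Y_i \right|,$$

where $\Delta$ denotes the symmetric difference between the predicted and true label sets.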

2.3.2. Ranking Loss

This indicator examines misordering in the ranking of a sample's category labels, that is, the probability that a relevant label is ranked below an irrelevant label:
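In its conventional form, with $\bar{Y_i}$ the complement of $Y_i$ in $\mathcal{Y}$, this can be written as

$$\mathrm{rloss}(f) = \frac{1}{p} \sum_{i=1}^{p} \frac{\left| \{ (y', y'') \mid f(x_i, y') \le f(x_i, y''),\ (y', y'') \in Y_i \times \bar{Y_i} \} \right|}{|Y_i|\,|\bar{Y_i}|}.$$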

2.3.3. 1-Error Rate

This evaluation indicator examines, in the ranking of a sample's category labels, whether the top-ranked label is not an actual label of the sample; in single-label learning it reduces to the ordinary error rate:
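In its conventional form,

$$\text{one-error}(f) = \frac{1}{p} \sum_{i=1}^{p} \left[\!\left[ \arg\max_{y \in \mathcal{Y}} f(x_i, y) \notin Y_i \right]\!\right],$$

where $[\![\cdot]\!]$ equals 1 when the condition holds and 0 otherwise.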

2.3.4. Coverage Rate

This indicator checks the average search depth required, when going down the ranking of a sample's category labels, to cover all labels relevant to the sample:
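In its conventional form,

$$\mathrm{coverage}(f) = \frac{1}{p} \sum_{i=1}^{p} \max_{y \in Y_i} \operatorname{rank}_f(x_i, y) - 1.$$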

2.3.5. Average Accuracy

This indicator checks, in the ranking of a sample's category labels, whether the labels ranked above a relevant label are themselves relevant, reflecting the average precision of the predicted category labels:
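In its conventional form,

$$\mathrm{avgprec}(f) = \frac{1}{p} \sum_{i=1}^{p} \frac{1}{|Y_i|} \sum_{y \in Y_i} \frac{\left| \{ y' \in Y_i \mid \operatorname{rank}_f(x_i, y') \le \operatorname{rank}_f(x_i, y) \} \right|}{\operatorname{rank}_f(x_i, y)}.$$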

2.3.6. Classification Accuracy

This evaluation index reflects the degree of consistency between the labels predicted by the system and the true labels of the sample:
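One common form is the subset accuracy, which requires the predicted label set to match the true label set exactly:

$$\mathrm{subsetacc}(h) = \frac{1}{p} \sum_{i=1}^{p} \left[\!\left[ h(x_i) = Y_i \right]\!\right].$$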

Then, the weighted kernel function is used to convert the standard distance, which gives

Then, the posterior probability of belonging to each class is calculated based on the nearest neighbors:

Finally, by assuming that the data obey a distribution composed of a mixture of Gaussian distributions, the simplified overall objective of the algorithm is obtained:

In summary, the optimization of the binary decomposition used in the classification algorithm has been completed, and the experiments can begin.

Independent component analysis seeks the mutually independent components that make up a signal and corresponds to higher-order statistical analysis; its theory holds that the observed mixed data matrix is obtained by linearly weighting independent components. This part uses the above method to study the ICA based on multilabel learning, as shown in Table 1.

3. Experiment on ICA Based on Multilabel Learning of Improved Convolutional Neural Network Based on Big Data

3.1. Improvement of Convolutional Neural Network
3.1.1. Improvement of Convolutional Neural Network Structure

Network deepening is accompanied by improvements in network components. Components with better properties can support deeper and more complex networks. Therefore, there is a lot of work around how to improve or design components in the network.

There are often better choices for the components in a network. ReLU can be used as the activation function, which helps ensure that the error signal is not attenuated during back propagation; compared with the traditional sigmoid and tanh activation functions, it converges faster and has a neuroscientific interpretation. As for newly proposed model components, adding random elements to existing components in order to avoid overfitting often works well, such as stochastic pooling and random Maxout. Other components can also improve network performance, such as smooth pooling, fractional max pooling, and network-in-network structures.

Batch normalization standardizes the output of each layer of the network so that information can propagate through the network as well as possible. Batch normalization reduces the risk of vanishing and exploding gradients, which makes it possible to design deeper networks. The deconvolution layer is an inverse convolution operation that can restore the signal obtained by convolution, producing an output of the same size as the original input image, which is useful for pixel-level labeling.
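A minimal sketch of these two components, written in PyTorch for illustration with arbitrary layer sizes, is:

```python
# A convolution block with batch normalization, followed by a deconvolution
# (transposed convolution) that restores the original spatial size.
import torch
import torch.nn as nn

block = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),  # downsamples 32x32 -> 16x16
    nn.BatchNorm2d(16),          # normalizes each channel over the mini-batch
    nn.ReLU(inplace=True),
)
deconv = nn.ConvTranspose2d(16, 3, kernel_size=2, stride=2)  # upsamples 16x16 -> 32x32

x = torch.randn(4, 3, 32, 32)
y = deconv(block(x))
print(y.shape)  # torch.Size([4, 3, 32, 32]) -- same spatial size as the input
```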

3.1.2. Improvement of Convolutional Neural Network Training

The training process of a convolutional neural network is divided into two stages. The first stage is the propagation of data from the low level to the high level, that is, the forward propagation stage. The second stage, entered when the forward-propagation results do not match expectations, propagates the error from the high level back down to the bottom level, that is, the back propagation stage. First, the final loss function of the network must be determined; deep neural networks generally use the negative log-likelihood, cross-entropy loss, Euclidean distance loss, contrastive loss, or triplet loss.
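A minimal sketch of this two-stage process, written in PyTorch for illustration with a toy model and toy data (none of which come from the paper), is:

```python
# Forward propagation produces a loss (cross-entropy here); back propagation
# sends the error back down the network and the optimizer updates the weights.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))      # toy classifier
loader = DataLoader(TensorDataset(torch.randn(64, 3, 32, 32),
                                  torch.randint(0, 10, (64,))), batch_size=16)
criterion = nn.CrossEntropyLoss()                                     # one common loss choice
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

for images, labels in loader:
    optimizer.zero_grad()
    loss = criterion(model(images), labels)   # forward propagation: data flows low -> high
    loss.backward()                            # back propagation: error flows high -> low
    optimizer.step()                           # weights adjusted by the learning rate
```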

During training, the input image can be transformed, for example by flipping, rotating, or adding noise. The transformed images serve as an expansion of the original data set, increasing the amount of training data and, at the same time, improving the robustness of the network to the corresponding transformations [26]. A similar operation can also be applied to the labels; for example, DisturbLabel randomly replaces a small fraction of labels with incorrect ones during training.
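A minimal sketch of such input-side augmentation, using torchvision transforms with illustrative parameters, is:

```python
# Flip, rotation, and additive noise applied to each training image.
import numpy as np
import torch
from PIL import Image
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.RandomRotation(degrees=15),
    transforms.ToTensor(),
    transforms.Lambda(lambda t: (t + 0.02 * torch.randn_like(t)).clamp(0.0, 1.0)),  # additive noise
])

img = Image.fromarray(np.uint8(np.random.rand(32, 32, 3) * 255))  # stand-in image
augmented = augment(img)   # a transformed tensor usable as extra training data
```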

3.2. Improved Convolutional Neural Network for Image Classification Based on Multilabel Learning
3.2.1. Model Compression Technology

SqueezeNet is a classic lightweight convolutional neural network model, and the design and use of the Fire module is the most important factor in the large reduction of its parameters. SqueezeNet mainly reduces the number of network parameters through the following three design strategies. (1) Part of the 3×3 two-dimensional convolution kernels in the network structure are replaced by 1×1 convolution kernels. Generally speaking, the convolution kernels in a convolutional neural network model are mostly 3×3, and related experiments demonstrate the effectiveness of this replacement. Since the goal of SqueezeNet is to reduce the number of parameters, while replacing too many kernels would weaken the feature extraction ability of the whole model, only part of the kernels are replaced. (2) The number of input channels of the 3×3 convolution kernels is reduced. The original convolutional layer is decomposed into two layers, a compression layer (squeeze layer) and an expansion layer (expand layer), and the two layers are encapsulated in a Fire module. (3) Downsampling is delayed in the network structure, which retains more critical features and information for the convolutional layers and can improve recognition accuracy. The Fire module is the most important part of SqueezeNet: it splits the original convolutional layer into a squeeze layer and an expand layer, each followed by a ReLU activation.
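A minimal sketch of a Fire module following the published SqueezeNet design (channel sizes are illustrative) is:

```python
# Fire module: a 1x1 squeeze layer followed by parallel 1x1 and 3x3 expand
# layers whose outputs are concatenated; each layer is followed by ReLU.
import torch
import torch.nn as nn

class Fire(nn.Module):
    def __init__(self, in_ch, squeeze_ch, expand1x1_ch, expand3x3_ch):
        super().__init__()
        self.squeeze = nn.Conv2d(in_ch, squeeze_ch, kernel_size=1)          # compression layer
        self.expand1x1 = nn.Conv2d(squeeze_ch, expand1x1_ch, kernel_size=1)
        self.expand3x3 = nn.Conv2d(squeeze_ch, expand3x3_ch, kernel_size=3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        x = self.relu(self.squeeze(x))
        return torch.cat(
            [self.relu(self.expand1x1(x)), self.relu(self.expand3x3(x))], dim=1
        )

fire = Fire(96, squeeze_ch=16, expand1x1_ch=64, expand3x3_ch=64)
print(fire(torch.randn(1, 96, 55, 55)).shape)  # torch.Size([1, 128, 55, 55])
```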

3.2.2. Convolution Kernel Separation Technology

The InceptionV3 network structure based on GoogLeNet has good recognition performance. It mainly introduces the idea of factorization into small convolutions, that is, separating a larger two-dimensional convolution kernel into two smaller one-dimensional convolution kernels, such as dividing an n×n two-dimensional kernel into a 1×n kernel and an n×1 kernel. The advantages of this strategy are the following: in the process of extracting features through the convolution operation, it saves a great deal of computation time and parameters, speeds up the model, helps alleviate overfitting and vanishing gradients, and further enhances the nonlinear expressive ability of the model. This asymmetric separated convolution structure is more effective than symmetric small two-dimensional kernels; it can handle more complex and abstract key features, can capture information about multiple targets at multiple scales, and improves recognition performance. The process of decomposing a large convolution into a horizontal convolution in series with a vertical convolution is called convolution separation. Separable convolution is a common optimization technique in image processing, and its effect is the same as the original convolution.
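A minimal sketch of this factorization in PyTorch, replacing an illustrative 7×7 convolution with a 1×7 convolution followed by a 7×1 convolution and comparing parameter counts, is:

```python
# A full 7x7 convolution versus its 1x7 + 7x1 factorization: same receptive
# field, far fewer parameters.
import torch.nn as nn

full = nn.Conv2d(64, 64, kernel_size=7, padding=3)
separable = nn.Sequential(
    nn.Conv2d(64, 64, kernel_size=(1, 7), padding=(0, 3)),
    nn.Conv2d(64, 64, kernel_size=(7, 1), padding=(3, 0)),
)

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(full), count(separable))  # 200768 vs 57472 parameters
```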

3.2.3. Dense Convolutional Neural Network (DenseNet) Technology

The dense network uses dense connection technology to link its layers: the input of each layer contains all the preceding feature maps, the output is passed on to the subsequent layers of the network, and deep feature maps are stacked. Dense connections increase the diversity of the output and enable feature reuse, thereby improving network efficiency. To further accelerate the flow of information in the network, dense connection technology uses a connection pattern different from earlier networks, directly connecting each layer to all subsequent layers of the network structure. The improvement of the CNN is shown in Table 2.
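A minimal sketch of a dense block in PyTorch (growth rate and depth are illustrative) is:

```python
# Dense block: every layer receives the concatenation of all preceding
# feature maps, so features are reused by later layers.
import torch
import torch.nn as nn

class DenseBlock(nn.Module):
    def __init__(self, in_ch, growth_rate=12, num_layers=4):
        super().__init__()
        self.layers = nn.ModuleList()
        for i in range(num_layers):
            self.layers.append(nn.Sequential(
                nn.BatchNorm2d(in_ch + i * growth_rate),
                nn.ReLU(inplace=True),
                nn.Conv2d(in_ch + i * growth_rate, growth_rate, kernel_size=3, padding=1),
            ))

    def forward(self, x):
        features = [x]
        for layer in self.layers:
            out = layer(torch.cat(features, dim=1))  # input = all previous feature maps
            features.append(out)                     # stack the new feature map
        return torch.cat(features, dim=1)

block = DenseBlock(16)
print(block(torch.randn(1, 16, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])
```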

4. ICA Based on Improved Convolutional Neural Network Multilabel Learning Based on Big Data

4.1. Algorithm Comparison

(1) Experimental comparisons are performed on the ImageNet image database. The ImageNet database has more than 15 million images, with at least 510 images in each category. In this experiment, a total of 5000 images from ten categories are selected for verification, and the images are preprocessed to a uniform size. The recognition rate of each algorithm for different numbers of samples is shown in Table 3 and Figure 3.

When the number of training samples (NTS) is 1500, which is smaller than the number of test samples, the recognition rates of CNN and SAE are extremely low, while the improved CNN algorithm still maintains a certain recognition rate. It can be seen that, regardless of the NTS, the recognition rate of the improved CNN algorithm is always higher than that of the other two algorithms. (2) Experimental comparison is carried out on the Oxford Flowers image library, which has 80 pictures for each flower type; this article selects five types. The recognition rate of each algorithm for different numbers of samples is shown in Table 4 and Figure 4; here the number of iterations is 40, NTS is 500, and the number of test samples is 100~500.

It can be seen from the chart that when NTS is 100, the recognition rates of CNN and SAE are extremely low, while the improved CNN algorithm still maintains a certain recognition rate. When NTS is small, the gap between the other two algorithms and the improved CNN algorithm is obvious, and as NTS increases, the gap gradually decreases. (3) The experimental data set in this section consists of clothing images of ten categories selected from e-commerce platforms such as Jingdong and Taobao, with 110 images per category, most of which have the commodity as the main subject. The selected images do not suffer from problems such as differing viewing angles, insufficient light, blur, or occlusion. The total number of pictures is 1100; the pictures have a similar average size and are in JPG format. For each category, images with large differences are selected as far as possible. The recognition rate of each algorithm for different numbers of samples is shown in Table 5 and Figure 5; here the number of iterations is 50, NTS is 1100, and the number of test samples is 110~1100.

It can be seen from the graph that the smaller the NTS, the more the improved CNN outperforms CNN and SAE. When NTS is 110, even with 50 iterations the NTS is too small and the recognition rates of the CNN and SAE algorithms are still very low, while the improved CNN algorithm can still maintain a certain recognition rate.

4.2. Experiment Analysis

(1) In order to verify the effectiveness of the improved convolutional network, an experimental comparison with other classic algorithms is also carried out. The classification results of each network structure on the CIFAR-10 and CIFAR-100 data sets are shown in Table 6 and Figure 6.

From the chart, it can be seen that the classification performance of ScatNet on data sets with complex details, such as CIFAR, is significantly lower than that of the hybrid convolutional network structure and other deep convolutional neural network structures. The improved CNN network structure achieves the highest accuracy of 91.47% on the CIFAR-10 data set and 84.26% on CIFAR-100. On the relatively large CIFAR-10 and CIFAR-100 data sets, the improved dense convolutional network and the three deep convolutional neural networks DCNN, VGG, and NIN all achieve good classification results, and the classification performance of the improved dense convolutional network is the best. The performance comparison of multiple algorithms is shown in Table 7 and Figure 7.

(2) The experimental results in the table show that the image classification performance of the multilabel algorithm based on the improved CNN is significantly better than that of the algorithms based on the BP neural network and the RBF neural network. Algorithms that consider the correlation between examples and labels classify better than the BOOST and SVM algorithms. From the overall comparison, the improved CNN algorithm in this paper has the best classification performance. (3) The highest classification accuracy of the improved convolutional network during training is recorded and plotted, as shown in Table 8 and Figure 8.

The detailed reference table for the type identification of each device is shown in Table 9.

It can be seen from the graph that the accuracy of the improved CNN is always higher than that of CNN, and the average highest classification accuracy of the improved CNN is calculated to be 90.63%. The algorithm performs well and can carry out image classification smoothly. From the above, it can be concluded that the improved CNN algorithm in this article is considerably better than the traditional CNN in accuracy, which also verifies the feasibility of the experiments in this article.

5. Conclusions

The main task of image classification is to analyze the content of digital images and obtain the key information in them. Many application fields place demands on image classification capability, such as automatic image annotation, content-based image retrieval, video surveillance, medical image processing, and robotics, so image classification technology has always been a research hotspot. Although a large number of efficient classification methods have emerged in recent years and continuously improved classification accuracy, it is still difficult to classify large amounts of data efficiently and accurately, and these problems need further study. This paper proposes an ICA based on multilabel learning with an improved convolutional neural network. The dense network uses dense connection technology to link its layers: the input of each layer contains all the preceding feature maps, the output is passed on to the subsequent layers of the network, and deep feature maps are stacked. Dense connections increase the diversity of the output and enable feature reuse, thereby improving recognition efficiency. The training data in this experiment are still too small, so the model performs well on the training samples but its generalization to the test data set is not very good.

Data Availability

No data were used to support this study.

Conflicts of Interest

The authors declare that there is no conflict of interest with any financial organizations regarding the material reported in this manuscript.