Abstract

There exist large numbers of methods/algorithms which can be used for the classification of aerobic images. While the current method is used to classify the aerobics image, it cannot effectively remove the noise in the aerobics image. The classification time is long, and there are problems of poor denoising effect and low classification efficiency. Therefore, the aerobics image classification algorithm based on the modal symmetry algorithm is proposed. The method of nonlocal mean filtering based on structural features is used to denoise the aerobics image, and the pyramid structure of the image is introduced to decompose the aerobics image. According to the denoising and decomposition results, the enhancement of aerobics image is realized by the logarithmic image processing (LIP) model and gradient sharpening method. Finally, the aerobics image after the enhancement is classified by a modal symmetry algorithm. Experimental results show that the proposed method has a good denoising effect and high classification efficiency, which shows that the algorithm has significant effectiveness and high application performance.

1. Introduction

Classification of the image is one of the main aspects of digital image processing. It starts in the late 1950s and has been extensively used in various fields like human-car tracking, geology, climate detection, monitoring of natural calamities, medical, communications, military, and other fields of engineering [14]. Digital image processing is an important branch of information technology, including image denoising, enhancement, restoration, segmentation, and feature extraction [5, 6]. With the development and popularization of computer technology, the demand for digital images in agriculture, animal husbandry, forestry, environment, military, industry, and medicine is also growing. It can be said that image processing technology has penetrated into all aspects of our lives [79]. At present, some people are in subhealth state, and more and more people pay attention to physical exercise. Aerobics is a relatively simple and popular sport, which is widely concerned. In the process of aerobics teaching, decomposition teaching is essential, and the performance of decomposition action image recognition has a direct impact on the effect of aerobics teaching. Therefore, the aerobics image classification method has become the focus of current research. At present, there are some problems in aerobics image classification, such as poor denoising effect and low classification efficiency, so it is necessary to study the aerobics image classification method [10, 11].

Zhao and Feng put forward the aerobics image classification algorithm based on sparse coding and spatial pyramid matching. This algorithm sparse encodes the SIFT features of aerobics image, instead of the traditional vector quantization method, which can effectively reduce the quantization error and build a more accurate image representation. Then, combined with the spatial pyramid matching algorithm, the linear classifier is used for aerobics image classification and recognition. The algorithm takes a long time to sparse code SIFT features, resulting in a long time to classify, so there is a problem of low classification efficiency [12]. Shen et al. [13] put forward the aerobics image classification algorithm based on edge detection. This algorithm preprocesses the aerobics image, including image denoising and image enhancement. The region of interest in the aerobics image is obtained by the edge detection method, and the features of the detected region of interest are extracted by a gray level cooccurrence matrix. For the extracted features, a support vector machine is used for classification. After denoising, the image signal frequency fluctuates greatly, which indicates that the denoising effect of the algorithm is poor. Lu et al. [14] have proposed the aerobics image classification algorithm based on feature optimization. This algorithm carries on the threshold segmentation, the morphological filtering, and the edge tracking to the collected aerobics image to obtain the complete color back image. At the same time, based on the image set to extract the texture features, combined with the F-score feature evaluation method, the feature parameters are selected as the optimal feature subset, which is used as the input feature value of the BP neural network to realize the classification of aerobics image. This algorithm takes a long time to screen the feature parameters and has the problem of low classification efficiency. Zhao et al. [15] put forward the aerobics image classification algorithm based on the adaptive soft distribution of synonyms and chi-square model. The algorithm uses the probability latent semantic analysis model to analyze the semantic symbiosis probability of visual words in the image, excavates the hidden semantic subject of the image, and then obtains the probability distribution of the semantic subject on a visual word. The K-L divergence is introduced to measure the semantic correlation between visual words and acquire semantic related synonyms. The soft mapping between SIFT feature points and some semantic related synonyms is realized by combining the adaptive soft allocation strategy. The chi-square model is used to filter out “visual stop words” and reconstruct the histogram of visual word distribution and an SVM classifier is used to complete the target classification. This algorithm cannot effectively remove the noise in the aerobics image, and there is a problem of poor denoising effect. So based on the literature, it is found that most of the available methods cannot effectively remove the noise in the aerobics image. The classification time is long, and there are problems of poor denoising effect and low classification efficiency. Therefore, the aerobics image classification algorithm based on the modal symmetry algorithm is proposed. The results of the proposed model are better than the other currently available methods.

To sum up, this paper puts forward the aerobics image classification algorithm based on the modal symmetry algorithm. The specific process is as follows:(1)The aerobics image is denoised(2)The aerobics image then undergoes enhancement processing(3)The classification of aerobics sports image is realized by modal symmetry algorithm(4)Experiments and discussion: the overall effectiveness of the aerobics image classification algorithm based on modal symmetry algorithm is verified in two aspects of denoising effect and classification efficiency(5)Conclusions

2. Preprocessing of Aerobics Images

2.1. Nonlocal Mean Filtering Based on Structural Features

The basic features of gray image mainly include gray value, edge, texture, geometry, and spatial relationship of image. The target features extracted by the aerobics image classification algorithm based on the modal symmetry algorithm include the local edge, shape, special point, and special line of the pixel which represent the local structure of the image. To some extent, these local features can reflect the most important and essential feature information in the image [16, 17].

The denoising process of aerobics image classification algorithm based on modal symmetry algorithm is as follows:(1)Input noise image and reduce its sensitivity to noise with Gaussian filter.(2)Each pixel block is represented by a binary descriptor.In order to facilitate the calculation, a set of specific neighborhood pixels is selected to represent the feature information of the central block, measure the similarity, and screen the feature similar blocks. Local block matching can also be measured by the distance between the central block and the selected reference block in its neighborhood by using the following equation:Before the start, the image is smoothed by the Gaussian filter to reduce its sensitivity to noise. and represent the pixel values corresponding to the center block and the neighborhood block in the smoothed image. The neighborhood block used to represent the feature information is preselected according to the experimental data, and is the -the preselected neighborhood block. To facilitate calculation, if the local distance is less than a certain threshold , it means that the central pixel block is relatively flat and has no obvious structural features, and then, its binary string is directly set to 0 as given inwhere represents the maximum value in . If is greater than , it means that the central pixel block contains local image feature information; then its binary string is defined as given inwhere represents the -th bit in the string. The local distance is less than the local distance of the adjacent two pixel blocks; that is, the center point of the block that is more like the center block than the adjacent two pixel blocks is recorded as 1; the rest is 0, where to are considered as an annular. From the above definition, it can be concluded that the local binary descriptor is a 16-bit binary code and can better describe the local characteristics of pixels [18]. The binary descriptors of local similar blocks can be distinguished from other nonsimilar blocks quickly.(3)Generate corresponding binary descriptor template for each binary descriptor to match. Binary descriptors can effectively improve the matching efficiency of similar blocks by comparing the binary codes ( and ) of central pixel block and neighboring pixel block. However, in order to improve the denoising effect, more similar blocks should be selected to avoid information loss. The aerobics image classification algorithm based on the modal symmetry algorithm adopts a new local binary feature descriptor template to select more similar blocks through logical operation [19]. Let the coding of the central pixel block be , and let the coding of the two preselected similar blocks located in and be and , respectively. In order to retain more feature information, the coding of the original central pixel block is extended so that the left and right binary numbers with the coding median value of 1 are also 1, and the descriptor is obtained by using  After that, the descriptor template and its coding of neighborhood pixel block are made bit operation. If the result is equal to the neighborhood coding, it is regarded as similar to the central pixel block; otherwise, it is not similar and can be represented as follows [20]:(4)For a pixel block whose binary descriptor is greater than 0, it means that the region contains characteristic information. Then,(1)Features and gray similar blocks are selected by the following prescreening equation:(2)The corresponding weights of similar blocks can be calculated by the following equation:where is the Euclidean distance between the center pixel block and the preselected similar block . It can be calculated usingwhere represents the size of the pixel block, and the thresholds and are defined by the following equation:

2.2. Image Enhancement
2.2.1. Image Pyramid

For an image with , 1/2 subsampling in two directions can obtain a 1/4 thumbnail of the original image. Through continuous subsampling, we can get a series of pyramid structures composed of subimages of different sizes, and each layer of which has different sizes and resolutions. The size of layer 0 is the largest and the resolution is the highest. With the movement to the upper layer of the pyramid, the size and resolution of the image decrease until layer [21].

In the process of building the pyramid, from the later level to the former level, the resolution of the row and column is reduced by factor 2, and the size of the image is reduced accordingly. For a complete two-dimensional image pyramid with layers in total, the total number of elements is given byThe low-scale level mainly shows the contour information of the image, while the high-scale level reflects the detail characteristics of the image [22].

Only depending on subsampling to reduce the size of the image, we often lose a lot of information. According to the sampling theorem, it is necessary to make all the images obtained by sampling at the shortest wavelength less than 1/4 pass the smoothing filter to eliminate the distortion. From the perspective of scale space, reducing image size needs to be synchronized with appropriate image smoothing filtering. If smoothing and subsampling are repeated, subimages with different resolutions can be obtained [23, 24]. The image pyramid can be formed by sorting the obtained subimages.

Taking the Gaussian pyramid as an example, the Gaussian low-pass smoothing filtering process can be expressed as and can be found by

That is to say, the two-dimensional image is convoluted with the Gaussian filter to obtain the smoothed image , where the expression of the Gaussian filter is as follows:

The scale parameter is the bandwidth of the Gaussian filter. Using Gaussian smoothing filter, the single operation of generating Gaussian pyramid subimage can be expressed as follows:

That is to say, the -th layer Gaussian image is used to calculate the -th layer Gaussian image , where represents the Gaussian smoothing operator and its subscript represents subsampling with a sampling rate of 2. The bottom layer of the pyramid is the original image, and the highest layer corresponds to the thickest size.

2.2.2. Enhancement Algorithm of the LIP Model

The enhancement algorithm based on the LIP model simplifies the gray-scale image by using the unified complement transformation. The simplified processing algorithm of gray-scale function is expressed in the form ofwhere represents the output image after the normalization and complement transformation, and the value range of the gray function is defined in interval for nonlinear transformation as expressed inwhere and are input and output pixel gray values, and are real numbers respectively, and the expression of is as follows:

When , the nonlinear transformation process can expand the dynamic range of the dark area of the image. When , the process can expand the dynamic range of the bright area of the image. When , the difference between the mean value of the central pixel and the surrounding pixel is magnified nonlinearly. The larger the value is, the greater the edge enhancement of the image is.

2.2.3. Gradient Sharpening

Gradient algorithm is used to sharpen image edge information. The gradient of the function at a point is defined as a vector as

That is, the direction of the gradient is in the direction of the maximum change rate of function , and the magnitude of the gradient is defined as given by

A threshold is introduced to determine whether to process the gray value of a pixel, which is given by

The gradient value between the background and the object does not change much. The obvious change of gray level is mainly reflected in the junction of the object and the background, that is, the edge of the image. By setting the threshold value, of the image edge’s gray value is greater than the threshold value, which makes the pixel point bright and highlights the edge; for the same pixel gray value is not greater than the threshold value, it is smoothed [25, 26].

3. Aerobics Image Classification Algorithm Based on Modal Symmetry Algorithm

The aerobics image classification algorithm based on the modal symmetry algorithm realizes the aerobics image classification through the modal symmetry algorithm. The specific steps are as follows.

3.1. Establishing “Visual Vocabulary”

Supposing that we train a MIL classifier for a scene , let denote the labeled image set, where denotes the image belonging to scene , −1 denotes the image not belonging to scene , and denotes the unlabeled image set. If any image is divided into regions, the corresponding visual feature vector of each region is recorded as , and represents the dimension of the visual feature vector, then is the MIL training package, and is the example in the package. All the examples of all the images in are put together, called the example set, and record it as and given bywhere is the total number of examples, and its calculation formula is as follows:

The aerobics image classification algorithm based on modal symmetry algorithm uses -Means method to gather the elements in into class. Because each clustering center usually represents a group of image areas with the same visual characteristics, it is called a “visual word” [27], and it is recorded as , and these “visual words” are called “visual vocabulary” and recorded as .

3.2. Constructing Fuzzy “Word-Document” Matrix

In order to obtain the latent semantic model of the image by LSA method, according to the principle of minimum Euclidean distance, the number of different “visual words” appearing in the multiple sample packets is counted; that is, the multiple sample packets are represented by word frequency vector [28]. Let the word frequency vector of the multiple example package be as given in where represents the number of occurrences of the -th “visual word” in . In terms of word frequency statistics, the traditional method is as follows: if the example is closest to the Euclidean distance of “visual word” , then add 1 to the -th component value of word frequency vector, i.e., , and there is irrationality in the way of word frequency statistics. Let and represent two different “visual words” and , , , and represent four different examples. From Figure 1, it can be seen intuitively that the confidence of is higher than that of belonging to , while the distance between and and is the same, and there is ambiguity in whether belongs to or . Traditional word frequency statistical methods do not consider these differences and ambiguity [29].

To solve this problem, the fuzzy membership function is defined according to the Euclidean distance between example X and “visual word” . It can be said that each example in the multiexample package belongs to all the “visual words” at the same time; only according to the distance, the degree of membership is different. Through the above analysis, the fuzzy word frequency vector defined by the aerobics image classification algorithm based on the modal symmetry algorithm can be represented bywhere represents the -th example in the multisample package and represents the number of examples in . It can be seen from the above formula that the value of the -th component of the fuzzy word frequency vector is determined by the sum of the fuzzy membership degrees between all the examples in and [30].

In order to highlight the importance of different “visual words” in image classification, the fuzzy word frequency vector is weighted by word frequency-anti-document frequency, namely, where represents the fuzzy frequency of containing “visual word” ; represents the number of multiple sample packets containing “visual word” in the training set; represents the total number of all multiple sample packets in the training set.

In order to control the change range of within the same range, normalization is carried out, as given by

The weighted and normalized word frequency vector is recorded as . By arranging the fuzzy word frequency vectors corresponding to all multiple example packages in the training set, we can get the fuzzy “word-file” knowledge matrix, which is recorded as given bywhere each row of corresponds to a “visual word” and each column corresponds to a multiple sample package.

3.3. Fuzzy Latent Semantic Features

As a natural language processing method, LSA’s core idea is to build a potential semantic space through truncated singular value decomposition and project words and documents to various dimensions representing potential semantics, so that the potential semantic relationship between words can be obtained, and the related documents can obtain the same vector representation even if they do not use the same words [31, 32]. According to the singular value decomposition theorem, the matrix of “word-file” can be decomposed into three matrix products, as given bywhere is the dimension of the original feature space, , and are left and right singular vector matrices corresponding to the singular value of matrix , respectively, and and are diagonal matrices which arrange the singular value of matrix in descending order. If only the first singular values in and the first columns of and are taken, that is, , , and , then the best approximation of matrix in the sense of -order least squares can be obtained, as given by

In general, the above formula is called truncated singular value decomposition. In this way, the matrix after dimension reduction of can be obtained; that is, . Here, each column in is the fuzzy latent semantic feature of the corresponding package in the training set, which is reduced from the original dimension to the dimension. Let be the normalized fuzzy word frequency vector of any new multiexample package , and its fuzzy latent semantic features are as given by

This is derived from , where the space formed by column vectors in is called fuzzy latent semantic space, which can be regarded as the compression of the original vector space. column vectors in are the basis of fuzzy latent semantic space.

3.4. Training TSVM Classifier

is initialized, is as labeled package, and its fuzzy latent semantic feature is calculated; then is added to TS, where is the label of package . If is an unmarked package, its fuzzy latent semantic feature is calculated. is added to TS, where the label of the unmarked package is 0.

The TS is used to solve the following optimization problem, and the class label and the TSVM classifier of the unlabeled image set are obtained as given bywhere is the total number of unlabeled samples; is the loss function, usually ; ; is the label assigned to unlabeled samples in the optimization process; is the proportion of the number of samples to be marked as positive in the total number of unlabeled samples; and the control parameter is used to adjust the balance between algorithm complexity and loss function.

Through the above process, the classification of aerobics sports image is achieved.

4. Experiment and Discussion

In order to verify the overall effectiveness of the aerobics image classification algorithm based on the modal symmetry algorithm, it is necessary to test the aerobics image classification algorithm based on the modal symmetry algorithm. This test is completed in the MATLAB platform. The aerobics image classification algorithm based on the modal symmetry algorithm (algorithm 1), the aerobics image classification algorithm based on the sparse coding space pyramid matching (algorithm 2), the aerobics image classification algorithm based on edge detection (algorithm 3), and the aerobics image classification algorithm based on feature optimization (algorithm 4) are used to test. The above algorithms are used to denoise the aerobics image, and the signal frequency of the image before and after denoising is compared with four different methods. The test results are as represented in Figure 2.

Figure 2 represents the comparison of the image signal frequency before and after denoising by four different algorithms. It can be seen that the signal frequency after denoising of aerobics image classification algorithm based on modal symmetry algorithm is smaller than that before denoising for all the iterations. The more the denoising effect is, the more suitable the algorithm is to achieve a stable frequency. The signal frequency tends to be stable, indicating that the aerobics image classification algorithm based on modal symmetry algorithm has a good denoising effect because the algorithm uses a new local binary feature descriptor template to filter out more similar blocks through logical operation to avoid information loss and improve the denoising effect.

The aerobics image classification algorithm based on modal symmetry algorithm (algorithm 1), the aerobics image classification algorithm based on sparse coding space pyramid matching (algorithm 2), the aerobics image classification algorithm based on edge detection (algorithm 3), and the aerobics image classification algorithm based on feature optimization (algorithm 4) are tested, respectively, the classification times of four different algorithms are compared, and the test results are as follows.

The analysis of Figure 3 shows that the classification time of aerobics image classification algorithm based on modal symmetry algorithm in multiple iterations is lower than that of aerobics image classification algorithm based on sparse coding space pyramid matching, the aerobics image classification algorithm based on edge detection, and the aerobics image classification algorithm based on feature optimization because the aerobics image classification algorithm based on the modal symmetry algorithm enhances the aerobics image between the classification, improves the significance of the image characteristics, shortens the time of classifying the aerobics image, and improves the classification efficiency of the aerobics image classification algorithm based on the modal symmetry algorithm.

5. Analysis

There are a number of methods and algorithms to classify the aerobic images. Most of them are slow for detection. So, it is necessary to test the aerobics image classification algorithm based on the modal symmetry algorithm. The aerobics image classification algorithm based on the modal symmetry algorithm (algorithm 1), the aerobics image classification algorithm based on the sparse coding space pyramid matching (algorithm 2), the aerobics image classification algorithm based on edge detection (algorithm 3), and the aerobics image classification algorithm based on feature optimization (algorithm 4) are used to test. The test results represented in Figure 2 indicate that the proposed method has the best denoising effect as compared to the other three methods. Also, Figure 3 shows that the proposed method has the least processing time among the four methods to denoise the aerobic images.

6. Conclusions

In order to classify and recognize the aerobics image, it is necessary to study the aerobics image classification algorithm. At present, the aerobics image classification algorithm has the problems of poor denoising effect and low classification efficiency. The aerobics image classification algorithm based on the modal symmetry algorithm is proposed, which can effectively remove the noise in the aerobics image in a short time, complete the classification of the aerobics image accurately, solve the problems in the current method, and provide the relevant information for the aerobics image recognition. The results presented in the results and discussion section clearly indicated that the proposed algorithm has a more denoising effect, as compared to the other methods. Also, for the proposed method, the time taken for processing data is the minimum. Hence, it is concluded that the proposed method is better than the other three methods.

Data Availability

Data will be made available on request to the corresponding author.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.