Abstract

The traditional teaching model cannot adapt to the teaching needs of the era of smart teaching. Based on this, this paper combines data mining technology to carry out teaching reforms, constructs a computer-aided system based on data mining, and constructs teaching system functions based on actual conditions. The constructed system can carry out multisubject teaching. Moreover, this paper uses a data mining system to mine teaching resources and uses spectral clustering methods to integrate multiple teaching resources to improve the practicability of data mining algorithms. In addition, this paper combines digital technology to deal with teaching resources. Finally, after building the system, this paper designs experiments to verify the performance of the system. From the research results, it can be seen that the system constructed in this paper has certain teaching and practical effects, and it can be applied to a larger teaching scope in subsequent research.

1. Introduction

With the continuous development and popularization of information technology, comprehensive informatization has become the inevitable development direction of this era. This is especially true in the field of education, and digital education makes teaching content more reliable and has a uniform standard [1]. From the recording and broadcasting online school in the late 1990s to the higher education resource sharing platform, “Love Course,” which provides complete video resources, to the “China University MOOC” online open course class that integrates teaching, Q&A, testing, and homework teaching, to the “NetEase Cloud Classroom” that can realize real-time interactive online live teaching [2], modern education is closely related to the information age, and building a teaching platform based on information technology is the direction of reform of teaching informatization in major colleges and universities [3].

Teaching decisions are the process of analyzing, judging, exploring, and choosing teaching implementation plans in order to achieve teaching objectives. Teaching is a controlled dynamic complex system, to achieve effective control of system motion, requires timely confirmation of various elements and its interaction relationships in the system, and makes decisions according to the corresponding teaching principles. Data is an important basis for teaching decisions. The smart teaching platform provides data on the status of the learner but is currently only an evaluation method for students. In traditional teaching activities, students are often collected in the forms of students or exams due to the collection and analysis of students’ data, and the teaching behavior of teachers has a great impact. With today’s scientific and technical assistance, the extraction of teaching data is no longer a problem, but the use of teaching data is used in teaching evaluation, teaching management, statistical attendance, etc., and the teaching of teachers is not much. In summary, in the environment of education big data, how to help teachers give full play to the teaching process and how to use the data in the smart teaching platform to optimize the teaching decisions are the core issues of the research in education.

This paper applies data mining technology to computer-assisted teaching to build a data mining-based smart teaching system and improves the traditional teaching mode to improve the teaching effect.

Regarding the characteristics and value of smart classrooms, literature [4] believed that mobile terminal-based interaction can help improve children’s developmental learning and social skills. Literature [5] believed that with the help of wireless smart devices, learners’ participation in classroom learning can be improved. Literature [6] believed that the teaching terminal based on the smart classroom can clarify the geographical location and learning progress of the students, determine the current teaching activities of students, recommend learning resources according to the needs of students, and support effective real-time collaboration and resource sharing between teachers and students, as well as students and students. Literature [7] believed that a smart classroom is a new type of classroom, and the fundamental goal is to cultivate students’ wisdom. The smart classroom focuses not only on the knowledge level of the learner and the scores obtained in the exam but also on stimulating the potential of students and focusing on cultivating students’ wisdom. Literature [8] believed that the cultivation of wisdom should exist throughout the entire classroom teaching process and use experience and accumulated thinking experience to enhance wisdom so as to achieve the ultimate goal of using wisdom to solve problems. Literature [9] reshaped and upgraded the flipped classroom and proposed a breakthrough from a flipped classroom to a smart classroom in terms of resource quality improvement and teaching method optimization.

As a probe into the teaching mode of the theoretical level in a wisdom classroom, Literature [10] believes that learners can personalize and autonomously learn from their own rhythm in wisdom class. Document [11] is a research on the autonomous constructive processing and treatment of learners in the smart classroom. Literature [12] believed that wisdom learning is in a contextual environment, providing students with a wide range of learning resources for students, and promotes new learning paradigms of education development. In order to achieve smart learning, literature [13] designed a smart learning system model including cloud computing, learning analysis technology, and mixed reality, reflecting the three major characteristics in the process of education, interactive, personalized custom, and independent control. Document [14] shows that on the basis of studying the intelligence learning connotation, the conceptual framework of wisdom learning is built, and four wisdom learning models are designed. Literature [15] believed that the technical characteristics of the wisdom class have designed a learning model based on the wisdom class, and the application research of the learning model is designed. Literature [16] gave a “three-section ten-step” structural model of the wisdom class through the comparison with traditional classroom teaching processes. As a research on wisdom classroom teaching practice, Literature [17] reshaped and upgraded the wisdom classroom learning environment from hardware and software, thereby solving the failure of teachers and students in LMS (Learning Management Services). Document [18] shows that based on the ITLA (Integrated Teaching and Learning Assistance) system, it studied important factors that determine the effective development of the wisdom class. Literature [19] is supported by the HiTeach Interactive System, compared to traditional classrooms and wisdom classroom teaching, exploring the positive significance of teaching in teaching. About wisdom classroom teaching evaluation, Literature [20] shows the response to the teaching strategy of the wisdom classroom teaching under the network learning space while presenting the teaching strategy of the teacher’s teaching behavior and the student’s learning behavior for teaching evaluation.

3. Spectral Clustering-Aided Teaching Algorithm

Spectral clustering is a very popular research field in cluster analysis. Its main idea is to obtain a graph cut through the feature decomposition of the graph (Laplacian matrix). It is a clustering method based on a graph cut. This research mainly introduces the spectral graph theory, the graph partition method, the spectral clustering algorithm of spectral clustering, and the problems existing in spectral clustering at present.

We first give a set , , which contains data points. We assume that each data sample is regarded as a vertex in the graph, and the edge between the vertices is assigned a weight value according to the similarity between the samples so that an undirected weighted graph based on the sample similarity is obtained. It can be defined with the following formula [21]:

Among them, is the similarity value between points and , and is the Euclidean distance between points and , and is the scale parameter, which controls the speed at which the similarity value decays with the Euclidean distance .

The clustering problem can be expressed as a cut problem on the graph. The result of the cut is as far as possible to minimize the similarity between the two subgraphs and maximize the internal similarity. The quality of the clustering results is directly related to the quality of the cut criteria. As shown in Figure 1, through a cut strategy, H can be divided into one category, and all other points can be divided into another category. Alternatively, another classification strategy can be used to classify H, A, B, and C into one category and D, E, G, and F into another category. Obviously, the second cut strategy can divide undirected graphs into two more balanced categories. Common cut criteria include the minimum cut criterion, the normalized cut criterion, the ratio cut criterion, and the average cut criterion. (1)Minimum cut set criterion

Now, we first consider the simplest case: two-way cut. The undirected weighted graph is divided into two subsets and that do not want to intersect, where . The easiest way is to take the method of minimizing the cut to produce two disjoint subsets. Cutting is defined as [22]

is the similarity between point and point . When the data point is in Euclidean space , a reasonable default candidate similarity function is the Gaussian similarity function, which is defined as follows:

We used a clustering algorithm based on the graph theory to solve the segmentation problem, and we obtained very good segmentation results in the experiment. But at the same time, the minimum cut criterion is easy to produce unbalanced results: one type contains most data points, while the other type contains only a few data points or even only one data point. This is because the minimum cut does not consider the size of the cluster. In order to solve this problem, we proposed the normative cut set criterion. The experimental results show that this cutting method can obtain relatively balanced clustering results. (2)Normative cut set criterion (Ncut)

The normative cut set criterion focuses on the data in the global scope, not only the local solution of the dataset. The Ncut algorithm has two-way partition and multiway partition. The two-way cut only uses the eigenvector corresponding to the second smallest eigenvalue of the Laplacian matrix for segmentation. However, the multipath cut uses eigenvectors starting from the eigenvector corresponding to the second smallest eigenvalue to cluster together, where is a predetermined constant. For the two-partition problem of cutting into two regions, and , the objective function of the Ncut’s optimized graph partition is shown in the following formula:

Among them, is the minimum cut set shown above, and is the total number of connections from all data points in to all data points in graph . From the criterion function, it can be seen that the normalized cut set criterion considers the separation state of the two subsets, so it can avoid unbalanced clustering results.

At the same time, Shi and Malik proposed another standard to measure the total normalized association criterion for a given divided area. Its formula is as follows:

Among them, and are the total weights of the edges connecting the nodes in and , respectively. By inference, the relationship between these two division standards is . Obviously, in order to achieve a better clustering effect, it is necessary to minimize the noncorrelated function and maximize the correlation function at the same time.

The above criteria are all two-way division methods for undirected graphs. We proposed a method to divide the graph into subgraphs. The objective function of the -dimensional normalized cut set criterion can be written as follows:

Among them, is the number of data in dataset . is the complement of dataset . When , multichannel division is equal to two-channel division. (3)Average cut set criterion (Avcut)

The objective function of the average cut set criterion is as follows:

It can be seen from the formulas of the normative cut set criterion and the average cut set criterion that these two formulas both express the relationship between the boundary loss and the correlation of the divided regions in the undirected weighted graph in the form of the sum of ratios. This shows that the objective function of the minimum scale cut set criterion and the normalized cut set criterion can be cut more accurately. Their common disadvantage is that it is easy to segment very small subgraphs containing only a few points, and they are undersegmented. It can be seen from the experimental results in the literature that the cut result of the normative cut set criterion is better than that of the average cut set criterion. (4)Ratio cut set criterion (Rcut)

The formula of the ratio cut set criterion function is as follows:

Among them, and represent the number of vertices of subgraphs and , respectively.

The advantage of this criterion is that when minimizing the criterion function, only the minimum similarity between classes needs to be considered, which reduces the possibility of oversegmentation, but the disadvantage is that the operating efficiency is too low. (5)Minimum-maximum cut set criterion (Mcut)

The min-max cut criterion maximizes and while minimizing . The objective function of this criterion is as follows:

Among them, and are the total weights of the edges connecting the nodes in and , respectively. By minimizing the criterion function, undersegmentation or only segmentation of smaller subgraphs containing a few vertices can be avoided. Therefore, by minimizing the minimum-maximum cut set criterion function, a more balanced cut set can be obtained, but the realization speed is relatively slow. Both the minimum-maximum cut set criterion and the normative cut set criterion can satisfy the clustering principle that the similarity within a class is small, but the similarity between classes is large. The difference is that when the overlap between classes is too large, the cut effect of the normative cut set criterion is not as good as the minimum-maximum cut set criterion.

The adjacency matrix (denoted as or ) is also called the similarity matrix, and the Laplacian matrix (denoted as ) is a common representation of graphs. The similarity matrix of the weighted graph uses real numbers to reflect the different relationships between the vertices. The elements in this matrix can be expressed by the following formula:

Among them, is the -th sample point in the dataset, and is the distance between the sample point and the sample point . The distance can use any form of distance function, and the more commonly used is the Euclidean distance . In the formula, is the nuclear radius, which is a parameter that needs to be given in advance. In the clustering algorithm, the attenuation rate is constrained by the parameter , so a proper value of must be given to improve the clustering accuracy of the algorithm. The row vectors in represent the distribution of the dataset, and they are usually distributed on the hypersphere of the -dimensional space. Researchers usually use degrees to represent the distribution of the dataset around the point, and the diagonal matrix composed of all the degree values as diagonal elements is the degree matrix, which is usually represented by :

Among them, is the number of sample points.

The Laplacian matrix is , where is the degree matrix and is the similarity matrix. Most spectral clustering algorithms cut graphs based on the spectrum of the Laplacian matrix. Laplacian matrices can be divided into two types: nonnormalized Laplacian matrix and normalized Laplacian matrix. The normalized Laplacian matrix includes a symmetric form (denoted as ) and a random walk form (denoted as ).

There are many spectral clustering algorithms. The difference lies in how to choose the object function and how to construct the affinity matrix of the graph, but the basic framework is the same.

Step 1. According to the given dataset, the algorithm constructs a graph matrix, and there are different methods for different situations.

Step 2. The algorithm solves the first eigenvectors of the matrix and constructs the eigenspace .

Typical spectral clustering algorithms such as the Shi and Marik algorithm; Kan R, Vempala S, and Vetta A algorithm; Ng, Jordan, and Weiss algorithm; link algorithm; and Markov random walk algorithm have achieved the expected application effect. (1)SM algorithm

The normative cut set criterion is a very popular technique in spectral clustering, and it has achieved good results in the application of image segmentation. The normative cut set criterion can be described as follows:

This is an NP-hard problem. Fortunately, the problem can be solved by relaxing the discrete constraint of . We assume that is a real value, the objective function can be solved by the Rayleigh quotient, and this problem can be transformed into the second smallest eigenvalue of the solving formula . The SM algorithm can be described as follows.

Step 1. The algorithm first constructs the similarity matrix and calculates the Laplacian matrix according to the formula .

Step 2. The algorithm calculates the first -generated eigenvalues and obtains the corresponding eigenvector according to the formula .

Step 3. The algorithm uses the eigenvectors calculated in the second step to construct a matrix .

Step 4. The algorithm uses the second smallest eigenvalue and the corresponding Fiedler vector to find the cutting point to cut the graph through the Fiedler vector so that Ncut is the smallest. In the Fiedler vector, the larger than this point is divided into one category, and the smaller than this point is divided into another category. The normalized adjacency matrix defined by the algorithm is as follows:

Step 5. The algorithm uses the -means algorithm to cluster the matrix into categories .

The computational complexity of solving the eigenvalue problem of all eigenvectors is , where is the number of input sample sets. (2)NJW algorithm

The above SM algorithm only uses Fiedler vectors, but the NJW algorithm uses k feature vectors at the same time. Because when calculating the k-way partition, using more feature vectors will achieve better results. The NJW algorithm can be described as follows:

Input: data points

Output: cluster , where (3)Markov random walk algorithm

Step 1. The algorithm constructs the similarity matrix , the diagonal matrix , and the Laplacian matrix through the formula .

Step 2. The algorithm calculates the first eigenvectors of the Laplacian matrix .

Step 3. The algorithm uses the eigenvector calculated in the second step to construct a matrix .

Step 4. The algorithm forms a normalized matrix by normalizing the rows to norm 1, where .

Step 5. The algorithm makes a vector corresponding to the -th row of the normalized matrix .

Step 6. The algorithm uses typical clustering algorithms such as the -means algorithm to cluster the abovementioned matrix into classes. By repeatedly executing the NJW algorithm, the scale parameter , which measures the similarity between sample points, can be obtained, but this increases the time used by the algorithm.

Another point of view in spectral clustering is the Markov random walk algorithm. It uses a probability model to obtain the spectrum method, and the random walk on the spectrum is considered to be a random jump from one node to another node. Spectral clustering can be understood as looking for such a graph partitioning; that is, the random walk stays in the same class for a long time and rarely stays in another class. From this point of view, random walk and graph partitioning have the same idea.

The random transition matrix can be obtained from the normalized similarity matrix , so the sum of each row of the matrix is 1. Among them, is the probability from point to point . The Markov random walk algorithm is basically the same as the NJW algorithm. The Markov random walk algorithm has achieved good results and can automatically determine the clustering value.

In addition to the above SC algorithm, there are many other SC algorithms. Perona and Freeman proposed the PF algorithm. Scott et al. proposed the SLH algorithm, and WISS combined the SLH algorithm and the SM algorithm to propose a new algorithm. Ding et al. proposed a new division criterion Mcut. In addition, Meila et al. proposed a new spectral clustering algorithm called the MS algorithm under the framework of the Markov random walk.

(1) The Choice of the Laplacian Matrix. In the spectral clustering algorithm, it is very important to construct the similarity graph matrix. After the number of data samples is given, the most commonly used formula for constructing similarity graphs and selecting the Laplacian matrix is as follows:

In this paper, some commonly used Laplacian construct methods have been collected, but how to select the Laplacian matrix is uncertain.

(2) Selection of Parameters. The spectrum clustering is uncertain when constructing similar matrices, and the determination of 8 is often necessary to obtain the uncertainty of clustering results according to the experience of the researcher and multiple attempts. So the choice of parameters is an important research direction.

(3) Determination of the Number of Clusters. The choice of clustering directly affects the cluster results. Now, the current spectrum clustering research does not give a strategy for the number of choices of clusters, which is also a more important research direction in cluster research.

(4) Differential Problems with Uneven Distribution of Density Distribution. The existing spectrum clustering method is still unable to obtain a good clustering effect for the density distribution.

4. Computer-Aided Teaching System Based on Data Mining

After the SMART conceptual model was put forward, this paper designs the “I-SMART” model from the aspect of hardware and software configuration, which provides framework support for the construction of smart classrooms (Figure 2).

Under the guidance of the activity theory, based on the above analysis of teaching elements and types of teaching interaction, an interactive teaching model in a smart classroom environment is proposed (Figure 3).

There are nine modules in the teaching platform: homepage, statistics, classroom activities, homework, examination, discussion, information, notification, and management. Users can directly access teaching data. The various types of data that can be read by the teacher after being automatically collected by the platform and analyzed by the model are summarized in Figure 4.

Through the correlation analysis of multiple factors in the S-P table, it can provide a clear direction for teaching decision-making. S-line tomography is used to analyze the types of students. Using the Yio-CSi (Student Positive Answer Rate-Student Attention Coefficient) model (Figure 5), we can find the students with the best and stable grades, the students with the worst and stable grades, and the students with unstable grades that cannot be found in the general statistical description. Using the Yjo-CPj (Question Positive Answer Rate-Question Attention Coefficient) model (Figure 6), it is possible to find knowledge points that students understand and are reliable, knowledge points that students seem to understand but are unreliable, knowledge points that are not understood by students, and knowledge points that are not understood but have accidental factors. Based on this, we make scientific decisions on the selection and application of teaching media.

This research is based on the smart teaching platform environment and attaches importance to the value of the data captured by the platform in the process of monitoring student behavior and status. Therefore, it is considered to incorporate the data that can be used for decision-making into the decision model to form a teaching decision model based on the data of the smart teaching platform, as shown in Figure 7.

The users of the education platform are not only students but also teachers, so we have to think deeply about teachers’ needs for online education platforms. We can further analyze the teacher’s demand for the ergonomic experimental teaching platform in two aspects. Among them, one is the demand between the platform and the teacher, and the other is the demand between the teacher and the student, as shown in Figure 8.

The strategic layer of the platform is shown in Figure 9. User goals refer to what kind of needs the users hope to meet through the product. The main users of the platform are students. For students, the most important goal of this platform is to help students improve their learning effects. On the one hand, it shows that students can learn and apply knowledge faster through this platform, and on the other hand, it shows that students can quickly learn how to use this platform, that is, the ease of learning of the platform.

5. Performance Test of the Computer-Aided Teaching System Based on Data Mining

The above constructs a computer-aided teaching system based on data mining. After constructing the system, the performance of the system is verified, and the system operation performance and system teaching effect are mainly studied. The research on its operating performance is mainly the effect of teaching resource mining. This paper obtains effective resources from the massive network teaching resources through the simulation system, tests multiple sets of statistical data, and evaluates the collected data through expert evaluation methods. The results are shown in Table 1 and Figure 10.

From the above research, we can see that the system constructed in this paper can effectively tap the required teaching resources. After that, this paper evaluates the teaching effect of this system through a small-scale teaching test, and the statistical results are shown in Table 2 and Figure 11.

It can be seen from the above experiments that the system constructed in this paper has certain teaching and practical effects, so it can expand the scope of teaching in follow-up research and conduct practical research from the perspective of multiple subjects and multiple audiences.

6. Conclusion

The smart education system based on data mining services is a typical current smart education. It realizes the intelligence, visualization, and high efficiency of classroom teaching, and its application conforms to the concept of building a strong education nation and is more in line with the specific requirements of educational modernization. With the continuous progress of cloud computing, artificial intelligence, big data, and other technical means, the service quality of the smart education system has also been upgraded, which provides favorable conditions for the safe and effective operation of the system. This paper combines data mining technology to build a computer-aided teaching system and builds the function of the teaching system based on the actual situation. The constructed system can carry out multisubject teaching. Moreover, this paper uses a data mining system to mine teaching resources and combines digital technology to process teaching resources. After constructing the system, this paper designs experiments to verify the performance of the system. From the research results, it can be seen that the system constructed in this paper has certain teaching and practical effects.

Data Availability

The labeled datasets used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no competing interests.

Acknowledgments

This study is sponsored by the following: (1) Research and Practice of Mixed Teaching Mode of Criminal Technology Courses Based on OBE Concept—take Speech Recognition and Appraisal Course as an example (No. 5-3 1017501)—and (2) application research of IPv6-based video transmission and control technology in teaching platform—Scientific research project (No. NGII20180124).