#### Abstract

In order to improve the visual display effect of the artistic conception of ancient poems, this paper combines AI technology to construct a multimedia-assisted artistic conception creation system of ancient poetry and combines probabilistic latent semantic analysis to analyze the semantic artistic conception of ancient poetry. Moreover, with the support of multimedia simulation technology, this paper improves the multimedia teaching mode to improve the effect of multimedia-assisted ancient poetry teaching. In addition, this paper constructs a multimedia-assisted artistic conception creation system of ancient poetry based on AI technology and designs experiments to conduct system performance research. The experimental research results show that the multimedia-assisted artistic conception creation system of ancient poetry proposed in this paper has a good teaching effect and has an important role in promoting the study of the artistic conception of ancient poetry.

#### 1. Introduction

It is difficult to understand the author’s thoughts and feelings and experience the beauty of the artistic conception of poetry. When teaching ancient poems to students, the teacher needs to combine the cognitive characteristics of different ages to make students fully understand the writing mentality and historical background of the poet during the creation period, so as to induce students’ interest in learning ancient poems. Therefore, before each study of an ancient poem, we will arrange a certain amount of spare time for students to go online and use network technology to find, view, and collect relevant information to understand the author’s profile, historical background, and corresponding introduction of the poem, so that students have a rough understanding of the poems they have learned and lay a certain teaching foundation.

In the current Chinese teaching in my country, whether it is before or after the release of the new curriculum standard reform, ancient poetry plays a very important role in Chinese teaching at the university level. The reason is that, compared with modern poems and essays, ancient poems have extraordinary significance. Ancient poetry contains a wealth of traditional culture, and through the study of ancient poetry, college students can effectively improve their language expression ability and understand the many situations of ancient society. The improvement that this brings to the students is an improvement in comprehensive literacy, not a unilateral improvement. Especially in recent years, college Chinese textbooks have continued to change, and the proportion of ancient poems in them is increasing, which also proves the importance of ancient poems in the current college Chinese teaching. Multimedia technology is an innovative teaching model that emerged after informatization has penetrated into teaching, which is very popular among students. In addition, it can effectively improve the quality of teaching and can also fully stimulate students' interest in learning. In the current college Chinese ancient poetry teaching, the use of multimedia technology will inevitably improve the teaching effect.

Although the ancient poems at the university stage are relatively simple ancient poems, and some can even be understood in the vernacular way, in the process of teaching, because the college students are younger, they are understanding a lot. In the process of the content, due to insufficient cognitive level and insufficient life experience, there will be more serious problems in understanding. For many ancient poems, college students cannot understand the meaning of ancient poems, not to mention the rich thoughts and feelings contained in them. Therefore, in the process of teaching ancient poetry, teachers need to use their own teaching strategies to help students enter a specific teaching situation and guide students to understand the meaning of poetry. To understand the text in ancient poems, multimedia will help us a lot in this process. Using multimedia to create teaching situations can not only effectively improve efficiency but also better integrate students into the teaching situation.

Most of the life experiences of ancient poets have many ups and downs, and the works created by these poets in different periods completely express different thoughts and feelings. Therefore, teachers can use this in the process of teaching. The feature allows students to understand the author’s life experience first, and then to understand the author’s work, so that students will experience the author’s mood at the time and have a better understanding of the thoughts and feelings to be expressed in the article. Poets are great. Most, in the process of writing poems, describe their own circumstances or ideals at the time. Therefore, it is more important to understand the circumstances of the author at that time. So the most effective way to help students understand the author is nothing more than multimedia. Teachers can use the Internet to search for the author’s life events in detail, highlighting the key points and the events experienced by the author in the process of writing this poem, so that students can understand the poet at that time more clearly.

This paper combines AI technology to improve the multimedia teaching mode and improve the effect of multimedia-assisted ancient poetry teaching, which lays the foundation for the subsequent improvement of the quality of ancient poetry teaching.

#### 2. Related Work

A physics-based simulation method simulates the painting process by building models of painting tools such as pens and paper, using image analogy, texture synthesis, and other methods to achieve the artistic effect of virtual reality. Literature [1] proposed a method of simulating hand-painting with style strokes, and literature [2] proposed a virtual brush model (HairyBrush), which can simulate simple ink painting effects. Reference [3] proposes a virtual art pen model based on the elasticity theory, which satisfies the elastic deformation law and models the art pen as an inverted cone to achieve real-time display effects; reference [4] proposes a virtual art pen model. brush model, which can generate more realistic writing effects. Literature [5] proposed a physics-based Mao Fei physical simulation method mainly to simulate the rendering effect. The model proposed in [6] can simulate the effect of ink diffusion on paper. The system developed in [7] can be used to draw trees, and the effect generated by the system is similar to the effect of oil painting. The method proposed in [8] simulates the effect of artistic painting and develops a corresponding system.

In terms of simulation based on painting effects, literature [9] uses the method of image analogy to achieve artistic style drawing. Literature [10] proposes a method of texture synthesis, proposes common textures from artistic paintings, establishes a texture library, and then selects appropriate textures from the texture library to synthesize new ink paintings. Literature [11] designed a paint model that can express a variety of painting forms including traditional Chinese paintings and watercolors. Many scholars are dedicated to studying how to use computers to generate artistic painting effects. Literature [12] proposes a method of synthesizing artistic paintings based on images and has conducted research on computer rendering of artistic paintings. Literature [13] proposed an art painting system simulation rendering algorithm based on texture synthesis. Literature [14] has done research on the rendering effect of art, which is an attempt to realize the style application of animation ink and wash effect. Literature [15] proposes a design method of dynamic artistic conception in order to protect cultural heritage. The 3D mountain rock model constructed in [16] successfully realized the 3D rendering of the art painting simulation effect of the pen and ink method. This method is based on summarizing the shape and distribution of the pen and ink method in the art painting and proposes a fractal modeling method based on the sketch line. Literature [17] designed a form that can automatically generate ancient cultural connotations from pictures in real time. Literature [18] proposes a tool based on the movement trajectory of the artistic pen and the painting material, which can vividly show the process of artistic painting. In recent years, the innovation of computer technology has led to the rapid development of programmable graphics hardware. Programmable graphics software has become an important tool for scholars to conduct research on 3D graphics rendering. Literature [19] proposed the use of programmable graphics hardware to achieve the effect of hand-drawn Chinese painting. In terms of interactive experience, [20] pointed out that, in order to protect art treasures but still allow people to appreciate them, an interactive desktop that can watch artworks at close range is developed. Viewers can achieve partial magnification of the screen through touch points on the desktop. Literature [21] proposed a new method of displaying and restoring ancient famous paintings from a three-dimensional perspective using TIP technology. Now this system can only display animation effects and cannot achieve interaction.

#### 3. Multimedia Virtual Semantic Extraction of Ancient Poetry

Probabilistic latent semantic analysis (PLSA) is a statistically significant model proposed by Hofmann, which is mainly used in text classification and natural language research. When researching natural language, it is often necessary to research synonyms. Probabilistic latent semantics is a better way to solve this kind of research problems. By mapping the high-dimensional sparse vector space of the actual observable “document-word” to the low-dimensional latent space, a connection beyond the vocabulary level is established.

In the PLSA model, we set the document set and the word set , and the cooccurrence matrix composed of the document set and the word set , as shown in

Here, represents the *i*-th document, represents the *j*-th word, and represents the number of times the *j*-th word appears in the *i*-th document. In matrix , each row represents the number of times each word appears in a certain document, and each column represents the number of times a certain word appears in each document. We assume that there is an implicit variable , that is, the subject variable. If is used to indicate the probability of selecting the *i*-th document, is the probability of the topic appearing in the *i*-th document. represents the probability that the word appears in the -th topic, and the relationship between the three is shown in Figure 1.

If it is assumed that document and word are conditionally independent under topic , formula (2) can be used to express the cooccurrence probability of document and word .

It can be seen from formula (2) that PLSA is actually a mixed model, and there are two model parameters and . Here, is the probability distribution of topic under a given document , and is the probability distribution of word under a given topic . In the PLSA model, the parameters and are estimated by using the maximum likelihood estimation method to maximize the probability of the generated sample data. The maximum likelihood function is shown in

Here, , and is not model parameters, so the formula can be reduced to

It can be seen from formula (4) that and are the parameters required by the maximum likelihood estimation function. However, since the likelihood function is difficult to maximize, the EM algorithm must be introduced.

The algorithm is an iterative algorithm, divided into two steps. The first step is a step, and what you ask for is expectations. The second step is step, which is the maximum value. The details are shown in formulas (5), (6), and (7).(1)Step *E*: the algorithm generates a posterior probability for each :(2)Step *M*: the algorithm reestimates the model parameters and :(3)Iteration termination condition is as follows: After initializing and , the EM algorithm alternately performs *E* and *M* steps for iterative calculation and generates new and until reaches convergence:

PLSA is mainly divided into two processes of training and testing in image quality evaluation. Among them, in the training process, the visual words and the visual word distribution under each latent semantics are obtained by extracting features and clustering from the training image set. In the testing process, the latent semantic distribution of the test image set is compared with the latent semantic distribution of the reference image, and then the quality score of the image is obtained. The specific process is shown in Figure 2.

It can be seen from Figure 2 that the image quality score is obtained by calculating the semantic similarity of the latent semantic distribution of the training image set and the latent semantic distribution of the test image set. The specific process is to separately propose the latent semantic distribution of the undistorted image and calculate the average value of the latent semantic distribution according to formula (9) as a reference value for comparison. The specific process is to separately propose the latent semantic distribution of the undistorted image and calculate the average value of the latent semantic distribution according to formula (9) as a reference value for comparison. Here, is the number of reference images.

There should be a certain nonlinear relationship between the subjective data and the objective data of image quality evaluation. Generally, the nonlinear function selected should be monotonous, and it is best to avoid data overfitting. Therefore, this paper uses the commonly used logarithmic regression function for fitting, and the function expression is

Here, represents the nonlinear relationship, is the fitted value, that is, the predicted subjective quality value, which is a deformation of , and is the objective evaluation value. can be obtained by fitting. Figure 3 shows the prediction accuracy of the subjective MOS value, where the curve is the fitting curve of the nonlinear regression function, and the scattered points near the curve represent the data pair .

In Figure 3, the smaller the error between and MOS, the higher the accuracy of the algorithm. Currently, the commonly used method is taken from the performance evaluation algorithm proposed by the Video Quality Evaluation Group (VQEG). That is, the five indicators of Pearson correlation coefficient (CC), root mean square error (RMSE), average absolute error (RMSE), Spearman rank correlation coefficient (ROCC), and rate of dispersion (OR) are used as methods to evaluate the accuracy of the algorithm: The Pearson correlation coefficient (CC) is Here, and are subjective and objective evaluation scores, respectively, and and are, respectively, the mean value of subjective and objective evaluation.(2)The root mean square error (RMSE) is Here, is the subjective evaluation score, and is the value.(3)The average absolute error (MAE) is Here, is the subjective evaluation score, and is the value.(4)The Spearman rank correlation coefficient (ROCC) is Here, and , respectively, represent the data after the subjective and objective scores are sorted in the same order.(5)The dispersion rate is Here, is the number of points where the deviation between the fitted value and the subjective value is greater than 1.5.

In formula (11), the Pearson correlation coefficient can measure whether two data sets are on a straight line. The greater the absolute value of the correlation coefficient, the stronger the correlation. The root mean square error in formula (12) can measure the deviation between the predicted value and the subjective value. The smaller the deviation, the better the algorithm performance. The Spearman rank correlation coefficient in formula (14) can measure the monotonicity of two data sets. The larger the absolute value of ROCC, the stronger the monotonicity. The smaller the OR in formula (15), the higher the degree of aggregation of the algorithm. Therefore, the larger the value of CC and ROCC, the better the performance of the image quality evaluation algorithm. The smaller the MAE, RMSE, and OR, the worse the image quality evaluation algorithm.

Image semantics is the basic description carrier of image information content. It can convert the complete image content into an intuitively understandable text-like language expression. It is the high-level content in the field of image research and plays a vital role in the objective quality evaluation of images.

In the PLSA-based image quality evaluation, the specific application of image semantics reflects two aspects. One aspect is the construction of the model. That is, after the low-level feature extraction is performed on the undistorted images and training images, the semantics are implicitly incorporated into them. This maps the high-dimensional “image-visual word” cooccurrence matrix to the low-dimensional space, thereby establishing a probability model between “image-latent semantics-visual word.” On the other hand, it is reflected in the analysis of semantics, that is, the measurement of semantic similarity. Based on the rule that the latent semantic distribution of distorted images of the same type and level has similar laws, the latent semantic distribution of distorted images of different types or different levels often has certain differences. The semantic similarity measure takes the latent semantic distribution of the undistorted image as the criterion for evaluation and comparison and then compares it with the latent semantic distribution of the test image to obtain the objective score of the image. Therefore, choosing a reasonable semantic similarity measurement method is the final key step to obtain a better image quality evaluation algorithm.

For a given distorted image, this paper first calculates the normalized brightness through local mean subtraction and standardized splitting.

Ruderman believes that a local nonlinear operator can be added to the logarithmic ratio brightness to remove the local mean deviation and the standardized local variance, and there is no correlation between the two. This operator can be obtained from a given image:

Here, is the height and width of the image. *C* = 1 is a constant, because when the denominator approaches 0, *C* = 1 can prevent system instability.

Here, , and is a set of two-dimensional circularly symmetric Gaussian weight functions.

Ruderman also observed that the normalized brightness value of natural images tends to unit Gaussian features, and this feature can be used in the contrast gain mask processing model in early human vision. In the image quality evaluation system, the brightness value can be converted into MSCN (mean subtracted contrast normalized) coefficients. In the picture shown on the left picture of Figure 4, it can be seen that there is a relatively high correlation between adjacent pixels, and a diagonal structure is observed in the left picture. As shown on the right side of Figure 4, the normalization method reduces the correlation between adjacent pixels of the image.

Of course, this problem can be solved by modulating the matrix of the topic distribution probability under each image, but this also causes the score of the test image to exceed the score of the reference image. Therefore, it is unreasonable. This also shows that, to some extent, KL distance is actually not suitable as a semantic similarity algorithm for image quality evaluation. However, Hellinger distance does not have this problem. Hellinger distance is to calculate the square of the difference between and , so we only need to subtract the two matrices and then square each element, and we do not need to consider other factors. Therefore, Hellinger distance is more suitable for image quality evaluation than KL distance, and it can be used as a semantic similarity algorithm together with cosine similarity in image quality evaluation.

The MSCN coefficient has the characteristic of changing with the distortion, and this feature can ensure that the distortion type that affects the image quality is predicted through the image distortion change. The distribution of MSCN coefficients of blur features (blur) is closer to the Laplacian feature, while the distribution of MSCN coefficients of white noise (wn) reduces the weight of the tail of the histogram. The generalized Gaussian distribution (GGD) can effectively capture the statistical characteristics of the distorted image, and its zero-mean expression is as follows:

Here, is the gamma function. Since the coefficient distribution is symmetric, it can be described by the zero-mean GGD distribution. Among them, the parameter of the GGD distribution can be fitted by the time-moment matching method.

In fact, in the GGD distribution model of the reference image and the distorted image, the parameter settings are almost the same. For each image, the MSCN coefficient can be fitted by estimating the two parameters in the GGD distribution model. These two parameters form the first two dimensions of the multidimensional feature and are used to capture the distortion type of the image. As for the other several-dimensional features of multidimensional features, the asymmetric generalized Gaussian distribution model can be used, and its formula is as follows:

Here,

is the shape parameter of the image, which determines the distortion type of the image, and and are the scale parameters that control the distribution of each side of the model. In fact, the AGGD model is a generalization of the GGD model. The reason is that the AGGD model is mainly for the asymmetric MSCN coefficient distribution, and the slope of the distribution is an equation about the left and right scale parameters. When , the AGGD model is transformed into the GGD model. When , the parameters of the best AGGD model can be fitted.

Due to the existence of image distortion, adjacent MSCN coefficients will show a certain “structure.” Therefore, this “structure” can be obtained by calculating the statistical relationship between adjacent pixels in the four directions of the horizontal direction (H), vertical direction (V), main diagonal (D1), and subdiagonal (D2), as shown in Figure 4 and formula (21).

In this way, the abovementioned four directions are considered at the same time, and each direction is described by four parameters; that is, a total of 16 parameters constitute the 16-dimensional feature of the image.

Since human vision is multiscale, when people are observing images, the acquisition of image feature information will be interfered by scales. According to recent research on image quality evaluation, the quality evaluation algorithm combined with multiscale information can better express the correlation with human perception. Therefore, two scales of the original image and the reduced one-time image can be used to extract a total of 36-dimensional features of the image.

#### 4. The Multimedia-Assisted Ancient Poetry Teaching System Based on AI Technology

The learning system creates virtual characters in the first person by default, and learners roam and learn in the virtual scene according to the prompts. The overall technical block diagram of the system is shown in Figure 5.

Through the intelligent teaching system, it is helpful to mobilize students' enthusiasm and initiative in the classroom, to improve their learning efficiency, and to cultivate students' practical ability and innovation ability. The overall design framework of the system is shown in Figure 6.

This paper uses Proteus and Keil software to build an embedded system virtual simulation platform. The simulation process of 5l one-chip computer embedded system is shown as in Figure 7. First, this paper uses Proteus to draw a schematic diagram, selects Keil software to design the source code, then compiles the code, generates the target file, and then loads the HEX file in the Proteus software.

This research designs its system architecture for VR/AR teaching needs. It mainly describes the abstract components that directly constitute the VR/AR teaching system, including VR/AR teaching research architecture, VR/AR teaching design and teaching strategy framework, and VR/AR education resource engineering framework. The specific research content is shown in Figure 8.

The AR/AR space layer contains various learning spaces of the system, including VR direct interaction space, AR superimposed space, morphological space, and virtual-real interlaced space. These spaces can be divided into four types according to their structure and properties: isomorphism and homogeneity, heterogeneity and homogeneity, isomorphism and heterogeneity, and heterogeneity and heterogeneity. In addition, the VR space also reflects the bearing relationship between people/story and space. There are four relationships between them: 1 : 1, 1 : N, N : 1, and N : N. It is one person and one space, one person and multiple spaces, multiple people and one space, multiple people and multiple spaces, one story and one space, one story and multiple spaces, multiple stories and one space, and multiple stories and multiple spaces. In the space, learners complete their own learning process through fixed navigation, adaptive navigation, and creative navigation, as shown in Figure 9.

This paper takes “Wanglu Mountain Waterfall” as an example to conduct multimedia-assisted research on ancient poetry teaching, and its image processing is shown in Figure 10.

Through the analysis of Figure 10, it can be seen that the multimedia-assisted artistic conception creation system of ancient poetry based on AI technology can basically meet the teaching needs. On this basis, this paper conducts research on the performance of the system and counts the immersion and teaching effect brought by the system to students. The results are shown in Tables 1 and 2.

From the above research, we can see that the multimedia-assisted artistic conception creation system of ancient poetry proposed in this paper has a good teaching effect and has an important role in promoting the study of the artistic conception of ancient poetry.

#### 5. Conclusion

The linguistic features of ancient poems are very obvious; generally each sentence does not exceed seven characters. In a brief narrative, it either portrays a magnificent scene or a melancholy picture, which depends entirely on the author's thoughts and feelings at the time and the theme of the work itself. Therefore, ancient poems are an important form of writing in ancient China, and they are also of great significance in contemporary times. Many of the profound ideas still have reference significance today. This paper combines AI technology to improve the multimedia teaching mode, improve the effect of multimedia-assisted ancient poetry teaching, and lay the foundation for the subsequent improvement of the quality of ancient poetry teaching. The research shows that the multimedia-assisted artistic conception creation system of ancient poetry proposed in this paper has a good teaching effect and plays an important role in promoting the study of the artistic conception of ancient poetry.

#### Data Availability

The labeled datasets used to support the findings of this study are available from the corresponding author upon request.

#### Conflicts of Interest

The author declares no conflicts of interest.

#### Acknowledgments

This study was sponsored by Henan Industry and Trade Vocational College.