Abstract

To enhance the quality and efficiency of computer-enabled generation of papers for Test for English Majors Band 8 (TEM-8), a paper generation model supported by sparrow search algorithm-genetic algorithm was studied. First, a simplified test paper generation mathematical model was set up after analyzing and studying types and characteristics of TEM-8 tasks. In the model, quantity, type, difficulty, discrimination degree, scores, exposure, and answering time of test questions were taken into consideration. To enhance the optimizing effect of the genetic algorithm for searching test questions, the traditional genetic algorithm was improved by introducing the sparrow search algorithm into the model to achieve a better crossover rate, variance rate, optimization precision, and speed of the genetic algorithm. A new sparrow search-genetic algorithm (SSA-GA) was designed, and the optimizing effect of SSA-GA was verified to be ideal through optimizing six standard test functions. Then, SSA-GA was applied to conduct experimentation with test paper generation, and comparison with traditional genetic algorithms was also made. The values of best and average fitness of SSA-GA were better than those of the traditional genetic algorithm (GA) in the paper generation. Exposure rate and success rate in TEM-8 paper generation of SSA-GA were higher than those of traditional GA in TEM-8 paper generation. Results showed that the studied SSA-GA could implement test paper generation with higher speed and better quality.

1. Introduction

In the context of increasing social development and exchanges in economy, culture, and science and technology among all countries, English has become one of the most widely used communication languages [1, 2]. The educational circles of all countries have become aware of the important role English play in boosting their integration into the international community and promoting their all-round development and, consequently, are all intensifying their efforts in advancing English teaching [35]. In the course of English teaching, conducting tests of all types is inevitable. Hence, test paper generation has become a heavy task in the everyday work of English teachers. With the development of artificial intelligence (AI) technology, computer can simulate human thinking to replace mankind in fulfilling tasks that require intelligence. So far, many scholars have studied computer-aided test paper generation. For example, Wu explored a computer-based paper generation model in view of learning effect predictions to enhance the quality of computer test paper generation. In this prediction, the deep knowledge tracking method is used, and the test paper scores consider the weight of the technique, difficulty degree of the test papers, and distribution laws of test questions. Wu’s model can ensure the distribution field of test questions and high validity of test paper generation effectively [6]. According to the fuzzy mathematical theory [7], test questions can be selected randomly in test data through quantitative processing of paper generation requirements. To solve the trouble in traditional test paper generation, such as the unclear objective functions and low precision of optimum solution, Yang et al. proposed the particle swarm optimization (PSO) algorithm and applied it to solve functions of test paper generation for multiple objectives and has greatly enhanced the level of intelligent tests of the online testing system [8]. El-Rahman and Zolait designed a test bank and studied a shuttle algorithm, which ensured the online formation of test papers with varying difficulty levels and requirements and could save the time and cost of teaching greatly [9]. Duan et al. introduced an automatic method of test paper generation via the knowledge embedding method. First, the embedded knowledge and mechanism of test paper generation are studied and analyzed. Second, modelling is completed. The model can enhance the efficiency of test paper generation significantly [10]. To avoid the problem of low speed in traditional test paper generation, Zhang et al. offered a new test paper generation method, which uses random functions to generate multiple numbers and increase the random numbers. On this basis, random numbers are sequenced according to test requirements; the method is used to enable the simultaneous selection from the test bank and test paper generation, thereby enhancing the efficiency and reducing the duration of test paper generation [11]. Nguyen et al. provided a submodular meme approximation algorithm to address the disturbance of optimization by dual objectives during test paper generation. While strengthening the local search mechanism of the sub-module, this algorithm has upgraded the quality of test paper generation to a large degree [12]. Paul presented a genetic algorithm that can be used to quickly create a test paper generation template [13]. Lu et al. used Delphi technology together with the SQL server in completing the establishment of test bank and the automatic test paper generation system that can work stably [14]. Härtel et al. designed a language test paper generation system based on grammar and a goal-oriented framework and has a satisfactory effect [15]. Du et al. presented a method of searching knowledge points based on ontology and, on this basis, designed an automatic system of test paper generation [16]. Li et al. proposed an improved test paper generation algorithm of back testing, fit for test paper generation in simultaneous examination [17]. Ruisen et al. put forward a bottom-up method of creating a test bank of engineering graphics to set up a multipath method for question selection and test paper generation on this basis [18]. To address the low success rate of the computer-enabled intelligent test paper generation algorithm, many scholars advanced test paper generation models by means of the genetic algorithm [13, 1921]. Practices prove that the coding method and iterative mode of the genetic algorithm are highly suitable for optimized solutions during test paper generation. In order to determine paper generation speed, the consuming time can be calculated on the basis of the complexity function [22]. The standard indexes can be set and the quality of the paper generation system can be evaluated through the fuzzy graph theory [23]. Therefore, many scholars have achieved many results in their studies [2426].

Therefore, this paper takes the automatic test paper generation system for TEM-8 as the study object and applies the genetic algorithm to create an automatic test paper generation model for TEM-8. However, although the genetic algorithm suits the multigoal optimization of intelligent paper generation well, its parameters (e.g., crossover rate and variation probability) are selected via designer’s experience, thereby resulting in different optimization effects of the genetic algorithm and unstable quality of test paper generation. Therefore, a wide space exists for studies and technical problems that are in need of solution. Theory of determining journey order based on graph’s Wiener absolute index and inverse fuzzy mixed graphs, proposed by Poulik and Ganesh, is suitable for finding the parameters of GA [27, 28]. However, a lot of calculation is still needed. Optimizing algorithms such as particle swarm optimizing algorithm, the simulated annealing algorithm can be employed to find the suitable key parameters for GA. SSA, proposed recently with greater searching ability and less parameters, has been employed to optimize complicated problems with good effect [29]. In order to simplify the process of finding good parameters, this paper introduced the sparrow search algorithm into intelligent automatic test paper generation for optimization of GA key parameters to design a new sparrow-genetic algorithm for test paper generation.

This paper is organized as follows. In the forth-coming section, models of intelligently generating TEM-8 paper are built on analyzing test organization and requirements. Subsequently, the sparrow search algorithm is employed to optimize the crossover and variation probabilities of the traditional genetic algorithm. On this basis, SSA-GA is developed and verified through six standard test functions. In the final section, experiments of intelligent test paper generation are carried out. Results show that SSA-GA can generate test papers for TEM-8 with higher quality and greater speed than traditional GA.

2. Analysis and Modeling of TEM-8 Intelligent Paper Generation

As an examination for undergraduates, TEM (including TEM-4 and TEM-8) is conducted to test how teaching syllabus for undergraduate English majors is implemented. TEM-8 tasks are as follows: listening comprehension, reading comprehension, language knowledge, translation, and writing. TEM-8 is an all-round examination to test the comprehensive linguistic proficiency of students majoring in English. Each year, a large number of students prepare for the examination by utilizing all types of professional English test banks. If a test bank can be utilized to quickly create the TEM-8 test papers that satisfy examination requirements, it will benefit a large number of students in their studies and examinations.

TEM-8 paper users can be divided into two groups: those preparing for an examination and those taking part in an examination. The former needs a personalized test paper that can provide more training aiming at their own weaknesses. At different stages, test papers with varying difficulty are needed. Meanwhile, the latter needs a standardized test paper, which should be similar to TEM-8 test papers in all aspects as much as possible so as to achieve a better simulation effect. To meet the needs of different users, a plan is adopted to make parallel search for multiattribute parameters. The TEM-8 test papers are defined as a collection of multiple attributes, which includes quantity, type, difficulty, discrimination degree, scores, exposure, and answering time of test questions. The model of a paper can be created in light of the above attributes. First, the target matrix of one single-question model is shown in the following formula:where S1 stands for the model of one certain question, a1 is the question type, a2 is the question knowledge points, a3 is the question score, a4 is the question discrimination degree, a5 is the question difficulty, a6 is the expected question-answering time, and a7 is the question exposure.

It is possible to create a question that meets the requirements by fully restraining the seven attributes. The target matrix of the single-question model is transformed into the matrix that includes the model S of n number of questions.

Through the target matrix, the paper generation issue turns into the issue of solving multitarget restraints. In the matrix, each line means one question (there are n number of questions) and includes the seven attributes of a question. Matrix S is the target-state matrix of this paper generation method. Through the restraint and solution of the target matrix parameters, the online TEM-8 test paper generation is completed. The attributes of TEM-8 test paper are calculated as follows.

2.1. Restraint of Question Types

The attribute of question type plays a vital role in evaluating score distribution of paper in later period. It is necessary to conclude this attribute when test questions are stored in test bank. TEM-8 question types, such as listening comprehension, reading comprehension, language knowledge, translation, and writing, are marked with numbers 1–5, respectively. When a certain type of questions is chosen, its a1 value is the number that corresponds to the question type.

The matrix of the attributes for question types in a TEM-8 test paper is as follows:where S_type is the matrix of question type and Sia1 is the question type of question no. i.

Through formula (4), the number of different elements in S_type is judged to obtain the quantity S_type_data in TEM-8 test papers.

2.2. Restraint of Knowledge Points in a Test Paper

Knowledge points are very important and determined by test syllabuses. If knowledge points tested in a paper are not from the syllabuses, the generated paper is not a qualified one. The knowledge points of test questions differ from questions types because a one-to-one relationship exists between the question type and one question, whereas a multiple-to-multiple relationship exists between the knowledge points and the question itself. The same knowledge point may exist in multiple questions, and one question may include multiple knowledge points. The quantity of knowledge points and their appearance frequency decide the test paper difficulty and the examination requirements and effect.

The knowledge points for a single test question are shown in the following formula:where A2 is the matrix of single questions and Ki is no. i knowledge point. Suppose TEM-8 includes p knowledge points. A specific question is taken as an example. Value ki, which corresponds to knowledge points included in the question, is set to 1, and value i, which corresponds to knowledge points not included, is set to 0.where A2_point_num is the number of knowledge points included in a single question.

The knowledge point matrix S_point for TEM-8 test paper is shown in the following formula:where SiA2 is the knowledge point matrix of question no. i.

The total knowledge included in a TEM-8 test paper is expressed by the following formula:where S_point_all is the matrix of the number of examinations for every knowledge point in a test paper.where S_point_num is the sum of knowledge points included in a paper.

The attribute parameters of knowledge points can be obtained through calculating the number of knowledge points in a test and its total number in syllabuses.where S_point_data is the attribute parameters of knowledge points in a paper.

2.3. Restraint of the Total Score

The total score is an important condition that restrains test paper generation. Under the conditions established for a test, it thereof should be an accurate value. The total score restraint is solved as follows:where S_sco_data is the total score of a paper and Sia4 is the score of question no. i. For example, in tests for TEM-8, S_sco_data is 100.

2.4. Restraint of Discrimination

The discrimination degree of test questions is used to evaluate students' mastery of relevant knowledge and content. It directly affects students’ scores and reflects students’ learning ability. It is an index that cannot be obtained directly but only through testing, statistics, and analysis. The discrimination degree of every question is obtained through the following formula with data from previous examinations:where S_disi is discrimination degree of question no. i, Dh is the average result of questions among examinees’ high-score group (top 25%), Dl is the average result of questions among examinees’ low-score group (bottom 25%), and W is the full score of questions.

The discrimination parameter S_dis_data of a test paper is expressed as follows:

For example, if S_dis_data ≥ 0.4, the quality of a generated paper is very good; if 0.3 ≤ S_dis_data < 0.4, the quality is comparatively good; if 0.2 ≤ S_dis_data <0.3, the quality is comparatively bad; if S_dis_dat < 0.2, the quality is very bad.

2.5. Restraint of Test Difficulty

Test difficulty, which can greatly affect test takers’ performance, is one of the core attributes in a test. The difficulty coefficient is an index used to test the difficulty degree of a test paper and is expressed by the following formula:where Sia5 is the difficulty of question no. i and is the average score of all examinees.

For example, Sia5 is in the domain (0.0, 1.0). The greater this value is, the higher the difficulty is, which means few students can answer the question correctly.

The difficulty degree is solved through the following formula:where S_dif_data is the difficulty degree of a test paper.

2.6. Expected Question-Answering Time

The time on answering every question should be based on students’ average answering time, and the total time spent on answering all questions should be less than the examination standard time.where S_tim_data is the expected total question-answering time for a test paper, and its value should be less than or equal to the examination standard time. Sia6 is the expected question-answering time of question no. i. For example, in TEM-8, S_tim_data is 150 MIN.

2.7. Exposure of Test Questions

The paper exposure determines the confidentiality of one paper. The smaller the exposure is, the higher test quality is. The exposure of test questions is a parameter constantly updated with the increasing frequency of test paper generation. The initial questions exposure is set to E. In the subsequent paper generation, the exposure increases by 1 each time when it is selected, as is shown in the following formula:where Sia7 is the exposure of question no. i.where S_exp_data is the exposure of a test paper.

3. Design of Sparrow Search-Genetic Algorithm

The traditional genetic algorithm is widely applied in optimization solution. However, in the operation of population crossover and variation, determining the coefficients of crossover and variation through designers’ experience will restrict the solution capability of the genetic algorithm. The solution capability of an algorithm can be enhanced to a certain extent by the probabilities of crossover and variation determined through people’s experience in test paper generation. However, that will be a heavy workload for the people. The continuous changes in test questions will lead to increasing workload in setting the probabilities of crossover and variation. Therefore, the sparrow search algorithm is used in online optimization of the probabilities of crossover and variation in the genetic algorithm. Hence, a new sparrow search-genetic algorithm is designed and used in TEM-8 test paper generation.

A method is proposed herein to use the sparrow search algorithm to select the probabilities of crossover and variation of the genetic algorithm. When the sparrow search algorithm continuously compares and selects different probabilities of crossover and variation, the genetic algorithm solves the precision of the mean value of results repetitively to select the optimal probabilities of crossover and variation after 100 iterations of the sparrow search algorithm, which are applied to the genetic algorithm for paper generation.

The sparrow search algorithm is a newly-proposed (in 2020) swarm intelligence algorithm [30, 31], whose fundamental principle is to simulate the foraging and antipredator behavior of a sparrow population. The algorithm model includes the detector model, participant model, and early warner model. With higher energy reserve and a high value of fitness, the detector mainly provides the sparrow population with foraging areas and orientation. The individuals selected as detectors are those with a fitness value among the top 20% of the entire population. A joiner follows a detector with the optimal value of fitness in looking for food to obtain his own energy reserve and increase its own value of fitness. The remaining 80% of the individuals in a population are joiners. Some joiners continuously observe the detectors to vie for a position that provides more food. An early warner will provide an alarm when he becomes aware of any danger and, in the meantime, will quickly transfer to a safer area for a better position. The sparrow population will walk randomly to approach other sparrows, which is an antipredator behavior. Meanwhile, when the value of alarm is larger than the safety value, the detector will push the joiners away from the dangerous area.

Given that a detector leads the flow of the entire sparrow population, he can search for food in any place. R2 ∈ [0,1] is a random number that stands for early warning value and ST ∈[0.5, 1] stands for safety value, which is 0.8 for this purpose.

R 2 < ST means that there is no predator nearby, and the detector will get into a broad search mode. His position is updated as follows:where is the current position of the sparrow, is the updated position, t is the current number of iterations, itermax is the maximum number of iterations, and α is a random number which is evenly distributed among (0, 1).

R 2 ≥ ST means that there is danger around and all sparrows have to fly to other safe areas. The specific position updating is presented as follows:where Q is a random number that follows a normal distribution and L is the matrix of 1 × d with elements that are all 1 (d is the dimension of variable), that is, every internal element is 1.

A joiner will follow a detector in search of food and may vie with the detector to increase his own food. i > n/2 means that no. i joiner with a poor fitness may starve to death. At this time, the joiner needs to look for food in other places. The formula of position updating is presented as follows:

Given i > n/2,where is the worst position in a sparrow population at the t time iteration.

Given i ≤ n/2, no. i joiner randomly finds a position for foraging near the optimal position of the detector.where is the optimal position of the detector at t+ 1 time iteration. A is a multidimensional matrix with internal elements that are 1 or −1, and A+ = AT (AAT)−1.

f i is the fitness value of current individual i, and and are the best and worst fitness values of the current sparrow population, respectively.  >  means that a sparrow is on the edge of its population. For the sparrows to be aware of the danger, their position updating formula is expressed as follows:where is the present global optimal position in a population and β is the step length parameter that follows a normal distribution with 0 mean value and 1 variance.

means the sparrow is in the population center and has become aware of the danger. The sparrow draws close to other individuals to protect itself from the attack of predators. Its position updating formula is expressed as follows:where ε is a very small constant used to avoid the denominator of 0. k∈[−1, 1] is used to control the sparrow’s movement direction. Figure 1 displays the flowchart of the sparrow search-genetic algorithm.

In order to verify solution capability of the studied sparrow search algorithm-genetic algorithm, we solved different test functions for an analytical comparison with the general genetic algorithm. Table 1 lists the functions and parameters used in the test.

F1–F3 are unimodal functions. Only one global optimum in the section, which can reduce the impact upon algorithm solution caused by local optimal solution, is considered. Therefore, these functions can evaluate and compare the solution capability of the algorithm before and after optimization very well. F4–F6 are multimodal functions. Different from unimodal functions, multimodal functions include multiple locally optimal solutions, and their number will show an exponential growth with increasing problems (number of design variables). Falling into local optimal solution, the problem is most likely to appear in optimized algorithms. Therefore, the solution of multimodal functions is leveraged to observe the algorithm ability to jump out of local optimum. Through solution of different types of functions, stability of the algorithm solution can be verified.

Figure 2 shows the solution and iteration processes of optimizing test functions by using the sparrow-genetic algorithm involved in our study. In the process, normal GA is the solution and iteration curve of the general genetic algorithm, with 0.2 being selected as the value for its crossover and variation probabilities. SSA-GA corresponds to the improved genetic algorithm that uses the sparrow search algorithm to select crossover and variation probabilities. This time, the sparrow search algorithm selected a crossover probability of 0.26958 and a variation probability of 0.43144. To better display the difference of the two algorithms in the convergence process, Figures 2(a)2(e) use logarithmic coordinate axis as longitudinal coordinates. Given that the optimal value of the functions in Figure 2(f) is a negative value, a linear coordinate axis is used as longitudinal coordinates.

As it is shown in Figures 2(a)2(f), the sparrow search-genetic algorithm displays a higher convergence speed and solution precision in the solution process of different test functions. This finding proves that the sparrow search-genetic algorithm has a better solution capability and a higher solution precision. Particularly, from the solution process of multimodal functions F4–F6, the sparrow search-genetic algorithm has markedly enhanced its ability to jump out of local optimum and achieved a higher convergence speed, thereby achieving a significant optimization of the genetic algorithm.

To verify the high solution precision and better universality and stability of the sparrow search-genetic algorithm, the mean values and standard deviation of 20 solution results are calculated. The results are presented in Table 2.

The comparison of the mean values and standard deviation of 20 solution results shows that the sparrow search-genetic algorithm has more accurate mean values of the solution results than a normal GA, indicating that the solution precision of SSA-GA is generally higher. For SSA-GA, the standard deviation of the results after 20 solutions is smaller, indicating that SSA-GA is more stable in terms of solution capability. Thus, the SSA-GA algorithm can be proven to have a higher convergence speed, and the solutions are more accurate and stable.

4. Experiment of Intelligent Test Paper Generation Based on SSA-GA Algorithm

A standard test bank is used to conduct simulation experiment of a separate TEM-8 paper generation through the traditional genetic algorithm and SSA-GA algorithm. The simulation experiment uses the Windows 7 operating system, a CPU of Intel Core i5, and an internal memory of 8 GB. The test bank includes banks for listening comprehension, reading comprehension, language knowledge, translation, and writing. Every test bank includes 5000 questions, which form a test paper composed of 25 questions of listening comprehension, 22 questions of reading comprehension, 10 questions of language knowledge, 1 translation task, and 1 writing task. Table 3 shows the question types, question number, and score share of a TEM-8 paper formed by the genetic algorithm.

The crossover and variation probabilities of the test paper generation via the SSA-GA algorithm are obtained through the use of sparrow search algorithm optimization. The values of the crossover and variation probabilities are 0.26274 and 0.48163. The number of individuals in a population and the evolutionary algebra is 50 and 250. The test paper generation experiments by the two algorithms were run for 100 times to form 100 sets of TEM-8 papers. Figure 3 shows the convergence curve of the best fitness of the two algorithms. Through analyzing convergence curve of the best fitness, the normal GA algorithm is proved to have certain advantages in the early period. However, after 60 iterations, the value of the fitness function changes slowly. The studied SSA-GA algorithm can better jump out of local optimum for an improved evolution. The fitness of the SSA-GA algorithm obtained after the quick excess of the normal GA algorithm can reach the optimal value sooner.

Meanwhile, we determined the relationship curve that reflects the iteration-based changes of average fitness of the two algorithms, as shown in Figure 4. In Figure 4, the SSA-GA algorithm manifests marked advantages with the continuation of the iteration process. Meanwhile, we calculated the average fitness of 50, 100, 150, 200, and 250 iterations, as shown in Table 4. In Table 4, compared with that of the traditional genetic algorithm, the average fitness of the SSA-GA algorithm after 50, 100, 150, 200, and 250 iterations is improved by 22.3%, 27.3%, 24.1%, 35.1%, and 38.5%, respectively.

To further examine the performance of the studied SSA-GA algorithm and normal GA algorithm, time consumed by test paper generation is calculated, as shown in Figure 5. In Figure 5, given that the size of English test database and iterations are the same, the studied SSA-GA algorithm consumes less time than normal GA and will show even more obvious advantages with increasing iterations.

The exposure of test questions is an important factor for evaluating test paper quality. The higher the exposure is, the lower quality a test has, and the more similar the papers are. Therefore, the exposure and success rate of the two algorithms were calculated, as shown in Table 5. For the TEM-8 papers created by the studied SSA-GA algorithm, the exposure rate is smaller than that of normal GA algorithms and success rate of SSA-GA is higher than that of normal GA. This comparison shows that the studied SSA-GA algorithm exceeds the normal GA algorithm in improving test quality and is more suitable for TEM-8 paper generation.

5. Conclusions

Taking TEM-8 intelligent test paper generation as the core, this study proposed a strategy for intelligent test paper generation based on the sparrow-genetic algorithm. Tasks that have been fulfilled are summarized as follows:(1)Firstly, types, scores, and focuses of test papers for TEM-8 examinees were analyzed. On this basis, test questions were divided into 7 restraints, such as types, knowledge points, scores, discrimination, difficulty and exposure of test questions, and expected question-answering time, to build a mathematical model.(2)To upgrade the precision and speed of the genetic algorithm optimization, a new sparrow search algorithm was introduced to optimize the crossover and variation probabilities of the traditional genetic algorithm for the design of the sparrow-genetic algorithm. Six standard test functions were also used to test the sparrow-genetic algorithm. The results proved that the studied sparrow search-genetic algorithm has a stronger optimization capability.(3)A new sparrow search-genetic algorithm was used to test TEM-8 intelligent paper generation system, and the results were compared with the test paper generation with the normal GA algorithm. The finding showed the new algorithm was better than the traditional one in terms of best fitness, average fitness, time of paper generation, and questions exposure. Hence, it can be proved that the designed sparrow search-genetic algorithm had a better effect on test paper generation.

In conclusion, experiments have proved that the SSA-GA algorithm proposed in this paper can generate test papers for TEM-8 with higher quality and greater speed. Therefore, the SSA-GA algorithm can help English teachers generate TEM-8 papers intelligently. However, with Internet being widely used in every aspect of human lives, many students prefer to study English online. In the future, we will focus our study on combining the SSA-GA algorithm with Internet and building online test paper generation system via the SSA-GA algorithm.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by Key Research Project of Humanities and Social Science of Universities in Anhui Province (no. SK2020A0226).