#### Abstract

This research is devoted to the development of a multicriteria model, valid only for testing, which takes into account the number of responses, the level of question complexity depending on the time spent on their response, time of the whole testing, doubts, number of passes, and the use of additional programs. Also the developed method of measuring the level of the user doubts is described, which gives a clearer and “transparent situation picture” for a more objective decision-making. The method makes it possible to reduce the probability of guessing the correct answer, which increases the objectivity of assessing the knowledge level in the diagnostic systems for E&M learning.

#### 1. Introduction

Distance education and e-learning have laid the foundation for a new global phenomenon, Smart Education, which is not so much about technology as it is about the new philosophy of education.

Developers of distance learning systems cannot always fully take into account all the laws of the learning process and therefore use not quite correct models. And even the most successful solutions in the field of online training rarely get widespread.

Undoubtedly, distance learning is by no means a complete substitute for traditional training, because it is difficult to replace live communication with a teacher or the educational atmosphere that develops in the classroom between subjects of the learning process. At the same time, modern ICT (information and communication technologies) can minimize the “narrow” places of distance learning. Modern pedagogy is also changing, and nowadays transfer of part of the training load to the online mode is quite an acknowledged educational scenario. This can be done when developing educational content on a modern IT basis, for example, using MOOCs (a mass open educational course).

Today, we can confidently state the existence of a new digital Z-generation of people for whom a mobile phone, a computer, and the Internet are natural elements of their living space. In modern conditions, effective education is education without reference to time and place. This education teaches through everyday life. The transition to this mobile learning technology involves the application of new methods, approaches, and principles of the learning process organization. When developing content for mobile devices, it is necessary to take into account that it is intended for young people of the Z-generation, as the method of preparing training content for them differs from the traditional learning content.

So, smart technologies in education, such as mobile applications, are of great importance as they allow to optimize the costs of the university logistics and also to raise the quality of educational services and products to a new level. It is smart technology that allows to develop revolutionary teaching materials, as well as to form individual trajectories of training.

In 2015, UNESCO published Recommendations on Mobile Learning Policies [1], which fully justifies the need for the introduction of mobile technologies in the educational process. According to these recommendations, “In a world in which dependence on means of communication and access to information is growing, mobile devices will not be a transient phenomenon. As the capacity and capabilities of mobile devices are constantly growing, they can be used more widely as educational tools and take a central place, both in formal and informal education.”

History shows that the competitiveness of the national economy as a whole depends on the development of information technologies. According to the World Economic Forum, the competitiveness index of the economies of states has a high level of correlation with the countries’ information and communication technologies. According to the 2012 rating of the competitiveness of 142 countries of the world by the World Economic Forum, countries that are actively developing information technologies are ahead of Kazakhstan which ranks 51^{st} in terms of creating demand for information technology (the USA-13, Germany-19, India-63, and Egypt-96) and occupies the 104^{th} place in the information technology business conditions place (USA-21, Germany-38, and India-72).

Scientific novelty consists in developing a method of objective measurement of students’ knowledge on the basis of a multicriteria decision-making model.

#### 2. Development of a Multicriteria Model for Diagnosing Knowledge

To solve the problem, an intellectual multicriteria model for assessing students’ knowledge has been developed, which makes it possible to take into account the following parameters: (1) time spent on the answer, (2) the number of correct answers, (3) time of all testing, (4) the doubts of the test person, (5) omission of questions, (6) use of additional programs, and (7) psychological characteristics of the tested person.

Given in the following sections are the coefficients of the model and the methods for measuring the coefficients.

##### 2.1. Coefficient of Accounting for Time Spent on the Answer

Tests contain questions of different levels of complexity (50% of them are easy (level *a*), 30% are average (level *b*), and 20% are difficult (level *c*)). Therefore, the score for each correct answer should depend on the complexity of the question, as well as on the time spent on it.

The maximum score *G*_{max} (subject to a quick response in less than 15 s) can be calculated from the formulawhere is the number of questions on levels of complexity , is the maximum score for answering easy questions (level *a*), is the maximum score for answering average questions (level *b*), and is the maximum score for answering difficult questions (level *c*).where is the number of questions in the test.

The maximum score can be typed with a different number of questions in the test.

Points scored for the elapsed time can be calculated by the formulawhere refers to the points for the correct *i*-question level *а*, refers to the points for the correct *i*-question level *b*, and refers to the points for the correct *i*-question level *c.*

The coefficient characterizing the share of knowledge at score “5” under the condition of a quick response (for a time not exceeding 15 seconds) is determined by the formulawhere is the number of points scored for the elapsed time, subject to the choice of the correct answer.

Provided that all the answers are correct and the time for making a decision of each answer will not exceed 15 seconds, the knowledge coefficient of the tested “excellent” is 1.

##### 2.2. Coefficient of the Number of Correct Answers

where is the coefficient that characterizes knowledge of all issues regardless of time, is the number of correct answers, and is the total number of questions.

##### 2.3. Coefficient for Time of All Testing

To take account of the time spent on the test (regardless of anything), we need to use the following formula:where is the coefficient characterizing the passage of the test, is the time spent on testing, and is the maximum allowable time that can be spent on testing.

##### 2.4. Coefficient of Doubt

The mathematical description of doubt as a significant component of the rational choice model, based on the axioms of independence, transitivity, convexity, and monotony of individual preferences, is represented in some way in the study, as discussed by Vinogradov and Kuznetsov [2]. A person builds a sequence of conclusions about the adequacy of his ideas in accordance with his subjective level of conviction. He is convinced of the adequacy of his views in the situation of choosing the type of Ω with respect to goal G, if he believes that the choice of the mode of action C on their basis will allow it to be achieved. At the same time, he perceives some of the characteristics of the X situation Ω; with respect to the other part, he makes assumptions and shows an intention to prove (to verify their plausibility).

The assumption is the default value of the observed characteristic or a description of the cause-effect relationship between the observed characteristics.

Representations of a person are characterized by a level of conviction. The level of his belief in his ideas about the situation of choosing type Ω with respect to goal G is determined by the frequency of its achievement when choosing the mode of action C based on them. The assessment of the level of conviction changes from zero to one. If the number of unsuccessful attempts to reach goal G when choosing the mode of action C based on human representations increases, then the level of conviction of a person decreases (and vice versa), which becomes an incentive for him to apply efforts for their modification or complete reconstruction due to the growing doubt in the plausibility of the assumptions made. The desire to verify the correctness of assumptions is a measure of a person’s doubts.

Thus, the efforts that a person expends to prove (refute) assumptions characterize the degree of his doubts about the ideas about the situation of choosing type Ω as he strives for goal G. According to the provisions of the theory of behavioral psychology, if the level of conviction that depends on the number of confirmations of the correctness of the choice based on representations increases, then the person’s desire for verification falls, since he does not see the point in this. The increase in the degree of doubt is an incentive for finding additional arguments (counterarguments).

A parameter that takes into account these two characteristics is the degree of conviction considered aswhere is the degree of conviction; is the level of conviction (past experience); is the degree of a person’s doubt about the correctness of his ideas about the situation of choice; and are the coefficients of significance that a person gives to his experience and the need to find evidence. Thus, from the preceding expression, one can derive a formula expressing the degree of doubt.

The degree of doubt in modern tests, in our opinion, is a latent parameter and can be measured only indirectly. If the degree of doubt and the degree of confidence *U* is estimated on a 100-point scale and expressed in %, then the formula for the degree of doubt will acquire the following form:

Quantitative characteristics are needed to measure latent parameters. The quantitative parameters for measuring the degree of doubt of the user are the following variables: the number of missed operations, the amount of unconfirmed information, the amount of heterogeneous information (ambiguous from the first time), and the state of the logical chain (the sequence of actions, levels of complexity, etc.).

To determine the level of doubt, we used a superposition of *n* models, each of which determines the latent parameter of doubt on the quantitative parameters.

Thus, we can postulate the following:(1)At the initial level of complexity, the measured level should be the lowest level of doubt.(2)On the average level of complexity, the level of doubt should be greater for the advanced level, but less for the initial level.

The quantitative parameters for measuring the level of doubt of the user are the following:(i)The number of missed operations(ii)The amount of unconfirmed information(iii)The amount of heterogeneous information (ambiguous from the first time)(iv)The state of the logical chain (the sequence of actions, levels of complexity, etc.)

To take into account the user’s level of doubt, the following method is suggested:(1)A test is organized, containing *i*-questions, under the condition *i* > 0.(2)Each *i*-question contains *j* variants of answers under condition 2 < *j* ≤ 5.(3)The testing questions are divided into training elements. Each training element contains questions of *S* different levels of complexity. The question contains only one correct variant *G*_{i} for the *i*-question, *G*_{i} > 0, *G*_{i} ≥ *n*_{ij}*.* Each *j* variant can be preselected and subsequently confirmed with the only accepted variant of the answer *k*_{ij} under the condition *j* = 1. Each variant of the question has the ability to switch *m* times. Each question contains a ability to skip an *i*-question only once, that is, . Then, the amount of heterogeneous information (ambiguous from the first time) can be determined by the number of switching options .

Consequently,where is the coefficient per doubt of *i*-question, is the number of variants of answers in the question, is the coefficient that characterizes the weight of switching from the correct to the wrong answer, is the coefficient that characterizes the weight of switching from wrong to correct answer option, is the coefficient that characterizes the weight of switching from wrong to wrong answer option, is the number of switches from right to wrong answer, is the number of switching from wrong to the correct answer, and is the number of switching from wrong to wrong answer option.

Let us find the average coefficients of doubt to questions of level , respectively,where is the number of difficulty levels, is the number of level *a* questions, is the number of level *b* questions, and is the number of level questions.

The final coefficient, which characterizes the average level of doubt of the learning element, is calculated by the formulawhere is the number of difficulty levels.

When S = 3 (3 levels: initial, intermediate, and advanced) of each educational element and states 2 (1, answered; 0, did not answer), there are 2^{3} = 8 various states of the logical chain, each of which has its own level of doubts in the matrix states of the level of complexity logical chain. The level of doubt in the matrix depends on the subject area, compiled by the expert.

The weight of the doubt level at *s* = 3 for this model corresponds to *A*-50%, *B*-30%, and *C*-20%. The user is given a portion of information of *A*, *B*, and *C* levels of this training element. After that at Table 1, the truth table is analyzed, and the decision is made.

Amount of unconfirmed information is *F*. If a negative answer was received to the question, then this question is randomly formed even *m* times during the check-up phase. If the answer is correct the second time, then the chain is simulated a second time, taking into account the level of complexity.

This model defines a latent parameter that is a response to a decision taking into account the level of complexity. The essence of the model is to confirm the doubt of the decision taken by comparing it with a decision of this type.

The latent parameter of this model determines the following quantitative characteristics:(i)Subjects(ii)Complexity(iii)Decision

We propose the following method. Testing of *x* questions is organized. All *x* questions are divided into *t*-groups on a certain topic. Each question has one correct answer.

In testing, all *x* questions are divided into *n* complexity levels. Each question has its own level of complexity. The proportion of questions on levels of complexity must satisfy the following condition: for easy-50%, medium-30%, and complex-20%. Each level of complexity corresponds to the weighting factor *F* (*F*_{max}-the most complex, *F*_{max−1}-less complex, etc.).

Doubt questions are exposed to which the correct answer was given at all levels of complexity, except the first. Doubts when answering a question of level *F* are calculated from the answers to questions of the same subject (i.e., the same group) whose complexity is lower than the level of complexity of the current question. Figure 1 shows graph of the dependence of the coefficient *K*_{i} of the weight of doubts on the current level of complexity of the question (logarithmic).where is the number of difficulty levels below that for which the calculation is carried out:where is the weighting factor of the current difficulty level; is the weight of doubts per question below the current level:

*W*_{i} is the total weight of doubt level *i*-th question:where is the number of questions of the *i*-th level of complexity of the same subject as the question for which the doubt is calculated; is the decision taken (the answer to the question).

Proceeding from the fact that there can be several questions of the same level of complexity in the test, there is a need to find the mean-square value of the doubt at each level:

The final coefficient characterizing the degree of doubt is calculated by the formula:where is a doubt in the answers to the question of complexity and is the doubt weight factor in the answers to the question of the complexity of :where is the maximum weight of the highest level. Figure 2 shows graph of dependence of the coefficient of total doubts on the current level of complexity of the question (logarithmic).

Despite the fact that this method uses a small number of characteristics (subject, complexity, and decision) for the calculation, it gives an estimate of the doubts of correctly accepted decisions, which is unquestionably significant for the interpretation of the final test result.

For understanding the developed method of calculating the level of doubt, it is suggested to consider the following example with specific input data.

Let the number of questions = 10, the number of levels *F* = 3, the coefficients of the complexity levels of the question be *A* (easy) = 1, *B* (medium) = 2, and *C* (complex) = 3. Correct answers of the user in difficulty levels: *N*_{A} = 4, *N*_{B} = 2, and *N*_{C} = 1. It is required to determine the coefficient of doubt on the average and complex levels of complexity of the questions on which the correct answers were given.

*Solution:*(1)The number of questions from 3 levels of complexity (according to the method) must satisfy the condition: Questions of the level of complexity *A* = 50%, level *B* = 30%, and level *C* = 20% Then, Number of questions of level *A* (simple) = 5 Number of level *B* questions (averages) = 3 Number of questions of level *C* (complex) = 2(2)Calculation of the weight coefficients of doubts of the level *C* in accordance with formula (13): *M* = *F* − 1 = 3 − 1 = 2 The weight of doubt for level *A* in accordance with formula (14): The weight of doubt for level *B* in accordance with formula (14):(3)Calculation of the total weights of doubt for level *C*: The user answered *N*_{A} = 4 out of 5 questions, *N*_{B} = 2 of 3, *N*_{C} = 1 of 2 questions, means number of questions on which the wrong answer was given: = 5 − 4 = 1, = 3 − 2 = 1, = 2 − 1 = 1 Let us find the total weight of doubts of level *A* (*i* = 1) in accordance with formula (15): Let us find the total weight of doubts of level *B* (*i* = 2) in accordance with formula (15):(4)Calculation of doubt on the level *C* questions in accordance with formula (17): Percent of doubts on the issue of the *C* level = 24%(5)Calculation of the weight coefficients of doubts of level *B* in accordance with formula (13): *M* = *F* − 1 = 2 − 1 = 1 The weight of doubt for level *A* in accordance with formula (14):(6)Calculation of the total weights of doubt for level *B*: Let us find the total weight of doubts of level *A* (*i* = 1) in accordance with formula (15):(7)Calculation of doubts on the level *B* questions in accordance with formula (17): Percentage of doubts about the level of *B* = 20%(8)The meaning of doubt in accordance with formula (19) = 20%

Answer: Doubts arose in 3 correct answers of the student, because questions are raised only on the average (*B*) and complex (*C*) levels of complexity.(1)In 2 responses of level *B*, where the correct answers were given, the level of doubt is 20%.(2)In the question of level *C*, where the correct answer was given = 24%.

##### 2.5. Coefficient of Missing Questions (Savings/Waste of Time)

If the test program is able to skip the question, then it is logical to keep a record of the gaps that affect the probability of nontripping:where is a coefficient characterizing nontransmissions and , , and are the number of passes in the levels *a*, *b*, and *c*.

In order to keep the coefficient correctly recorded, it is necessary to enter restrictions, the maximum permissible number of passes in the levels *a*, *b*, and *c*.

, , and are the maximum number of omissions in questions on levels *a*, *b*, and *c*. Provided that , , .

##### 2.6. Coefficient of Using Additional Programs

If, for example, there are problems in testing that require mathematical calculations, then it is necessary to include the “Calculator” in the testing interface. Thus, keep a record of the amount of use of the calculator:where is the utilization rate of additional programs, is the number of tasks requiring calculation, and is the amount of use of the calculator.

In this example, the calculator is a useful element of encouragement. There are undesirable but accessible elements of use-hints.

The final formula for calculating the total coefficient characterizing knowledge on “5” is as follows:where is the coefficient of knowledge which is excellent under the condition of rapid response, is the coefficient of knowledge of all questions regardless of time, is the test passing ratio, is the coefficient of confidence, is the nonrolling ratio, and is the coefficient of using additional programs.

It was experimentally deduced that *a*_{1} = 0.23, *a*_{2} = 0.5, *a*_{3} = 0.07, *a*_{4} = 0.1, *a*_{5} = 0.07, *a*_{6} = 0.03, and (*a*_{1} + *a*_{2} + *a*_{3} + *a*_{4} + *a*_{5} + *a*_{6}) = 1.

By calculating the coefficient *K*, the coefficient of knowledge for “excellent,” we can compile Table 2 and determine the assessment.

The final formula for *K*:

The multicriteria model is valid only for testing, which takes into account the number of answers, the level of complexity of questions depending on the time spent on their response, the time of all testing, doubts, the number of omissions, and the use of additional programs.

#### 3. Experimental Part

##### 3.1. Setting the Experiment

To test the developed model, a large-scale experiment was conducted during two academic years. The experiment involved about 1000 students of 29 courses. All students were divided into 2 groups.

All students took midterm control tests (Att), which the teacher administered and evaluated during the semester.

The first experimental group of students (500 people) passed the final exam (Exm) at the end of the course in the format of computer testing (5 answers and 1 correct one). For the first group, the final grade (*F*) for the course is calculated using

The second experimental group of students (500 people) at the end of the same courses passed the exam in the format of computer testing on the basis of the multicriteria model (*M*), proposed in this study. The results of their average estimates (*M*) are given in Table 3.

As a result of the experiment, it is required to determine the quality of the developed multicriteria model and its suitability for an objective assessment of the level of knowledge.

The hypothesis of the experiment is that the average values of the final assessment (*F*) based on the boundary control according to Formula (32) correlate with the average values of the estimates obtained on the basis of the model under study (*M*). In the experiment, the hypothesis will also be tested that there is a correlation between the mean values of the boundary control (Att) and the model estimates (*M*).

The results of the experiment will make it possible to determine the quality of the proposed multicriteria model, which is not worse than the totality of teacher evaluations (60%) and computer testing (40%).

However, the proposed multicriteria model for computer testing, which takes into account the number of answers, the level of questions complexity depending on the time spent for answer, the time of the entire test, doubts, the number of omissions, and the use of additional programs, allows us to bring the % of successful random answers to the minimum, thereby increasing the objectivity of the level of knowledge assessment, having made them close to an assessment of the teacher, but not of the computer program.

During the experiment, the following data were obtained (Table 3).

##### 3.2. Experiment Processing

The null hypothesis H0 is a statistical hypothesis to be verified by statistical data, the results of observations included in the sample. Of the possible statistical hypotheses, choose the one that assumes equity, and it is the most important for further conclusions.

An alternative hypothesis is a statistical hypothesis that is considered valid if the null hypothesis is incorrect.

Statistical criterion is a rule according to which on the bases of observation results, a decision is made whether to accept or reject the null hypothesis. In this article, the experimental test of the model adequacy is chosen as the criterion or the rule.

The article puts forward the null hypothesis that the coefficient of doubt, introduced into the model as one of the factors, affects the objective assessment of the knowledge of the person being tested.

In this regard, for a more objective assessment of test results, a multicriteria knowledge assessment model is proposed. This model was compiled and tested in previous works of the authors.

This article experimentally proves the effect of the additionally introduced factor, which indicates the level of doubt of the person being tested on an effective assessment of his knowledge. This shows the adequacy of the proposed model. In the article, this type of system evaluates according to the following criteria:(1)Time spent to respond(2)Number of correct answers(3)Time of the entire test(4)Level of doubt(5)Skipping questions(6)Using the additional functions

One of the most difficult tasks is to take into account consideration of doubts depending on the complexity of the issue. In order to keep records of doubts depending on the level of question complexity, it is necessary that the program be with a preliminary choice of the answer, and subsequently with its confirmation.

Consequently, it all comes down to the number of preliminary answers, i.e., to the number of doubts. In this regard, the authors came to the conclusion that the coefficient of doubt level included in the model depends on the following variables:(i)Number of missed operations(ii)Amount of unconfirmed information(iii)Reaction to decision-making(iv)Level of difficulty(v)Number of logical chain interrupts(vi)Doubtful user actions

The multicriterial model is valid only for adaptive testing, which takes into account the number of answers, the level of complexity of questions depending on the time spent on their answer, the time of the entire test, doubts, the number of omissions, and the use of additional functions.

According to the experimental data, the multicriteria model of knowledge testing described in the article objectively assesses the level of knowledge of the person being tested and sets the appropriate assessment.

As a result of processing the obtained data, the Pearson correlation coefficients were calculated:where refers to values of the variable *X*, refers to values of the variable *Y*, is the average value for variable *X*, and is the average value for variable *Y*.

Figure 3 shows the relationship between the values of the intermediate control (Att) and the examination evaluation of computer testing (Exm).

The correlation coefficient between Att and Exm = 0.65 (*K*_{1}).

The figure shows that the results of the assessments delivered by the teacher (Att) are weakly correlated with the assessments of the computer testing exam.

Figure 4 shows the relationship between the values of boundary control (Att) and computer testing based on the multicriteria model (*M*).

The correlation coefficient between Att and *M* = 0.94 (*K*_{2}).

Figure 5 shows the relationship between the final grade for the course (*F*) and computer testing based on the multicriteria model (*M*).

The correlation coefficient between *F* and *M* = 0.95 (*K*_{3}).

#### 4. Discussion

The hypotheses that have been advanced are confirmed by the results of processing the experimental data, which show that the model evaluation (based on the multicriteria approach) and the final assessment, as well as the model and assessment of the boundary control obtained on the basis of standard approaches, have a high level of coincidence, which is confirmed by a high degree of pair correlation of *K*_{2}, *K*_{3}, and visual confirmation (Figures 4 and 5).

Thus, it can be argued that there is a sufficiently high functional connection (close to linear) between the final assessment for the course and computer testing on the basis of the multicriteria model, which makes it possible to exclude the boundary control conducted directly by the instructor, without violating the high degree of objectivity in assessing student knowledge. This result is of great importance for the organization of distance learning with the automated assessment process.

#### 5. Related Work

Tojo et al. [3] aim to implement the prelearning and group discussion in a consistent e-learning system, which develops the human-robot interaction for the purpose of creating a social, autonomous robot capable of conversing using various human-like methods such as body language, hand gestures, facial expressions, gaze, and touching.

Joksimovic et al. [4] put forward a model, which intended to guide the future work studying the association between contextual factors, student engagement, and learning outcomes.

According to Firat et al. [5], motivation that initiates and sustains behavior is one of the most significant components of learning in any environment.

Farhan et al. concluded that students’ interaction and collaboration using the Internet of Things (IoT) based on interoperable infrastructure is an effective learning practice [6]. Measuring student attention is an essential part of educational assessment.

Eom and Ashill argue that a significant reduction in dependent and independent variables and their measures are necessary for building an e-learning success model, and such a model should incorporate the interdependent (not independent) nature of e-learning success [7].

Bhattacharya et al. present an intelligent recognizer of the cognitive state of an e-learner as an integral part of the confidence-based e-learning (CBeL) system [8]. It addresses the problem of providing technology-driven pedagogical support to an e-learner to achieve the desired cognitive state of mastery, which is endowed by high levels of both knowledge and confidence.

Marcelino et al. offered Learning Computational Thinking concepts which have gained an importance in the last years [9].

Debiec mentioned that, in this paper, a case study is presented based on one offering of an introductory digital system course taught with a combination of learner-centered strategies selected to overcome these barriers and improve student’s performance [10].

Gul et al. identified various issues and challenges faced by massive open online courses (MOOCs) while offering open online courses to vast number of learners [11].

Farid et al. propose a sustainable quality assessment approach (model) for the e-learning systems keeping software perspective under consideration [12].

Bradac and Walek present the proposal, design, and implementation of a new approach to adaptive e-learning systems [13]. Adaptivity is considered as an ability of the system to adapt to student’s knowledge and characteristics.

Ren et al. present an online course applicability assessment (OCAA) to assist learners in course selection [14]. Three main characteristics are taken into consideration, including “learning style,” “learning behavioral type,” and “prior knowledge” which considerably affect the e-learning effectiveness.

Cohen and Anat focus on the fact that identifying the changes in student activity during the course period could help in detecting at-risk learners in real time, before they actually drop out from the course [15].

In recent years, designing adaptive e-learning systems has become one of the striking topics of discussion in the literature. Hamada and Hassan describe a case study on the integration of the learning style index into an adaptive and intelligent e-learning system [16].

Levina et al. studied the continuous information development of all spheres of education: integration of new knowledge, accessibility of information technologies and computer facility aids, professionalization, and computerization of educational activities [17].

Sweta and Lal proposed learner model mines and learner’s navigational access data and found the learner’s behavioral patterns which individualize each learner and provide personalization in the learning process according to their learning styles [18].

Bralic and Divjak blended a learning model where a MOOC has been integrated in a traditional classroom [19]. The learning outcomes-based approach was implemented that supported a balanced student workload.

Gregori et al. aim to design and analyze the implementation of a number of guidelines that allow us to effectively unify a high-quality teaching methodology and the use of new technologies in distance learning [20].

Quintana and Perez have analyzed massive, open, and online model course, describing their advantages and disadvantages, besides the formative and motivating factors that guide the students, not only to enroll in particular courses but also to request for their own space to be able to create a MOOC [21].

For the first time, a multicriteria method for measuring the level of knowledge of students during computer testing based on the level of doubt for E&M learning has been described by the authors in a number of works, as discussed by Serbin et al., [22–24] and applied for distance training.

#### 6. Conclusion

The multicriteria model allows to objectively assess the student’s knowledge level and, accordingly, to estimate the evaluation during computer testing. The model includes an original method of measuring the level of doubt, as a function of the time, and complexity of issues differentiated by thematic blocks. In addition, the assessment model itself being multicriteria, besides the function of doubt, includes the number of correct answers, the testing time, the use of additional programs, spent time on each response, the number of periodic passes in the testing process, and more. In diagnosing the level of knowledge based on computer testing, use of this method will reduce the probability of the influence of an accidental correct answer (“guessing”) on the final result and to obtain the most objective assessment.

The proposed model includes the following factors:(i)Number of answers(ii)Level of complexity of the questions depending on the time(iii)Spent on their response(iv)Time of all testing(v)Doubt(vi)Number of passes(vii)Use of additional programs

The practical significance of this model lies in the objectivity of assessing the level of knowledge, where computer testing is applied, especially relevant for the organization of distance learning using E&M learning. Additional effects of the application of this model may be the reduction of the time for organization, intermediate control procedures, labor costs, operating costs, processing and analysis costs, and others, research planned in subsequent publications.

In conclusion, we can make a statement that the null hypothesis put forward by the authors is proven experimentally and the study demonstrates the original methodology for measuring the level of doubt of the tested students in the information and training system. The use of this methodology allows making the most objective decision on the assessment of students’ knowledge.

In diagnosing the level of knowledge based on computer testing, the use of this methodology makes it possible to reduce the probability of the random factor influence, such as “guessing” on the final result and to obtain the most objective assessment. Moreover, to control the learning process, measuring the level of doubt allows taking into account the psychological characteristics of human behavior during training.

#### Data Availability

The raw data that support the findings of this study are available from the corresponding author upon reasonable request.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.