Abstract

The accurate differentiation of the subtypes of benign paroxysmal positional vertigo (BPPV) can significantly improve the efficacy of repositioning maneuver in its treatment and thus reduce unnecessary clinical tests and inappropriate medications. In this study, attempts have been made towards developing approaches of causality modeling and diagnostic reasoning about the uncertainties that can arise from medical information. A dynamic uncertain causality graph-based differential diagnosis model for BPPV including 354 variables and 885 causality arcs is constructed. New algorithms are also proposed for differential diagnosis through logical and probabilistic inference, with an emphasis on solving the problems of intricate and confounding disease factors, incomplete clinical observations, and insufficient sample data. This study further uses vertigo cases to test the performance of the proposed method in clinical practice. The results point to high accuracy, a satisfactory discriminatory ability for BPPV, and favorable robustness regarding incomplete medical information. The underlying pathological mechanisms and causality semantics are verified using compact graphical representation and reasoning process, which enhance the interpretability of the diagnosis conclusions.

1. Introduction

Benign paroxysmal positional vertigo (BPPV) is a major cause of vertigo and accounts for approximately 17–42% of the cases. It has a lifetime prevalence of 2.4% in the general population [1]. BPPV is caused by displaced otoconia in the semicircular canal, and its clinical characteristics include a brief, episodic, and position-provoked vertigo. BPPV typically causes balance disturbance which considerably increases the risk of falls in patients. Furthermore, it causes malignant secondary damage, especially for elderly people [2]. The commonly used therapy for BPPV is the particle repositioning maneuver (PRM), also known as the canalith repositioning procedure. Theoretically, almost all patients can be readily treated with accurate diagnosis and pathogenesis [3, 4]. However, the average resolution rate with one PRM treatment for posterior semicircular canal BPPV in many trials was revealed to be only 8.6%. Notably, there are various treatment options; therefore, the effectiveness of PRM (such as the Epley and Semont maneuvers) depends on the accurate classification of BPPV. The classification process would include determining the underlying pathogeny, locating the calcium carbonate debris and identifying whether the material is free-floating or adherent to the cupula, among others. BPPV can be classified into many subtypes in clinical practice, such as the idiopathic BPPV and secondary BPPV, or the typical BPPV and atypical BPPV. However, distinguishing among several causes of vertiginous disorders with similar symptoms is quite complex. Therefore, clinical subtype differentiation of BPPV is important for effective repositioning maneuvers in treatment to reduce inappropriate vestibular suppressant medications and minimize unnecessary ancillary tests.

In recent years, some automatic multiaxial positioning devices, electronystagmograms, and videonystagmography have been applied in the management of BPPV [5]. These applications have digitized and standardized, laying a foundation for computer-aided intelligent diagnosis and clinical decision-making. Medical data typically contain hidden and complicated patterns and causality semantics; thus, appropriate encoding and interpretation of medical data result in better diagnosis, medicine, and treatment [6, 7]. Numerous research has been done on this; for instance, in the management of vertigo, there is a wide spectrum of potential ailments and intricate confounding factors which complicates the accuracy of identifying the different types of vertiginous disorders. Based on this, several otoneurological expert systems including VERTIGO [8], Carnisel [9], and otoneurological expert system (ONE) [10, 11] were developed. Notably, ONE is a mature system that has been validated in practice, and its inference approach resembles methods of weighted k-nearest neighbor (k-NN), genetic algorithm (GA), fuzzy logic, and decision tree.

Previous attempts have been made at exploring methods for automatic differential diagnosis of diseases. A method for differentiating pancreatic serous from mucinous cystadenomas based on morphological features extraction was proposed in [12]; this approach utilized classifiers such as Bayesian, k-NN, support vector machine (SVM), and artificial neural network (ANN). Furthermore, a method for integrating Hopfield networks, retrieval networks, and k-NN to mine the medical data for differential diagnosis was proposed in [6]. Moreover, Lopes et al. developed a decision-support system based on the fuzzy cognitive map to discriminate the diagnoses of alterations in urinary elimination [13]. A supervised machine learning algorithm based on the combination of principal components analysis and SVM for the differential diagnosis of Parkinsonian syndrome (PS) was proposed in [14]. Mudali et al. [15] studied the classification of PS using decision trees based on the scaled subprofile model and principal component analysis methods. Ota et al. [16] proposed a differential diagnosis tool for PS by applying the discriminant techniques derived by stepwise methods. The SVM and logistic regression-based models were developed in [17] for the classification and prediction of PS. A method based on the Bayesian network (BN) was proposed in [18] to perform classification of early glaucoma and cluster data into different stages of the disease. Moreover, multidimensional BN classifiers are proposed to assist the treatment of multiple sclerosis [19]. A BN-based method was used for classification of EEG-based multiclass motor imagery BCI [20]. Furthermore, a method was proposed in [21] for individualized characterization and diagnosis of cognitive impairments. Melin et al. [22] described a method of competitive neural networks and the learning vector quantization algorithm for classification of electrocardiogram signals. Finally, an expert system was provided in [23] for the differential diagnosis of strabismus based on the architecture of ANN and the Levenberg–Marquardt back propagation method.

There are several challenges in this field of research. First, it is difficult to faithfully model the pathogenesis and pathophysiology characterized by individual differences, and uncertain factors including intricate and confounding disease factors, and insufficient sample data, among others. To address this problem, we believe that medical knowledge and experience could play an important role. The knowledge derived from clinical data and that specified by the domain experts should be integrated to provide a basis for practical decision-making [24, 25]. The introduction of expert knowledge has been further recognized as an effective solution for reducing the inherent uncertainty of the models based on automatic learning methods [26].

Second, many diagnosis approaches are mostly deficient in handling incomplete examinations and tests in clinical practice. The dependence of the models on the completeness of medical data limits some methods’ functions in cases where the information is incomplete or imperfect. Some approaches have been proposed to deal with cases of missing data in the diagnosis of chronic obstructive pulmonary disease; these include evaluating the similarity of attributes with unknown values or filling the gaps statistically with plausible values [27]. Although there has been research on the plugging missing values by identifying the regularities in the occurrence model, very few studies have been devoted to effectively improving the diagnostic reasoning method in cases where there is incomplete information. Notably, in disease diagnosis, using incomplete information to accurately diagnose a disease reduces the need for unnecessary medical examinations.

Third, although the probability is known to be capable of providing a sound and flexible procedure for interpreting uncertain evidence with which the clinical practice is confronted, it is difficult to make a conclusion readily comprehensible to the users. It is important to determine the underlying etiology and pathology for disease occurrences, the process of systematic diagnosis, and a means of confirming that the right conclusion has been reached. Therefore, an explicable diagnostic reasoning model that matches the method for drawing conclusions is essential. Furthermore, undesirable interpretability or explicability is a common hinderance for the practical application of most methods.

In summary, it is well known that distinguishing the wide range of vertigo causes is a complicated endeavor, even for experienced physicians. Even though BPPV is suggested in the diagnostic reasoning outcome, the subtype cannot be easily discriminated. In our opinion, an in-depth understanding and appropriate representation of the pathogenesis and pathophysiology characterized by individual differences contribute to the diagnostic accuracy. This study, therefore, seeks to develop an efficient and easy-to-implement differential diagnosis method for BPPV based on the dynamic uncertain causality graph (DUCG). The main focus of the proposed method is solving the three aforementioned problems of the knowledge-based modeling, incompleteness of the medical data, and interpretability of the method.

The knowledge representation and inference methods of DUCG are introduced in [2831]. In recent years, DUCG has been applied in several medical diagnostic systems involving multiple diseases such as jaundice, syncope, and sellar region disease, among others. Our earlier work in [30] presented a modeling and reasoning methodology for vertigo diagnosis and developed a decision-support system which covered 22 common vertigo diseases. However, the methods are not completely applicable to the differential diagnosis problems. Therefore, this paper proposes new algorithms for reasoning about BPPV subtype differentiation.

The structure of this article is as follows: Section 2 describes the proposed method based on DUCG, including the uncertain causality representation, the BPPV model, and the differential diagnostic reasoning algorithms. In Section 3, a series of verification experiments using clinical cases are presented, followed by a comparative analysis and an application analysis. Finally, Section 4 contains the conclusions.

2. Methods

2.1. DUCG Causality Representation and Diagnostic Reasoning Method
2.1.1. Uncertain Causality Representation Scheme

The DUCG graphically and compactly represents the events and causalities among events. Figure 1 shows an example of DUCG. An X-type variable (oval-shaped node) represents an observable effect event, which can include medical information such as symptoms, signs, and examination findings. A B-type variable (square-shaped node) denotes a root cause event or a disease origin. A G-type (logic gate node) variable represents the combinational logic relations between its inputs and outputs (e.g., G12 in Figure 1(a) with its state expression specified in Table 1). A D-type variable (pentagon node) represents the default or unspecified cause of an X-type variable. A certain state j of variable ( can be a B-, X-, G-, or D-type), referred to as , represents an event; state number j = 0 indicates a normal state and j ≠ 0 indicates an abnormal state.

The arc represents a virtual functional event that describes the causality between a child event Xn,k and a parent event . denotes the independent causality function that causes . The intensity of causal relationship between and is defined as , and , . As a weighting factor of An;i, the coefficient () balances these independent causality functions for all parent events on a same child event. For A/F/a-type variables, the variable subscripts are in the format of “child; parent,” and the variable identifiers in lower case letters denote the probability parameters of the variables in corresponding upper case letters, for example, and .

The causality representation mechanism of DUCG is a weighted logic event expansion, which is denoted below:

A child event can be logically expanded into a weighted sum of independent causality functions from all its parent events . Thus, the multivalued causalities between child-parent events in each pair are independently represented, rather than using joint probability distribution as in the other models such as BN. The use of a compact causality representation accords the intuitive cognition of people to the real world and is more explicable. For DUCG, it is convenient to represent the causalities and parameters based on findings of medical research and clinical knowledge, or statistical learning from clinical sample data.

2.1.2. DUCG-Based Diagnostic Reasoning Method

The diagnostic inference method includes three steps: (1) simplifying the original causality graph based on observed evidence; (2) weighted logic event expansion and logical reasoning for evidential events; and (3) probabilistic reasoning. If Hk,j represents a candidate hypothesis, i.e., a possible root cause (disease origin) of evidence E, then the state probability of Hk,j, conditioned on E and denoted by , is calculated aswhere E is defined by . Every evidence event is independently expanded into logic expressions according to (1) by tracing their upstream causality chains back towards the root cause event(s), respectively. Such an inference process is called chaining inference. Meanwhile, some basic logic operations, including OR, AND, NOT, XOR, absorption, exclusion (events are mutually incompatible), and complementation, are performed. Thus, the candidate hypothesis space can be obtained. Finally, the posterior probabilities of the hypothesis event can be calculated. In case of more than one candidate hypotheses in SH, the state probability of is modified by the weight coefficient defined on the prior probability of evidence (referred to as , ) in the causality context of each hypothesis :

The probabilistic normalization of weighted logic event expanding expressions, i.e., , holds automatically because , , and . By virtue of such an automatic normalization feature, the chaining inference can be proven as self-reliant such that is calculated without the use of , and the irrelevant parameters (e.g., ) can be absent without affecting the inference accuracy. Moreover, the variation of parameter values in the numerator of (2) corresponds to the variation in the parameter values in the denominator. It is thus the ratio of the parameter values in the event expanding expressions of , to that of that has an actual effect on the final result. Based on the aforementioned features, the exact inference can be used in case of incomplete and inaccurate parameters and clinical data.

The causality graph in Figure 1 can be used as a calculation case to clarify the details of the diagnostic inference algorithm. Let us suppose that the parameter matrices are given as follows (for the sake of simplicity, the weighting factors rn;i are all set to 1). For a matrix element (e.g., a3,1;7,1, a5,1;2,2, and a6,2;8,1), k and j correspond to the row number and column number of the matrix, respectively. The symbol “−” indicates unknown, unavailable, or irrelevant parameters:

Besides the normal or unknown variables, suppose that the evidence for the DUCG example in Figure 1(a) is obtained as . By applying the simplification rules of DUCG [28, 30], all the irrelevant, incorrect, and meaningless causalities and events can be eliminated from the graph based on the medical evidence. Thus, the simplified graph is obtained as shown in Figure 1(b). DUCG uses different colored nodes to distinguish the X-type variables’ states: green indicates a normal state indexed by 0, sky blue and blue indicate state 1 and state 3, respectively, yellow and brown indicate state 2 and state 4, respectively, and colorless indicates an unknown state, implying an unconfirmed/unidentifiable pathological feature or an unexamined item.

Based on (1) and Figure 1(b), the expanding expressions of evidence X3,1, X5,1, and X11,2 in the causality context of B2,1, respectively, are shown below:

Likewise, other abnormal evidence is expanded, and the expanding expression for E is obtained as

Similarly, the expanding expression of E in the causality context of B1,1 is

Finally, hypothesis space under evidence E is obtained as follows: , where and . Therefore, the two hypotheses are determined as the probable causes of the observed evidence, as both can independently interpret all the symptoms. The probabilistic calculation results are and . The inference results can be explained using the graphical causality semantics as shown in Figure 1(b). Based on these semantics, users can not only deduce the results but also understand their logic.

Based on the self-reliant chaining inference, the missing parameters, such as a3,1;7,2, a7,2;8,1, a11,2;12,1, a10,2;10D, a9,1;6,1, and a6,1;8,1, do not have an impact on the reasoning result. Moreover, the most probable hypotheses could be typically determined uniquely, just within the logical reasoning process (before the probabilistic reasoning is performed). Thus the causality structure that accurately represents the disease pathogenesis is what really matters. It is evident that DUCG does not impose stringent requirements for completeness of the parameters and clinical examination data. Thus, it can be used to facilitate modeling through medical knowledge and experience.

2.2. Construction of Differential Diagnosis Model for BPPV
2.2.1. BPPV Characteristics

(1) Pathophysiology of BPPV. The vestibular receptor consists of two otolith organs (the utricle and saccule), which oversee linear acceleration and gravity, as well as three semicircular canals (SCCs) that sense angular acceleration. The SCCs include anterior SCCs (ASCCs), posterior SCCs (PSCCs), and lateral SCCs (LSCCs). The afferent nerves from otolith organs and SCCs project symmetrical activities to the central vestibular system to maintain the balance and spatial orientation. BPPV results from misplaced calcium carbonate crystals (otoconia) that are detached from utricle macula and collected within the SCCs due to trauma, infection, or even aging. These crystals either remain free-floating in the SCCs or become attached to the cupula. Normally, as the motion sensor of SCCs that are filled with endolymph, cupula is a gelatinous mass with the same density as endolymph. Since the calcium particles are denser than the endolymph and cupula, the SCCs become pathologically sensitive to linear acceleration and gravity; the afferent nerves from SCCs thus fire asymmetrically, leading to vertigo and nystagmus. When moving into the SCC, the calcium carbonate crystal debris may cause endolymph movement, which consequently stimulates the cupula of the affected canal during head movement or while stationary (this is called canalithiasis). Similarly, if adherent to the cupula, the particles may also activate it (this is called cupulolithiasis) [32].

(2) Clinical Characteristics of BPPV. BPPV is characterized by brief attacks of vertigo after lying down in bed, looking up or bending down, among others. The vertigo attacks in most cases last less than 1∼2 minutes and mostly occur at night or upon awakening [1]. The diagnosis rests on the observation of characteristic nystagmus accompanying symptoms of vertigo when a patient’s head is moved into a specific orientation with respect to gravity. This is due to shred calcium crystals from macula, which may cause age-related changes in the protein and gelatinous matrix of the otolithic membrane. Although most of BPPV cases are idiopathic, a significant proportion can be associated with preceding traumatic events including head trauma, dental treatment, and ear surgery. Other conditions, such as viral vestibular neuritis, otitis media, Ménière’s disease, idiopathic sudden sensorineural hearing loss, posterior circulation ischemia, and migraine, can also trigger cases of BPPV.

BPPV may affect each of the three SCCs. Alternatively, it can affect more than one canal simultaneously, resulting in varying nystagmus patterns [33]. Notably, due to its gravity-dependent position, the most commonly affected semicircular canal is the posterior canal. The positional and positioning tests (e.g., Dix–Hallpike diagnostic maneuver and supine roll test) can provide an accurate diagnosis for this condition. Furthermore, the features of nystagmus, including latency, direction, duration, reversal, and fatigability, are significant during diagnosis.

(3) Subtype Differentiation of BPPV. Based on the epidemiology, causes of disease, pathophysiology, anatomy pathology, clinical features, and treatment outcomes, a BPPV disease may be classified into distinct categories along four dimensions: (1) depending on the localization of the particles and the involved semicircular canals, there are four subtypes (PSCC, ASCC, LSCC, and multiple-SCC (MSCC)); (2) BPPV is subdivided into idiopathic BPPV and secondary BPPV, as it may occur as a primary disease or secondary to other otology disorders or systemic conditions [4]; (3) BPPV can be divided into canalithiasis and cupulolithiasis, depending on the nystagmus provoked characteristics from the positioning test based on the pathophysiology mechanism [32]; and (4) based on the clinical features and treatment outcomes, BPPV falls into two categories: typical and atypical. The atypical BPPV can be further identified as subjective BPPV, persistent BPPV, and recurrent BPPV. Subjective BPPV refers to the cases associated with a positive history of BPPV and Dix–Hallpike; or supine roll tests which are positive for vertigo, but negative for nystagmus [34]. A persistent BPPV refers to cases lasting for more than two weeks, while a recurrent BPPV indicates recurrence of BPPV in the same canal after a symptom-free interval of at least two weeks from a previously successful treatment [35].

2.2.2. Constructed Model for Differential Diagnosis of BPPV

As a causality representation to the aforementioned pathogenesis and pathophysiology of BPPV, a new DUCG-based causality graph was constructed based on the vertigo model presented in [30]. The BPPV differential diagnosis model is shown in Figure 2; it includes 125 variables (X-type: 85, D-type: 8, G-type: 21, and B-type: 11) and 286 arcs.

(1) Modularized Modeling Scheme. A modularized modeling scheme is applied in the construction of the causality graph shown in Figure 2. All BPPV subtypes are handled as individual sub-DUCGs. For example, the three sub-DUCGs representing the idiopathic, PSCC, LSCC, and cupulolithiasis BPPV are demonstrated in Figures 35, respectively. When the sub-DUCGs are merged, solutions are proposed for addressing the ambiguous, contradictory, and incomplete information to ensure global coherence [30]. Such a modularized modeling scheme reduces the difficulty in model construction using a divide-and-conquer approach.

(2) Definition of Variables. Based on the characteristics of BPPV and the international diagnostic criteria [36], the medical information of patients, including symptoms, signs, findings of examinations, medical histories, etiology and pathogenesis, pathophysiology, and socio-psychological and environmental factors is incorporated into the model. Table 2 lists some of the variables of the model shown in Figure 2.

(3) Representation of Causal Relationships. In this representation, arcs are established to represent and quantify causal correlations among related symptoms and disease origins; complex causalities are denoted by a combination of causal functions via logic gates. All causalities and parameters in Figure 2 are determined by otoneurologists, based on their knowledge, experience, epidemiology statistics, and research achievements. As stated before, both the incompleteness of knowledge and imprecision of parameters have little impact on the reasoning accuracy; thus, DUCG is more flexible for the model construction.

2.3. The Algorithm of Differential Diagnostic Reasoning

Differential diagnosis aims at narrowing down the diseases to the most probable one from a list of candidate diseases that show similar symptoms. The differential diagnostic reasoning algorithm is presented as Algorithm 1, which implements a hypothesis-driven process. The posterior probability of the hypothesis event is used to quantify ’s ability to interpret the medical evidence, E. Notably, a rare disease does not necessarily respond to a weak causality. If a rare disease (with a low prior probability) better explains a patient’s symptoms and medical evidence, then the strength of the causality functions can be high; thus, the disease will get a higher ranked probability as a hypothesis event.

Begin
For a BPPV patient
  Collect medical evidence, E, including symptoms, signs, examination findings, laboratory tests, and medical histories
  Perform causality simplification to the BPPV causality graph based on the evidence E
   Apply the causality simplification mechanisms including the pruning strategy and causality tracking process
   Obtain the simplified diagnostic causality graph
  Perform the algorithms of weighted logic inference based on the simplified causality graph
   Perform the weighted logic event expansion to the evidence according to (1)
   Perform logic inference according to (2)
    Apply the weighted logic inference rules 1∼6 of differential diagnosis for BPPV
    Apply the scheme of dummy basic variable
    Perform the basic logic operations
   Obtain the hypothesis event space,
  Perform the probabilistic reasonings to obtain the posterior probability of conditioned on E,
   Calculate the ranked independent probability of a BPPV subtype
End
2.3.1. New Causality Simplification Mechanisms

Based on the collected medical evidence E for a patient, the process of causality simplifications is performed to the BPPV causality graph. Causality simplification aims to eliminate the irrelevant, meaningless, and impossible events and relations from a causality graph. Therefore, the model scale and complexity are both significantly decreased, and the disease hypothesis space converges. The root cause event can even be preliminarily determined based on the simplified causality graph.

As a complement to the original simplification rules of DUCG [28, 30], several new causality simplification mechanisms, including a pruning strategy and causality tracking process, are presented.

Definition 1 (Pruning Strategy). The meaningless disease hypothesis and its corresponding causalities are identified and pruned from the causality graph using a scheme of causality tracking for all the abnormal evidences.

Definition 2 (Causality Tracking Process). The causality tracking process starts from an observed event and traces its upstream causality chains and ancestors back towards the B-type event(s). For any Bi, only if all the abnormal evidence is reachable to it via a causality tracking process, can it be identified as a candidate hypothesis; otherwise, Bi should be eliminated from the hypothesis space, and its corresponding causalities should be pruned from the graph.

2.3.2. Weighted Logic Inference Rules of Differential Diagnosis for BPPV

Based on the simplified causality graph and medical evidence, the process of logical inference is performed to obtain the hypothesis space according to (1)∼(3). Our earlier study showed that by applying event expanding and logical operations, the logical inference can effectively decrease the complexity of probabilistic reasoning [28]. However, in response to BPPV differential diagnosis, the weighted logic inference algorithm requires major modifications.

Some BPPV subtypes can be present in a patient simultaneously. For instance, a diagnosis can be termed as an ASCC-idiopathic-canalithiasis BPPV. Such a case is known as concurrency of subtypes. Meanwhile, the subtypes belonging to the same category are mutually exclusive when a diagnostic conclusion is drawn. Some examples of this are PSCC-BPPV (B23,1), LSCC-BPPV (B24,1), ASCC-BPPV (B25,1), and MSCC-BPPV (B26,1). Therefore, to deal with these cases, new weighted logic inference rules for exclusion and concurrency, among others, are developed as shown below.

Rule 1. Suppose that the basic events (namely, root cause events) in a BPPV differential diagnosis model are represented in terms of category sets as C1 = {B23,1, B24,1, B25,1, B26,1}, C2 = {B27,1, B28,1}, C3 = {B29,1, B30,1}, and C4 = {B31,1, B32,1, B33,1}. State 1 of a B-type event indicates that the root cause event has occurred; on the contrary, state 0 of a B-type event indicates that the root cause event has not occurred. The mutual exclusion rules for basic events in BPPV differential diagnosis can thus be defined as , where .

Proof. From the definition, the category sets C1, C2, and C3 indicate different classification perspectives of BPPV. Therefore, all basic events within the same category set are exclusive and exhaustive. Specifically, there is B23,1B24,1 = 0, B23,1B25,1 = 0, B23,1B26,1 = 0, B24,1B25,1 = 0, B24,1B26,1 = 0, B25,1B26,1 = 0, B27,1B28,1 = 0, and B29,1B30,1 = 0.

Rule 2. for () and for (, ).

Proof. The events in C4 describe the respective different types of atypical BPPV, in an overlapping manner. Namely, all the basic events in {B31,1, B32,1, B33,1} are likely to be concurrent. This sets the basis for Rule 2.

Definition 3 (Maximizing Concurrent Basic Event Set Strategy). To obtain an ultimate disease hypothesis , if the events that constitute are concurrent and consistent with each other, the maximum number of basic events within the hypothesis space SH should be incorporated.
It is essential to explore every possible outcome before ruling out any basic event. For example, as SH = {B24,1, B28,1, B30,1} is obtained for a case, we should get only the correct hypothesis , in which the three events together interpret the observed symptoms, rather than B24,1B28,1, B28,1B30,1, or so on.

Rule 3. Given , , and integer y ≥ 2, then, and .

Rule 4. Given integer y ≥ 2, , and , then, , Fnk;ijFnk′;ij = 0, Fnk;ijFnk;ij′ = 0, and Fnk;ijFnk′;ij′ = 0; if is given, , , then it follows that, Fnk;i′,1Bi,1 = 0, Fnk;i,1Bi′,1 = 0, and Fnk;i,1Fnk;i′,1 = 0.

Proof. The latter part of Rule 4 is based on Rule 1, and the other part has been proven in [28]. From Rule 1, which implies that the basic events Bi,1 and Bi′,1 cannot be concurrent and consequently brings about the mutual exclusions among the function events related to Bi,1 and Bi′,1. Therefore, Ank;i,1Ank;i′,1 = 0 and Fnk;i,1Fnk;i′,1 = 0, as Ank;i,1 cannot be present in combination with Ank;i’,1; similarly then, Fnk;i′,1Bi,1 = 0 and Fnk;i,1Bi′,1 = 0. Notably, Fnk;i,1Fnk;i′,0 ≠ 0 is true because Ank;i,1 and Ank;i′,0 are independent of each other (since they are given independently).

Rule 5. , where Sm denotes the set of variable number in a product item, m ∈ {1,2, …, M} and .

Rule 6. Suppose that no mutually exclusive basic events coexist in the candidate hypothesis space, then, , if j = ji is given.

Proof. To complement the ordinary conditions of Rule 14 discussed in [28], consider a situation related to mutually exclusive basic events, Vij = Bi,1 and Vij = Bi′,1, as proposed in Rule 1. The proposition of Rule 6 can thus be represented asThe inclusion of mutually exclusive events leads to the aforementioned outcomes (i.e., the inconsistence between Rule 1 and Rule 6), and the autonormalization feature no longer holds. Therefore, a solution for BPPV subtype differentiation known as a scheme of dummy basic variable (DBV) is proposed.

2.3.3. Scheme of Dummy Basic Variable

Definition 4 (DBV). If mutually exclusive candidate hypotheses coexist in the candidate hypothesis space, i.e., , , are converted into different states of a DBV; each state of the DBV elicits an individual pathogenesis causality graph according to the hypothesis related to that state; the specific causalities of different diseases can be distinguished by the A-parameter values of DBV, which can either be zero or not. The exclusive rules among basic events are thus transformed into an inherent exclusive law among the states of an event.
The problem of disobeying the autonormalization feature can thus be addressed. This method is illustrated via an example in Figure 6(a). Suppose that B1,1 and B2,1 are mutually exclusive events. They are then combined into a DBV- during the reasoning process, corresponding to and , respectively. From the definition of DBV,  ≡ a3,1;1,1,  ≡ a4,1;2,1,  ≡ a5,1;2,1, and  ≡ rn;i, where and . Figure 6(b) shows an example where an event (e.g., X3) is connected to more than one basic event. In this case, is defined as , if Fn;i ≠ 0. Thus,  = r3;1 + r3;2 for the case in Figure 6(b) which points out to a combination of basic events.
In the context of differential diagnosis, and upon the completion of causality simplifications, the obtained candidate hypotheses of C1 = {B23,1, B24,1, B25,1, B26,1} (there are more than one basic event in the set), should be combined into a DBV-Bm, with Bm,k denoting Bi,1, where k = 1, 2, 3, 4 and i = 23, 24, 25, 26; Pr{Bm,k} refers to the prior probability of Bi,1, and .

Definition 5 (Ranked Independent Probability). The ranked independent probability represents a new metric proposed for quantifying the degree of confidence in whether a BPPV subtype (e.g., Bi,1 as a part of the hypothesis Hk,j) can independently interpret all the medical evidence, in comparison with the other subtypes coexisting in Hk,j. The ranked independent probability of Bi,1 is calculated as follows: (1) get of Bi,1 according to (2) and (3), in which ζi is based on the individual causality tracking graph of Bi,1; and (2) for all the Bi,1 within the set of Hk,j, is ranked by .

2.4. Theoretical Analysis of the Reasoning Complexity

The inference algorithm for the proposed method reduces the reasoning complexity in two ways. First, the causality graph simplification and pruning strategy significantly decreases the model scale. The variables that are irrelevant or inconsistent with the medical evidence are excluded from the reasoning calculation. This significantly reduces the computational complexity. Moreover, the diagnostic reasoning conclusion can even be accurately drawn solely via the causality simplification. Second, prior to probabilistic reasoning, weighted logic inference is performed based on the simplified causality graph. Thus, the multiple connected causalities can be decomposed independently, leading the chaining inference to exhibit high efficiency. Furthermore, the proposed logical inference rules and logical operations can decrease the complexity and scale of logical event expressions.

The actual efficiency tests for the DUCG inference algorithm have been performed using online fault diagnosis data from several large-scale industrial systems (e.g., nuclear power plants), in which thousands of variables and causality arcs are involved [28, 31]. The diagnostic reasoning can typically be finished within 0.5–1 s (on a personal computer). Moreover, the inference efficiency has been verified from our past clinical diagnosis applications that involved dozens of diseases, such as jaundice and sellar region disease.

3. Experiments and Analyses

3.1. Clinical Validation of the BPPV Differential Diagnosis Method
3.1.1. Differential Diagnosis for BPPV—Case 1

Suppose that the medical evidence of a patient was observed in part as {X4,1, X21,2, X24,3, X184,4, X185,2, X209,4, X229,2, X230,2, X207,0, X208,0, X236,0} in Figure 2 and other X-type variables are normal or unknown (incomplete medical information). The medical information in this experiment is from a 65-year-old female patient who suffered from brief episodes of vertigo when rolling over in bed for two weeks and was suspected to be a BPPV case according to the supine roll test.

Based on the evidence, in this case, Figure 2 can be simplified into the graph shown in Figure 7, by applying the causality simplification rules and pruning strategy. Many meaningless relationships and events under current situation are eliminated accordingly, with SH = {B24,1, B28,1, B30,1} being inferred as the possible root causes. The resulting hypothesis space, SH, is concise with all unlikely hypotheses being excluded from concern. Moreover, the individual causality tracking graphs for the three root cause events B24, B28, and B30 in Figure 7 are shown in Figure 8, whereby each subgraph independently covers all evidence. The graphs thus intuitively and faithfully represent the causalities underlying the pathogenesis of Case 1.

By applying the algorithm of differential diagnostic reasoning for Case 1, the state probability of H1,1 = B24,1B28,1B30,1 can be obtained as . The obtained “ranked independent probabilities” of B24,1, B28,1, and B30,1 are listed in Table 3. The LSCC-idiopathic-cupulolithiasis BPPV is therefore determined as the diagnostic result of Case 1; this result is consistent with the clinical conclusions confirmed by otoneurologists.

3.1.2. Differential Diagnosis for BPPV—Case 2

A 72-year-old female patient complained about paroxysmal positional vertigo for a month; the attacks of vertigo were brief, lasting for one minute; the vertigo frequently occurred while looking up or down. Thus, the medical evidence for this case is {X4,1, X21,2, X24,3, X184,4, X185,2, X207,0, X208,0, X209,3, X222,3, X223,1, X224,1, X225,1, X226,1, X236,0}; the other X-type variables are normal or unknown (incomplete medical information).

By applying the algorithm of differential diagnostic reasoning, the hypothesis event H1,1 = B23,1B28,1B29,1B32,1 is uniquely determined as the origin of the disease, which indicates a PSCC-idiopathic-cupulolithiasis-persistent BPPV. The diagnostic causality graph is shown in Figure 9 which is based on the causality simplification of Figure 2. The resulting ranked independent probabilities of B23,1, B28,1, B29,1, and B32,1 are listed in Table 4. By explicitly representing the disease causes, symptoms, pathogenesis, and the latent correlations among medical information, Figure 9 offers intuitive insights into the underlying pathological mechanisms.

3.1.3. Other Validated BPPV Subtype Cases

A verification experiment on some other clinical cases was performed to evaluate the effectiveness of the proposed method. The correctness was based on the otoneurologists’ judgement on the diagnostic outcomes in contrast to the practical conclusions. In the clinical center for vertigo in Beijing Chaoyang Hospital, BPPV cases are common; however, there are multiple repetitions in the meaningful information for the BPPV patients on their medical histories and physical examinations regarding subtype differentiation. Based on this, 75 typical BPPV cases were selected from the thousands of cases containing medical records, interviews, and physical assessments in the verification experiments. The BPPV subtypes involved are outlined in Table 5. The selection principle of BPPV cases was based on the international diagnostic criteria for BPPV [36] and the typicality and representativeness of BPPV subtype diseases. Notably, since the nystagmus cannot be observed for a subjective BPPV case, it is difficult to classify the case either as canalithiasis BPPV or cupulolithiasis BPPV.

The diagnostic outcomes of the DUCG-based model agreed with the otoneurologists’ conclusions with a correctness index of 100%. Based on the available published literature, no other research has reported a model-based automatic subtype differentiation of BPPV in the past.

3.2. Comparative Analysis of DUCG and BN

The main difference between DUCG and BN in terms of the theoretical framework is that the model structure and parameters of DUCG are decoupled. By virtue of such a feature, the independent causality representation by the weighted causality function event An,k;i,j in DUCG makes it possible to simplify the original causality graph based on the collected medical evidence. The irrelevant and meaningless events and relations are thus eliminated from the model. Taking the simplified causality graph as the basis of diagnostic reasoning reduces the computational scale and complexity to a large extent. In contrast, BN cannot be easily simplified owing to the structural coupling feature among the variables: the causalities among the child-parent events in BN are typically represented through a joint probability distribution (e.g., a conditional probabilistic table, CPT); in case any variables and relations are removed from a BN, the remaining parameters in the CPTs may need major adjustments; moreover, the number of parameters specified in a CPT for a BN is exponential to the number of the parent variables and states involved, while the parameters needed in the DUCG are less than the parameters in a BN.

Furthermore, by the independence representation and automatic normalization feature for the weighted logic event expansion, the “self-reliant” chaining inference and logic operations can be performed in DUCG. The parameters and information that are eliminated during the causality simplification and those that are not involved in the chaining inference can all be missing (or some parameters which are not of concern are thus not provided) during this reasoning process and do not affect the diagnostic accuracy. The diagnostic inference method under incomplete information indicates that some unnecessary clinical tests and inappropriate medications can be omitted, thereby decreasing the patients’ medical costs. For a BN, the reasoning accuracy depends on the completeness of the parameters in the CPTs.

3.3. Application Analysis

The proposed method can be developed as an automatic diagnostic system for clinicians. Based on the compact, independent, and graphical causality representation using DUCG, the model construction and diagnosis application are both easy to implement. For a patient, the clinical data through can be obtained through inquiry, physical examination, and auxiliary examination, among others. Next, the clinical data, including symptoms, signs, examination, and test data, are input into the diagnostic reasoning interface of the system, and the diagnosis results can be obtained through reasoning calculation. The clinical application of this method is thus very convenient. Through incorporation into the automatic medical devices, this method could improve the efficiency of BPPV diagnosis and therapy, especially for general practitioners in primary health care institutions. In addition, the system can also be used in mobile and Internet terminals as a secondary tool for telemedicine; finally, it can serve as a teaching tool for doctor training in related disciplines.

4. Summary

This study proposes a DUCG method for differential diagnosis of BPPV. To distinguish among various causes of vertiginous disorders with similar symptoms, the proposed method involves a graphical and compact representation, in addition to logical reasoning algorithms for pathological mechanisms and related uncertain causalities. New algorithms of differential diagnostic reasoning are proposed by integrating a pruning strategy in the causality simplification process, a maximizing concurrent basic event set strategy in the formulation of hypothesis space, and the weighted logic inference rules for subtype differentiation of BPPV. Furthermore, a scheme of dummy basic variable is introduced to settle the problems related to mutually exclusive root cause events. The model manifests higher correctness, favorable robustness to incomplete evidence, and satisfactory discriminatory power for BPPV subtypes. Furthermore, using the diagnostic reasoning method and graphical representation mechanism not only provides a diagnostic result, but also explains the rationale for suggesting the diseases, which justifies the probability results.

In light of these encouraging preliminary results, more clinical studies will be performed. Furthermore, a new method of probabilistic prediction for the vestibular dysfunction-induced fall risk will be the subject of our future research endeavors.

Data Availability

The data used to support the findings of this study are selected from the clinical cases of Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang hospital. Requests for access to these data should be made to Yanjun Wang ([email protected], Beijing Chaoyang hospital).

Disclosure

Chunling Dong and Yanjun Wang are cofirst authors.

Conflicts of Interest

The authors declare that there are no conflicts of interests regarding the publication of this paper.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant no. 71671103 and in part by the Fundamental Research Funds for the Central Universities under Grant nos. CUC2019B023, CUC19ZD008, and CUC19ZD007.