Abstract

The CVLT-II provides standardized scores for each of the List A five learning trials, so that the clinician can compare the patient's raw trials 1–5 scores with standardized ones. However, frequently, a patient's raw scores fluctuate making a proper interpretation difficult. The CVLT-II does not offer any other methods for classifying a patient's learning and memory status on the background of the learning curve. The main objective of this research is to illustrate that discriminant analysis provides an accurate assessment of the learning curve, if suitable predictor variables are selected. Normal controls were ninety-eight healthy volunteers (78 females and 20 males). A group of MS patients included 365 patients (266 females and 99 males) with clinically defined multiple sclerosis. We show that the best predictor variables are coefficients 𝐡 3 and 𝐡 4 of our mathematical model 𝐡 3 βˆ— e x p ( βˆ’ 𝐡 2 βˆ— ( 𝑋 βˆ’ 1 ) ) + 𝐡 4 βˆ— ( 1 βˆ’ e x p ( βˆ’ 𝐡 2 βˆ— ( 𝑋 βˆ’ 1 ) ) ) because discriminant functions, calculated separately for 𝐡 3 and 𝐡 4 , allow nearly 100% correct classification. These predictors allow identification of separate impairment of readiness to learn or ability to learn, or both.

1. Introduction

Multiple sclerosis (MS) is an autoimmune disease of the central nervous system leading to demyelization and scarring within the cerebrum and the spinal cord [1, 2] as well as gray matter atrophy [3]. Cognitive impairment is among the main symptoms, affecting about half of all MS patients, and lower processing speed and defective retrieval from recent memory storage are frequently observed [4–6]. Impairment of cognitive functioning is correlated with brain atrophy revealed with MRI [7, 8]. Memory has multiple regional atrophy correlates, including deep gray matter [9], cerebral cortex volume [8], and medial temporal lobe volume [10, 11]. Therefore, it is reasonable to hypothesize that MS might impair attention-mediated or executive aspects such as encoding and retrieval as well as consolidation and perhaps even recognition in some circumstances [11, 12]. That is why we hypothesize that depending on the affected MS brain region, short-term memory or long-term memory might be worsened predominantly and independently.

The California Verbal Learning Test: Second Edition (CVLT-II) [13], which is the revised version of the test that was initially published in 1987 [14], was selected by a group of neuropsychologists and neurologists to assess auditory/verbal memory defects in MS [15]. Benedict and colleagues [16] recently confirmed the validity of the CVLT-II with a large cohort of MS data. Of the 23 measures assessed, 18 significantly discriminated MS patients from demographically matched controls at the conventional 𝑃 < . 0 5 threshold, and 14 measures discriminated the groups at 𝑃 < . 0 0 1 . The CVLT-II was able to distinguish between groups of healthy participants and MS patients. A discriminant function analysis of the data retained five measures: short delay free recall, recognition discrimination index, long delay free recall, semantic clustering, retroactive interference; with 70.0% of the participants correctly classified [16]. In addition to the conventional CVLT-II measures available from the test manual and its companion computer program, Stegen and coauthors [16] used learning measures derived from a mathematical model based on the first-order transfer function for the assessment of the learning curve [17, 18]. The model’s measures were valid in combination with some other CVLT-II standard measures in discriminating MS patients from demographically matched controls. However, Stegen and colleagues did not study whether modeling of the learning curve of the List A could be used to correctly classify between groups of healthy participants and MS patients.

Though the CVLT-II provides comprehensive analysis of short- and long-term memory status, administration of the standard CVLT-II form takes 30 minutes testing with a 20-minute delay interval [19]. If examination time is limited and a neuropsychologist requires less detailed information, then a quick method for assessing memory impairment in MS patients is needed. Delis and colleagues developed the California Verbal Learning Test-II Short Form (CVLT-II SF) that was designed for two purposes: (1) as a cost-effective screening tool to identify memory problems and (2) to assess verbal learning and memory without overly taxing patients [13]. However, the CVLT-II SF includes a list of nine words instead of sixteen. In our opinion, nine words are not enough for an accurate assessment of long-term memory because many individuals are able to learn and recall 15-16 words [16].

To assess both short-term and long-term memory, many MS researchers still rely on the predictive validity of CVLT-II learning trials 1–5 because on the one hand, it is part of a consensus battery [15] and, on the other, it is useful as a standalone free recall test for diagnosis of overall memory impairment. The latter feature attracts attention because the time required for clinical assessments of MS patients is important from both an economic standpoint and avoiding patient fatigue. Presentation of List A without the subsequent subtests (learning of the list B, delay trials, etc.) during initial examination of a patient lessens the burden on the patient. If List A does not reveal significant overall memory impairment, then additional memory testing for the patient under examination may not be required.

The CVLT-II provides standardized scores for each of the five learning trials, so that the clinician can assess the consistency of the patient’s immediate-recall performance across trials. The consistency means that the number of correct recalled words increases more or less monotonously from trial to trial. However, frequently fluctuating emotional state, incomplete effort, medication effects, pain symptoms, and so forth influence the raw scores. Thus, poor scores on one or two trials in the face of better recall on the other trials can lead to an inaccurate assessment of learning and memory.

In this paper, we advocate discriminant analysis in the evaluation of learning curves. The discriminant analysis creates a linear combination of predictor variables that provides the best discrimination between, for example, a control group of healthy individuals and a group of MS patients [20]. If the raw scores trials 1–5 are used as predictor variables, discriminant analysis outputs a multidimensional linear function 𝑦 = 𝐢 0 + 𝐢 1 * T r i a l 1 + 𝐢 2 * T r i a l 2 + 𝐢 3 * T r i a l 3 + 𝐢 4 * T r i a l 4 + 𝐢 5 * T r i a l 5 and a boundary value for this function π‘Œ 𝑏 . A clinician substitutes predictor variablesβ€”Trial 1, Trial 2, and so forth with raw scores trials 1–5β€”and calculates a value of the discriminant function 𝑦 . If 𝑦 β‰₯ π‘Œ 𝑏 , there is no memory impairment in the patient. If 𝑦 < π‘Œ 𝑏 , then the patient needs detailed clinical examination to find the major cause of memory impairment. Thus, a relevant question arises: what measures of the learning curve should be used as predictor variables for discriminant analysis. In our opinion, mathematical modeling of List A learning curve provides two coefficientsβ€” 𝐡 3 and 𝐡 4 (see below) that are better predictors than standard CVLT-II measures Trials 1–5.

Description of Our Mathematical Model
Control theory defines the general transfer function as follows [21, 22]. If we designate an input signal acting upon the system as 𝐹 and the output signal (the reaction of the system)β€”as 𝑦 , then the transfer function [ 1 / 𝐾 ] = 𝑦 / 𝐹 . If the input signal is equal to zero at time 𝑑 < 0 and is equal to 𝐹 at 𝑑 β‰₯ 0 , it is called β€œthe step function.” Reaction of the first-order system on the step function is described with the differential equation 𝜏 β‹… 𝑑 𝑦 / 𝑑 𝑑 + 𝑦 = 𝐹 / 𝐾 . The equation can be rewritten in the form 𝑑 𝑦 / 𝑑 𝑑 = ( 1 / 𝜏 ) β‹… ( 𝐹 / 𝐾 βˆ’ 𝑦 ) . Define 𝑦 s s ≑ 𝐹 / 𝐾 . This is the asymptotic (steady) value of the output signal at 𝑑 = ∞ . The time constant of the system is defined as 𝜏 . Define π‘Ž ≑ 1 / 𝜏 . The differential equation takes the form 𝑑 𝑦 / 𝑑 𝑑 = π‘Ž ( 𝑦 s s βˆ’ 𝑦 ) . Thus, it is seen that the rate of the transient process in the first-order system is proportional to the difference between the asymptotic and current value of the system’s output signal. The solution of the equation with initial condition 𝑦 = 𝑦 0 at 𝑑 = 0 gives the function of exponential type 𝑦 = ( 𝑦 0 βˆ’ 𝑦 s s ) 𝑒 βˆ’ π‘Ž 𝑑 + 𝑦 s s . The measure of the rate of the transient process is the time constant ( 𝜏 ) that reveals how much time is necessary for achievement of 63% from the difference between the initial ( 𝑦 0 ) and asymptotic ( 𝑦 s s ) levels of the output signal [21, 22].

We adopted the first-order transfer function for modeling the learning curves in the form 𝐡 3 βˆ— e x p ( βˆ’ 𝐡 2 βˆ— ( 𝑋 βˆ’ 1 ) ) + 𝐡 4 βˆ— ( 1 βˆ’ e x p ( βˆ’ 𝐡 2 βˆ— ( 𝑋 βˆ’ 1 ) ) ) . 𝑋 is the trial number and π‘Œ is the quantity of correctly recalled words without repetitions. The parameters are 𝐡 2 β€”the learning rate; 𝐡 3 β€”the value of correctly recalled words on the first trial (i.e., 𝐡 3 = π‘Œ at 𝑋 = 1 ); 𝐡 4 β€”the asymptotic value of recalled words at 𝑋 = ∞ .

Using the independent variable in the form of ( 𝑋 βˆ’ 1 ) means that 𝐡 3 is very close to the CVLT-II Trial 1 standard measure. The number of correct recalled words on the first trial assesses mainly short-term memory volume [23]. Delis and colleagues have indicated that β€œperformance on the first immediate-recall trial of List A (Trial 1) is thought to be especially dependent on auditory attention span” [13, page 28]. Factors such as motivation [24], general health, and previous experience with the test might also influence the first trial recall. Thus, we treat 𝐡 3 as an estimator of the general functional state of a participant before starting the test and call it β€œreadiness to learn” [18], hence 𝐡 3 is an abbreviation for β€œreadiness to learn.” In our model, 𝐡 3 primarily represents attention span and short-term memory encoding process.

Learning in either version of the CVLT begins when the list of words is presented repeatedly with trial 2 being the first repetition. As the developers of the CVLT-II indicate, β€œWhen neurologically intact individuals are provided with additional opportunities to learn a β€œsupraspan” list of words (i.e., a list of words longer than 7 Β± 2 words), their recall on the subsequent trials typically exceeds their auditory attention span and increases with each new trial.” Thus, encoding into and retrieval from LTM plays an increasingly greater role in recall performance with each representation of the same list. Performances on List A Trials 2 to 5 of the CVLT-II reflect the core verbal learning abilities of the examinee" [13, page 29]. Thus, in our model, 𝐡 4 predominantly assesses long-term memory. Other factors such as fluctuating emotional state, medication effects, pain symptoms, and effort also influence 𝐡 4 values [13]. We treat 𝐡 4 as an estimator of general ability to learn and call it β€œability to learn” [18]; hence, 𝐡 4 is an abbreviation for β€œability to learn”. In our model, 𝐡 4 is a measure of long-term memory.

Velocity of learning is characterized by the time constant that reveals how much trials (after trial 1) are necessary for achieving 63% from the difference between 𝐡 3 and 𝐡 4 . Coefficient 𝐡 2 is inverse value of the time constant, so that the higher the value of 𝐡 2 the faster is the learning rate. Hence, 𝐡 2 is an abbreviation for β€œthe learning rate.” The CVLT-II provides a measure of the learning rate in the form of the learning slope score that is calculated by a least squares regression of the linear model and reflects the average number of newly recalled words per trial. The learning slope has an important shortcoming. If a person does very well on trial 1, there tends to be a ceiling effect with little room for a learning slope. The slope is steep when a person does less well on Trial 1 and then does better in subsequent trials. In such a situation, the CVLT-II learning slope is misleading when a person earns a high score on trial one. Generally, from mathematical point of view, there might be a correlation between the CVLT-II learning slope and difference between maximal score over all trials ( π‘Œ m a x ) and a score on Trial 1 ( π‘Œ 1 ). Mathematically, l e a r n i n g s l o p e = π‘Ž 0 + π‘Ž 1 βˆ— ( π‘Œ m a x βˆ’ π‘Œ 1 ) . We found that the CVLT-II learning slope significantly correlated with the difference between the maximal number of recalled words and the number of recalled words on trial 1 [25].

Our coefficient 𝐡 2 , reflecting the number of trials needed to reach difference between 𝐡 3 and 𝐡 4 , differs in principal from the Learning Slope. The Pearson correlation between 𝐡 2 and ( π‘Œ m a x βˆ’ π‘Œ 1 ) is nonsignificant [25]. In other words, 𝐡 2 is more robust relative to a high score on trial 1. Nevertheless, we assume that, when comparing the learning rate between two people, 𝐡 2 gives reliable information, if readiness to learn ( 𝐡 3 ) and ability to learn ( 𝐡 4 ) are the same in both people.

At this point in our model construction, the meaning of 𝐡 2 is difficult to specify. We believe that on the whole 𝐡 2 might reflect the rate of information transfer from short-term memory to the hippocampus and the rate at which the hippocampus can encode new episodic memories. As the construct validity of 𝐡 2 is poorly understood, we focused on 𝐡 3 and 𝐡 4 as possible valid predictor variables for discriminant analysis.

Previously we applied our mathematical model to the CVLT learning curve for patients with type 2 diabetes mellitus [26] and next outlined four procedures that must be implemented to achieve nearly 100% of correct classification of learning deficits in these patients. These procedures include (1) dividing participants and patients by gender; (2) dividing participants and patients by high versus low recall with previous modeling each individual learning curve, forming samples of 𝐡 3 and 𝐡 4 , and next clustering healthy participants and MS patients by 𝐡 3 or 𝐡 4 ; (3) conducting separate discriminant analysis of readiness to learn (coefficient 𝐡 3 ) and ability to learn (coefficient 𝐡 4 ); (4) comparing a sample of healthy participants with high recall scores, (or with high values of 𝐡 3 or 𝐡 4 ) with a sample from patients with low recall scores, (or with low values of 𝐡 3 or 𝐡 4 ). Fulfillment of these procedures allows calculating correct boundary values for 𝐡 3 and 𝐡 4 with discriminant analysis [26].

Dividing Participants and Patients by Gender
First and foremost, each group of participants and patients should be divided by gender because women outperform men on the CVLT [27] and girls outperform boys on the CVLT-Children’s version [28]. In particular, women score significantly higher than men do on all immediate free recall trials, having recalled an average of approximately one word per trial more than men have. Women use active, semantically mediated strategies during encoding, whereas men prefer to recall words in the order in which they were presented, which is a less effective organizational strategy, and are more inclined to recall items from the primacy and recency regions [29].

The different encoding strategies among males and females have an effect on such CVLT/CVLT-II measures as the Semantic Clustering index and the Serial Clustering index. A semantic cluster occurs when two words in adjacent recall positions are members of the same category and a serial cluster occurs when two words are recalled in the same sequence in which they presented on the list [14, 30]. In a group of men and women, matched for age and education, a mean value of the semantic clustering index was 2.53 for females versus 1.94 for males ( 𝑃 < . 0 0 5 ), but the serial clustering index was 1.86 in females versus 2.48 in males ( 𝑃 < . 0 5 ), confirming that women use more efficient mode of organization of verbal information on free recall trials than men [27]. Thus, the CVLT/CVLT-II is constructed with an initial bias toward women and this bias must be taken into consideration to optimize modeling of the learning curve. In addition, gray matter and central atrophy are more advanced in male MS patients, whereas white matter atrophy is more advanced in female MS patients [31].

Separate Assessment of 𝐡 3 and 𝐡 4
Developers of the CVLT/CVLT-II/CVLT-C divide learning and memory impairment as assessed with List A into three patterns [13]. The first pattern is performed by individuals with impaired attention but normal learning and memory who may perform poorly on trial 1 but recoup and perform adequately on subsequent trials. In our terminology, this pattern stands for insufficient readiness to learn.

The second pattern may be observed in individuals with some neurological disorders that damage the memory circuitry of the brain but leave attentional processes relatively intact. These individuals perform in the average or near-average range on trial 1 but on subsequent trials fail to assimilate additional information beyond their initial attention span. In our terminology, this pattern stands for insufficient ability to learn.

The third pattern is seen in individuals with deficiencies in both attention and learning. These individuals perform poorly on trial 1 and continue to do so on subsequent immediate-recall trials. In our terminology, this pattern stands for insufficient readiness and insufficient ability to learn. As Delis and colleagues state, β€œperformance on the immediate-recall trials of the CVLT-II (i.e., the first five learning trials) usually reflects a nebulous combination of STM and LTM, which precludes precise conclusions about the relative integrity of these proposed memory systems [13, page 27].”

The purpose of this study was to assess which measures of the learning curveβ€”the standard CVLT-II measures Trials1–5 or coefficients 𝐡 3 and 𝐡 4 of our mathematical modelβ€”must be used as predictor variables for discriminant analysis between healthy persons and MS patients.

2. Materials and Methods

2.1. Participants

Normal controls were ninety-eight healthy volunteers (78 females and 20 males). A group of MS patients included 365 patients (266 females and 99 males) with clinically defined multiple sclerosis [32] being part of a larger previous study [16]. Throughout the text, the phrase β€œhealthy participants” represents normal controls, and the phrase β€œMS patients” represents participants with clinically defined multiple sclerosis.

All were at least 18 years of age and fluent in the English language. Exclusion criteria were (a) presence of a medical disorder other than MS affecting cognitive function, (b) psychiatric disorder [33] other than mood, personality, or behavioral change following the onset of MS, (c) drug or alcohol dependence or current abuse, (d) motor or sensory impairment that could compromise testing (e.g., corrected vision of at least 20/70), and (e) relapse or corticosteroid treatment within four weeks of assessment.

Prior to participation, all research participants provided written consent as approved by institutional review boards and consented for the electronic storage of their test data. The mean (Β±SD) age of MS patients was 4 6 . 3 Β± 9 . 1 years. Patients completed 1 4 . 2 Β± 2 . 4 years of education. Disease duration was 1 0 . 7 Β± 8 . 0 years. The majority were Caucasian (92%) and female ( 𝑛 = 2 6 6 or 72.9%), consistent with MS population demographics [34]. Disease course was as follows: relapsing remitting (RR) 𝑛 = 2 6 6 or 72.9%, secondary progressive (SP) 𝑛 = 8 0 or 21.9%, progressive-relapsing 𝑛 = 9 or 2.5%, primary progressive 𝑛 = 1 0 or 2.7%.

Healthy controls with a mean age of 4 5 . 3 3 Β± 7 . 3 and 1 4 . 6 Β± 1 . 4 years of education were recruited through newspaper advertisements, which targeted demographics consistent with the patient sample. The majority of the controls were Caucasian and women ( 𝑛 = 7 8 or 79.6%). The differences between the female versus male controls and MS patients in age or educational level did not reach significance using ANOVA or chi-squared tests. Other details are available elsewhere [16].

2.2. Procedure

A combined sample of MS patients seen for research ( 𝑛 = 9 1 , 24.9%), routine clinical monitoring ( 𝑛 = 1 5 1 , 41.4%), or clinical indication ( 𝑛 = 1 2 3 , 33.7%) was used. Research participants (including all normal controls) were contacted by mail or were approached during the course of their usual clinical care at an MS center. Clinical patients were evaluated for either routine monitoring of cognitive function or for specific clinical reasons. Participants were assessed using the entire MACFIMS battery (the Minimal Assessment of Cognitive Function in Multiple Sclerosis) administered individually by a psychologist, trained assistant, or a graduate student under the supervision of a board-certified neuropsychologist. All participants were assessed at affiliated hospitals in association with the Department of Neurology at the State University of New York at Buffalo.

2.3. Measures

Representing the auditory/verbal memory domain, the CVLT-II was administered in the standard manner [13] as part of the Minimal Assessment of Cognitive Function in multiple sclerosis protocol [15]. The CVLT-II assesses memory with two 16-item word lists. The words are grouped into four semantic categories of four words each. In general, participants are asked to recall the first list (List A) following each of five exposures. All details on CVLT-II standard measures are available elsewhere [16]. For the purposes of the quantitative analysis of the learning curve, free recall of 16 words from list A during five consecutive trials was used [13].

2.4. Estimation of the Parameters of the Model and Its Verification

SPSS and Mathematica were used to estimate the model’s parameters. 𝑅 squared ( 𝑅 2 ) was used for verification. The closer the learning data are to the model values the higher is 𝑅 2 , its maximal value being equal to 1. A step-by-step guide to make the calculations using SPSS and Mathematica is provided in our recent papers [18, 25, 26]. To make the calculations as easy as possible we have, developed software (β€œThe Learning Curve Modeling Tool”).

2.5. Standard Statistical Methods

SPSS was used for all statistical calculations. When discriminant analysis was used with trials 1–5, we checked an option β€œEnter independents together” to include each Trial value into calculations. Hierarchical cluster analysis based on Ward’s method was used with trials 1–5 measures as variables entered together. 𝐾 -means cluster analysis, which in our case is reduced to two means, was used with 𝐡 3 or 𝐡 4 as a single variable.

3. Results

3.1. Discriminant Analysis with Standard CVLT-II Measures Trials 1–5

To assess whether the mathematical modeling of the learning curve based on our model is more sensitive in detecting memory deficits than standard CVLT-II measures, we first calculated mean values for standard CVLT-II trials 1–5 for each trial (98 subjects). Means (Β± SEM) for the healthy participant group are presented in Table 1, item 1. Means for the MS patient group (365 subjects) are presented in the Table 1, item 2. These means were compared for each trial with independent sample 𝑑 -test. Significant difference was found for each trial ( 𝑃 < . 0 0 1 ). Thus, standard CVLT-II measuresβ€”trials 1–5β€”were used for discriminant analysis to assess whether the measures could distinguish healthy participants from MS patients.

3.1.1. Discriminant Analysis for Healthy Participant and MS Patient Groups

Wilk’s lambda was significant, πœ† = . 8 9 4 , πœ’ 2 = 5 1 . 2 5 , 𝑃 < . 0 0 1 , indicating that the linear discriminant function based on raw free recall scores was able to significantly discriminate between healthy participants and MS patients. A boundary value for the discriminant function is  .420. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 4 , 7 7 4 + 0 . 1 5 6 4 βˆ— T r i a l 1 βˆ’ 0 . 0 4 0 βˆ— T r i a l 2 + 0 . 0 4 7 βˆ— T r i a l 3 βˆ’ 0 . 0 4 9 0 6 βˆ— T r i a l 4 + 0 . 3 4 9 βˆ— T r i a l 5 . The discriminant function correctly predicted 80.6% of healthy participants and 61.9% of MS patients.

Taking into account that females outperform males on CVLT [27, 28], we compared each standard CVLT-II measuresβ€”trials 1–5 between males and females. The means (Β±S.E.M.) for the group of healthy male participants (20 subjects) are presented in Table 1, item 3. The means for female participants (78 subjects) are presented in Table 1, item 4. These means were compared for each trial with an independent sample 𝑑 -test. Mean value for each trial is higher in females ( 𝑃 ≀ . 0 0 1 ) . The means (Β±S.E.M.) of male MS patients (99 subjects) are presented in Table 1, item 5. The means for female patients (266 subjects) are presented in Table 1, item 6. Mean value of each trial is higher in females ( 𝑃 ≀ . 0 0 2 ). These results confirmed that healthy and MS females outperform males in the number of correctly recalled words from the CVLT-II List A. Therefore, we performed discriminant analysis for males and females separately.

3.1.2. Discriminant Analysis for Males

Wilk’s lambda was significant, πœ† = . 8 9 3 , πœ’ 2 = 1 2 . 9 9 6 , 𝑃 = . 0 2 3 , indicating that the linear discriminant function based on raw free recall scores was able to significantly discriminate between healthy male participants and MS male patients. A boundary value for the discriminant function is  .305. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 2 . 6 6 6 βˆ’ 0 . 1 0 4 βˆ— T r i a l 1 βˆ’ 0 . 2 4 5 βˆ— T r i a l 2 + 0 . 2 9 5 βˆ— T r i a l 3 βˆ’ 0 . 3 0 6 βˆ— T r i a l 4 + 0 . 5 2 3 βˆ— T r i a l 5 . The discriminant function correctly predicted 70% of healthy male participants and 61.6% of MS male patients.

3.1.3. Discriminant Analysis for Females

Wilk’s lambda was significant, πœ† = . 8 7 8 , πœ’ 2 = 4 4 . 0 6 4 , 𝑃 < . 0 0 1 , indicating that linear discriminant function based on raw free recall scores was able to significantly discriminate between healthy female participants and MS female patients. The boundary value for the discriminant function is  .242. The discriminant function yielded the following coefficients: 𝑦 = βˆ’ 5 . 3 7 8 + 0 . 2 0 βˆ— T r i a l 1 + 0 . 2 4 5 βˆ— T r i a l 2 βˆ’ 0 . 0 1 8 βˆ— T r i a l 3 + 0 . 0 1 1 βˆ— T r i a l 4 + 0 . 3 1 9 βˆ— T r i a l 5 . The results show that the discriminant function correctly predicted 78.2% of healthy female participants and 62.8% of MS female patients.

We compared each standard CVLT-II measuresβ€”trials 1–5 between healthy participants and MS patients with independent sample 𝑑 -test. In healthy males, mean values were significantly higher in comparison with MS male patients on Trial 3 ( 𝑃 = . 0 3 1 ) and Trial 5 ( 𝑃 = . 0 0 6 ). In healthy females, mean values were significantly higher in comparison with MS male patients on each trial ( 𝑃 ≀ . 0 0 1 ). Nevertheless, discriminant analysis based on standard measuresβ€”trials 1–5β€”could classify less than 70%, when all participants within a group are used for discriminant analysis.

To increase discrimination capacity, we further separated each group (male and female healthy participants and MS patients) into two samplesβ€”one sample with high and the other sample with low levels of CVLT-II recall scores using cluster analysis.

3.2. Cluster Analysis of Healthy Participants and MS Patients Using Standard CVLT-II Measures Trials 1–5
3.2.1. Cluster Analysis of Healthy Male Participants

The entire group was divided into two clusters: cluster referring to lower memory functioning (9 cases) with smaller values, and cluster referring to higher memory functioning (11 cases) with larger values (Figure 1(a)). Mean values (Β±S.E.M.) for the cluster of lower memory functioning are presented in Table 1, item 7; values for the cluster of higher memory functioning are presented in Table 1, item 8. Comparing means with a 𝑑 -test for independent samples revealed that the cluster of higher memory functioning means were higher in comparison with the cluster of lower memory functioning for each trial ( 𝑃 < . 0 5 ) with the exception of Trial 3 ( 𝑃 = . 0 7 5 ). Thus, cluster associated with higher memory functioning better characterized the healthy male sample and was used for further discriminant analysis.

3.2.2. Cluster Analysis of MS Male Patients

The entire group was divided into two clusters: one referring to lower memory functioning (46 cases) with smaller values and another referring to higher memory functioning (53 cases) with larger values (Figure 1(b)). Mean values (Β±S.E.M.) of the cluster representing lower memory functioning are presented in Table 1, item 9 and for cluster referring to higher memory functioning in Table 1, item 10. Comparing means with independent sample 𝑑 -tests revealed that clusters of higher memory functioning were higher in comparison with cluster referring to lower memory functioning ( 𝑃 ≀ . 0 0 1 ). The cluster of lower memory functioning better characterized MS male patients and was used for further discriminant analysis.

3.2.3. Cluster Analysis of Healthy Female Participants

The entire group was divided into two clusters: cluster referring to lower memory functioning (38 cases) with smaller values and the other referring to higher memory functioning (40 cases) with larger values (Figure 1(c)). Mean values (Β±S.E.M.) for the cluster of lower memory functioning are presented in Table 1, item 11 and for cluster of higher memory functioning in Table 1, item 12. Comparing means with a 𝑑 -test for independent samples revealed that the clusters of higher memory functioning were higher in comparison with the cluster of lower memory functioning for each trial ( 𝑃 ≀ . 0 0 1 ). The cluster of higher memory functioning better characterized healthy female participants and was used for further discriminant analysis.

3.2.4. Cluster Analysis of MS Female Patients

The entire group was divided into two clusters: one referring to lower memory functioning (133 cases) with smaller values and another, referring to higher memory functioning (133 cases) with larger values (Figure 1(d)). Mean values (Β±S.E.M.) for cluster of lower memory functioning are presented in Table 1, item 13 and for the cluster of higher memory functioning presented in Table 1, item 14. Comparing means with a 𝑑 -test for independent samples revealed that the cluster of higher memory functioning means were higher in comparison with the cluster of lower memory functioning for each trial ( 𝑃 ≀ . 0 0 1 ). Thus, cluster of lower memory functioning better characterized MS female participants sample and was used for further discriminant analysis.

3.3. Discriminant Analysis of Trials 1–5 Clusters Referring to Lower and Higher Memory Functioning
3.3.1. Discriminant Analysis for Male Groups Clusters

The cluster of higher memory functioning with high recall raw scores from healthy male participants and the cluster of lower memory functioning with low recall raw scores from MS male patients were used for discriminant analysis. Wilks’ lambda πœ† = . 4 2 1 , πœ’ 2 = 4 5 . 4 , 𝑃 < . 0 0 1 indicated that the linear discriminant function was able to significantly discriminate between healthy male participants and MS male patients. A boundary value for the discriminant function is 1.459. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 5 . 1 2 3 βˆ’ 0 . 1 6 βˆ— T r i a l 1 + 0 . 1 8 βˆ— T r i a l 2 + 0 . 1 8 6 βˆ— T r i a l 3 + 0 . 1 1 5 βˆ— T r i a l 4 + 0 . 2 8 7 βˆ— T r i a l 5 . The discriminant function correctly predicted 100% of healthy male participants and 100% of male MS patients. Verification of the cluster of lower memory functioning from healthy males revealed that 6 out of 9 cases were classified as healthy, that is, 66.7% were correctly classified. Verification of the cluster referring to higher memory functioning from MS male patients revealed that 47 out of 53, that is, 88.7% cases were classified as healthy.

3.3.2. Discriminant Analysis for Female Groups Clusters

The cluster of higher memory functioning with high recall raw scores from healthy female participants and the cluster referring to lower memory functioning with low recall raw scores from MS female patients were used for discriminant analysis. Wilks’ lambda πœ† = . 2 8 2 , πœ’ 2 = 2 1 3 , 𝑃 < . 0 0 1 , indicating that linear discriminant function was able to significantly discriminate between healthy female participants and MS female patients. A boundary value for the discriminant function is 1.881. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 7 . 3 2 6 + 0 . 1 2 1 βˆ— T r i a l 1 + 0 . 1 9 1 βˆ— T r i a l 2 + 0 . 2 9 5 βˆ— T r i a l 3 + 0 . 1 1 4 βˆ— T r i a l 4 + 0 . 0 6 3 βˆ— T r i a l 5 . The discriminant function correctly predicted 100% of healthy female participants and 100% of MS female patients. Verification of the cluster of lower memory functioning from healthy females revealed that only one out of 38 cases was classified as healthy. Verification of the cluster of higher memory functioning from MS female patients revealed that 72 out of 133, that is, 54.1% cases were classified as healthy and 61 cases, that is, 45.9% were classified as MS female patients.

In summary, first dividing participants and patients by gender and then further dividing them by high versus low recall using standard CVLT-II measures trials 1–5 substantially improves classification of memory impairment in patients with multiple sclerosis. However, the use of clusters is not able to identify clearly short-term or long-term memory impairment. In the following section, we tested the ability of our model to identify short-term and long-term memory impairment in MS patients.

3.3.3. The Averaged Learning Curves of the Trials 1–5 Clusters Used for Discriminant Analysis

Our mathematical model allows separate assessment of short-term and long-term memory, because 𝐡 3 represents attention span and short-term memory encoding process, whereas 𝐡 4 represents long-term memory consolidation process. Therefore, we modeled the learning curve for the cluster of lower memory functioning and the cluster of higher memory functioning from male and female healthy participants and MS patients.

The learning curves averaged from healthy male participants outputted the following values for the model’s coefficients: for the cluster referring to lower memory functioning 𝐡 3 = 4 . 3 9 Β± 0 . 2 9 , 𝐡 4 = 1 1 . 6 9 Β± 0 . 5 5 , and for the cluster referring to higher memory functioning 𝐡 3 = 6 . 9 0 Β± 0 . 2 7 , 𝐡 4 = 1 5 . 0 Β± 1 . 0 . Comparing coefficients revealed that 𝐡 3 and 𝐡 4 differed ( 𝑃 = . 0 0 3 and 𝑃 = . 0 4 4 accordingly). The learning curves averaged over each cluster from MS male patients outputted the following values for the coefficients: for cluster referring to lower memory functioning 𝐡 3 = 4 . 8 0 Β± 0 . 0 8 , 𝐡 4 = 8 . 2 9 Β± 0 . 1 4 , and for cluster referring to higher memory functioning 𝐡 3 = 6 . 4 0 Β± 0 . 2 1 , 𝐡 4 = 1 3 . 1 0 Β± 0 . 3 4 . Comparing the coefficients revealed that 𝐡 3 and 𝐡 4 differed ( 𝑃 = . 0 0 2 and 𝑃 = . 0 0 0 2 accordingly).

The learning curves from healthy female participants outputted the following values for the model’s coefficients: for the cluster referring to lower memory functioning 𝐡 3 = 6 . 9 0 Β± 0 . 2 6 , 𝐡 4 = 1 4 . 9 2 Β± 1 . 1 1 , and for the cluster referring to higher memory functioning 𝐡 3 = 8 . 3 4 Β± 0 . 0 5 , 𝐡 4 = 1 5 . 0 1 Β± 0 . 0 4 . Comparing of coefficients revealed that 𝐡 3 differed ( 𝑃 = . 0 0 6 ), but 𝐡 4 did not differ ( 𝑃 > . 2 ). Here, clustering with standard measuresβ€”trials 1–5 separated individual cases only by 𝐡 3 , but not 𝐡 4 . The learning curves averaged over each cluster from MS female patients outputted the following values for the model’s coefficients: for the cluster referring to lower memory functioning 𝐡 3 = 5 . 3 6 Β± 0 . 1 7 , 𝐡 4 = 1 0 . 9 9 Β± 0 . 4 4 , and for the cluster referring to higher memory functioning 𝐡 3 = 7 . 3 4 Β± 0 . 0 8 , 𝐡 4 = 1 4 . 2 2 Β± 0 . 0 9 . Comparing the coefficients revealed that 𝐡 3 and 𝐡 4 differed ( 𝑃 = . 0 0 0 5 , and 𝑃 = . 0 0 2 accordingly).

Our results suggest that clustering with standard measuresβ€”trials 1–5 separated subjects by 𝐡 3 and 𝐡 4 . Figure 2 illustrates that clustering of raw recall scores did not distinguish between 𝐡 3 and 𝐡 4 due to considerable overlap between the cluster of lower memory functioning and the cluster of higher memory functioning in healthy participants and MS patients. This overlap suggests the possibility of obtaining misleading information on the level of readiness to learn and ability to learn. Thus, modeling of the learning curve might be necessary to distinguish between short-term and long-term impairment in MS patients.

3.4. Use of Coefficients 𝐡 3 and 𝐡 4 as Predictor Variables for Discriminant Analysis

In the next sections, we present results of the use of coefficient 𝐡 3 and 𝐡 4 for discriminant analysis.

3.4.1. The Averaged Learning Curve of Healthy Participants

The averaged learning curve of all healthy participants (Table 2, item 1), the 78 female participants (Table 2, item 2), and the 20 male participants (Table 2, item 3) provided good fits with our model based on very high values of 𝑅 -squared. Comparison of the coefficients revealed that 𝐡 3 was higher in females ( 𝑃 = . 0 0 1 6 ). Other coefficients did not differ ( 𝑃 = . 0 7 5 for 𝐡 2 and 𝑃 = . 2 0 for 𝐡 4 ). The averaged learning curve of females (Figure 3, empty triangles) was positioned above the curve for males (Figure 3, empty down triangles).

3.4.2. The Averaged Learning Curve of MS Patients

The averaged learning curve sampled from MS patients also differed between men and women. The averaged curve of all MS patients (Table 2, item 4), 266 female MS patients (Table 2, item 5), and 99 male MS patients (Table 2, item 6) provided good fits with our model based on very high values of R-squared. Coefficient 𝐡 3 ( 𝑃 = . 0 0 6 ) and coefficient 𝐡 4 ( 𝑃 = . 0 0 2 ) was higher in MS female patients (Figure 3, filled triangles) in comparison with MS male patients (Figure 3, filled down triangles). The coefficient 𝐡 2 did not differ ( 𝑃 = . 3 2 ).

3.4.3. Comparison of the Averaged Learning Curve of Healthy Participants and MS Patients

The averaged learning curve in MS patients differs significantly from the averaged learning curve in healthy participants both for the total learning data and for learning data of females or males calculated separately. Comparison of the averaged learning curve for the group of healthy participants (Table 2, item 1) and the group of MS patients (Table 2, item 4) revealed that 𝐡 3 ( 𝑃 = . 0 0 2 ) and 𝐡 4 ( 𝑃 = . 0 0 1 ) were higher in healthy participants though 𝐡 2 did not differ ( 𝑃 = . 5 6 ). The averaged learning curve for healthy participants was positioned nearly in parallel above the averaged learning curve for MS patients.

Separate comparison of healthy female participants (Table 2, item 2) with female MS patients (Table 2, item 5) revealed lower values of 𝐡 3 ( 𝑃 = . 0 0 1 9 ) and 𝐡 4 ( 𝑃 = . 0 0 1 8 ) in female MS patients with no difference between 𝐡 2 ( 𝑃 = . 8 8 ). The averaged learning curve for healthy female participants was positioned above the averaged learning curve for female MS patients (see Figure 3).

Unlike females, comparison of healthy male participants (Table 2, item 3) with male MS patients (Table 2, item 6) revealed that 𝐡 2 ( 𝑃 = . 1 3 ) and 𝐡 3 ( 𝑃 = . 2 9 ) were not affected, but 𝐡 4 was higher in healthy male participants ( 𝑃 = . 0 0 6 ). Both learning curves practically coincide at trial 1 (see Figure 3). Coefficient 𝐡 3 ( 𝑃 = . 0 0 6 ) and 𝐡 4 ( 𝑃 = . 0 0 2 ) was higher in MS female patients in comparison with MS male patients.

Our results supported the existence of CVLT-II gender bias in favor of females as it was previously shown for CVLT [27]. The existence of a gender bias necessitates that the discriminant and cluster analyses be performed separately for males and females, if our model is to provide greater precision with respect to identifying memory impairment in patients with multiple sclerosis and ensure close to 100% correct classification relative to other CVLT-II measures.

3.5. Cluster Analysis of Healthy Male Participants and MS Male Patients Using Coefficients 𝐡 3 or 𝐡 4

Previous to performing cluster analysis, we modeled each individual learning curve and formed samples of 𝐡 3 and 𝐡 4 . Next, we clustered healthy participants and MS patients by 𝐡 3 or 𝐡 4 . Then, the raw recall scoresβ€”trials 1–5 of participants and patients from each clusterβ€”were used to calculate the averaged learning curves.

3.5.1. Healthy Male Participants: Clustering Using 𝐡 3 Only

Clustering using 𝐡 3 separated all cases into two clusters: one cluster of lower memory functioning with low values of 𝐡 3 in the range of 2.18 to 5.0 for 9 male participants and cluster referring to higher memory functioning with high values of 𝐡 3 in the range of 5.97 to 9.0 for 11 male participants (Figure 4(a)). Values of the model’s coefficients are given in Table 2, items 7 and 8. The value of 𝐡 3 is higher ( 𝑃 = . 0 0 2 ) in the cluster associated with higher memory functioning. The averaged learning curve for the cluster of higher memory functioning is shown in Figure 5, empty circles. The cluster of higher memory functioning values better characterized the healthy male sample and was used for further discriminant analysis.

3.5.2. Healthy Male Participants: Clustering Using 𝐡 4 Only

Clustering using 𝐡 4 also separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 4 in the range 9.71 to 11.31 for 5 male participants and a cluster referring to higher memory functioning with high values of 𝐡 4 in the range 12.24 to 16.73 for 15 male participants (Figure 4(b)). Values of the model’s coefficients are in Table 2, items 9 and 10. Coefficient 𝐡 3 and coefficient 𝐡 4 was higher ( 𝑃 = . 0 1 and 𝑃 = . 0 0 0 7 , resp.) in the cluster of higher memory functioning compared to the cluster of lower memory functioning. The averaged learning curve for higher memory functioning is shown in Figure 5, empty diamonds. Comparison of the coefficients in the higher memory cluster using 𝐡 3 and using 𝐡 4 revealed that 𝐡 3 was higher ( 𝑃 = . 0 4 ) in the cluster of higher memory functioning using 𝐡 3 . Thus, cluster referring to higher memory functioning values better characterized the healthy male sample and was used for further discriminant analysis.

3.5.3. MS Male Patients: Clustering Using 𝐡 3 Only

Clustering using 𝐡 3 separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 3 in the range 0.99 to 5.22 for 48 MS male patients and another referring to higher memory functioning with high values of 𝐡 3 in the range 5.64 to 10.02 for 51 MS male patients (Figure 4(c)). Values of the model’s coefficients are given in Table 2, items 11 and 12. The value of 𝐡 3 ( 𝑃 = . 0 0 0 0 6 ) and 𝐡 4 ( 𝑃 = . 0 0 2 ) was higher in the cluster referring to higher memory functioning. The averaged learning curve for the cluster referring to lower memory functioning is shown in Figure 5, filled circles. Though MS male patients were separated using 𝐡 3 , these patients also revealed low value of 𝐡 4 . Thus, cluster referring to lower memory functioning better characterized the MS male sample and was used for further discriminant analysis.

3.5.4. MS Male Patients: Clustering Using 𝐡 4 Only

Clustering using 𝐡 4 separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 4 in the range of 3.95 to 12.35 for 59 MS male patients and one referring to higher memory functioning with high values of 𝐡 4 in the range of 12.85 to 16.43 for 40 MS male patients (Figure 4(d)). Values of the model’s coefficients are given in Table 2, items 13 and 14. The value of 𝐡 3 ( 𝑃 = . 0 1 3 ) and 𝐡 4 ( 𝑃 = . 0 0 1 4 ) was higher in cluster of higher memory functioning. Clustering based on coefficient 𝐡 4 separated the group of MS male patients into two different clusters. The averaged learning curve for the cluster of lower memory functioning is shown in Figure 5, filled diamonds. Though MS male patients were separated using 𝐡 4 , coefficient 𝐡 3 was also impaired to some extent. Comparison of the coefficients between the cluster of lower memory functioning using 𝐡 3 and 𝐡 4 revealed that 𝐡 3 was higher in the cluster of lower memory functioning using 𝐡 4 ( 𝑃 = . 0 0 0 2 ). However, 𝐡 4 was higher in the cluster of lower memory functioning using 𝐡 3 ( 𝑃 = . 0 0 0 8 ). Thus, cluster referring to lower memory functioning better characterized the MS male sample and was used for further discriminant analysis.

3.6. Discriminant Analysis of the Cluster of Higher Memory Functioning for Healthy Male Participants and the Cluster of Lower Memory Functioning for MS Male Patients

Cluster analysis allowed separating healthy participants with high values of 𝐡 3 or 𝐡 4 and MS patients with low values of the same coefficients using 𝐡 3 or 𝐡 4 . Both coefficients ( 𝐡 3 , 𝑃 = . 0 0 0 9 and 𝐡 4 , 𝑃 = . 0 1 3 ) differed significantly between the cluster referring to higher memory functioning for healthy males and the cluster referring to lower memory functioning for MS males using 𝐡 3 . 𝐡 3 ( 𝑃 = . 0 0 0 2 ) and 𝐡 4 ( 𝑃 = . 0 0 0 1 ) also differed significantly between the cluster of higher memory functioning in healthy males and the cluster of lower memory functioning for MS males using 𝐡 4 . We now use discriminant analysis on these clusters.

3.6.1. Discriminant Analysis Using 𝐡 3 Only

First, discriminant analysis was conducted to assess whether coefficient 𝐡 3 only could distinguish the cluster of higher memory functioning for healthy male participants from the cluster of lower memory functioning for MS male patients. Wilk’s lambda was significant, πœ† = . 4 0 6 , πœ’ 2 = 5 0 . 9 5 8 , 𝑃 < . 0 0 1 , indicating that 𝐡 3 only was able to significantly discriminate between the two groups. The boundary value for the discriminant function is  .958. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 4 . 5 1 8 + 0 . 9 3 7 βˆ— 𝐡 3 . The discriminant function correctly predicts 100% of healthy participants and 100% of MS patients. The quality of the discrimination was assessed with the cluster of lower memory functioning (low 𝐡 3 values) using 𝐡 3 for healthy males. Discriminant function values for all 9 of 9 cases were less than 0.958. We also assessed the quality of the cluster of higher memory functioning (high 𝐡 3 values) for MS male patients. Here, only one patient with 𝐡 3 = 5 . 6 4 was classified as belonging to the MS group, but 50 out of 51 patients were correctly classified as belonging to the healthy group. Thus, 98.0% of participants were classified with our model correctly.

3.6.2. Discriminant Analysis Using 𝐡 4 Only

Discriminant analysis was also conducted to assess whether coefficient 𝐡 4 only could distinguish between the cluster of higher memory functioning for healthy male participants from the cluster of lower memory functioning for MS male patients. Wilk’s lambda was significant, πœ† = . 5 3 4 , πœ’ 2 = 4 4 . 8 2 0 , 𝑃 < . 0 0 1 , indicating that the model using 𝐡 4 only was able to successfully discriminate between the two groups. The boundary value for the discriminant function is  .681. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 4 . 9 2 3 + 0 . 4 6 6 βˆ— 𝐡 4 . These results show that the discriminant function correctly predicts 100% of healthy participants (15 of 15) and 91.5% of MS patients (54 of 59). Five cases from the cluster of lower memory functioning using 𝐡 4 for MS males with 𝐡 4 values from 12.11 to 12.35 were classified as belonging to the healthy group. Thus, 93.2% of original grouped cases are correctly classified. The quality of the discrimination was assessed with the cluster of lower memory functioning (low 𝐡 4 values) using 𝐡 4 for healthy males. Discriminant function values for each of the 5 cases were less than 0.681 representing 100% correct classification. Next, the quality of the cluster of higher memory functioning (high 𝐡 4 values) was assessed for MS male patients. All 40 of 40 patients were classified as belonging to healthy group correctly. Thus, 100% are correctly classified.

3.7. Cluster Analysis of Healthy Female Participants and MS Female Patients with 𝐡 3 or 𝐡 4

Previous to conducting the cluster analysis, we modeled each individual learning curve and formed samples of 𝐡 3 and 𝐡 4 . Next we clustered healthy participants and MS patients by 𝐡 3 or 𝐡 4 . Then, the raw recall scoresβ€”trials 1–5 of participants and patients from each cluster were used to calculate the averaged learning curves.

3.7.1. Healthy Female Participants: Clustering Using 𝐡 3 Only

Clustering using 𝐡 3 alone separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 3 in the range of 3.81 to 6.08 for 22 female participants and another referring to higher memory functioning with high values of 𝐡 3 in the range of 6.8 to 12.82 for 56 female participants (Figure 6(a)). Values of the model’s coefficients are in Table 2, items 15 and 16. The value of 𝐡 3 is higher ( 𝑃 = . 0 0 0 2 5 ) in the cluster of higher memory functioning. Averaged learning curve for the cluster of higher memory functioning using 𝐡 3 is shown in Figure 7, empty circles. Clustering using 𝐡 3 separated the group of healthy female participants into two distinct 𝐡 3 clusters. Thus, the cluster referring to higher memory functioning values better characterized the healthy female sample and was used for further discriminant analysis.

3.7.2. Healthy Female Participants: Clustering Using 𝐡 4 Only

Clustering using 𝐡 4 alone separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 4 in the range of 9.58 to 11.51 for 3 female participants and another referring to higher memory functioning with high values of 𝐡 4 in the range of 12.65 to 16.91 for 75 female participants (Figure 6(b)). Values of the model’s coefficients are given in Table 2, items 17 and 18. The value of 𝐡 4 is higher ( 𝑃 = . 0 1 7 ) in the cluster associated with higher memory functioning. Averaged learning curve for the cluster of higher memory functioning using 𝐡 4 is provided in Figure 3, empty diamonds. In healthy female participants, 𝐡 3 is higher in the cluster of higher memory functioning using 𝐡 3 ( 𝑃 = . 0 1 9 ) in comparison with the cluster of higher memory functioning using 𝐡 4 . Thus, the cluster of higher memory functioning better characterized the healthy female sample and was used for further discriminant analysis.

3.7.3. MS Female Patients: Clustering Using 𝐡 3 Only

Clustering using 𝐡 3 separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 3 in the range of 1.0 to 6.22 for 152 female participants and another referring to higher memory functioning with high values of 𝐡 3 in the range of 6.67 to 13.0 for 114 female participants (Figure 6(c)). Values of the model’s coefficients are given in Table 2, items 19 and 20. Here, clustering was performed using 𝐡 3 alone, but nevertheless, both the 𝐡 3 value ( 𝑃 = . 0 0 0 0 3 ), and the 𝐡 4 value ( 𝑃 = . 0 0 1 1 ) was higher in the cluster referring to higher memory functioning. The averaged learning curve for the cluster of lower memory functioning using 𝐡 3 is shown in Figure 7, filled circles. Though MS female patients were separated using 𝐡 3 , the value of 𝐡 4 was also low. Thus, the cluster of lower memory functioning values better characterized the MS female sample and was used for further discriminant analysis.

3.7.4. MS Female Patients: Clustering Using 𝐡 4 Only

Clustering using 𝐡 4 separated all cases into two clusters: one referring to lower memory functioning with low values of 𝐡 4 in the range of 4.0 to 12.83 for 109 MS female patients and another referring to higher memory functioning with high values of 𝐡 4 in the range of 12.98 to 16.99 for 157 MS female patients (Figure 6(d)). Values of the model’s coefficients are given in Table 2, items 21 and 22. Despite the fact that here clustering was performed using 𝐡 4 only, both the 𝐡 3 value ( 𝑃 = . 0 0 0 5 ), and the 𝐡 4 value ( 𝑃 = . 0 0 0 0 3 ) was higher in the cluster of higher memory functioning. The averaged learning curve for the cluster of lower memory functioning using 𝐡 3 is provided in Figure 7, filled diamonds. Though MS female patients were separated using 𝐡 4 , the value of 𝐡 3 was also low. Thus, the cluster of lower memory functioning better characterized the MS female sample and was used for further discriminant analysis.

3.8. Discriminant Analysis of the Cluster of Higher Memory Functioning for Healthy Female Participants and the Cluster of Lower Memory Functioning for MS Female Patients

Cluster analysis allowed us to separate healthy participants with high values of 𝐡 3 or 𝐡 4 and MS patients with low values of the same coefficients using 𝐡 3 or 𝐡 4 . Coefficient 𝐡 3 ( 𝑃 = . 0 0 0 0 5 ) and coefficient 𝐡 4 ( 𝑃 = . 0 0 0 8 ) differed significantly between the cluster of higher memory functioning for healthy females and the cluster of lower memory functioning for MS females using 𝐡 3 . 𝐡 3 ( 𝑃 = . 0 0 0 6 ) and 𝐡 4 ( 𝑃 = . 0 0 0 2 ) also differed significantly between the cluster of higher memory functioning for healthy females and the cluster of lower memory functioning for MS females using 𝐡 4 . We now use discriminate analysis on these clusters.

3.8.1. Discriminant Analysis Using 𝐡 3 Only

First, discriminant analysis was conducted to assess whether coefficient 𝐡 3 alone could distinguish the cluster of higher memory functioning for healthy female participants from the cluster of lower memory functioning for MS female patients. Wilk’s lambda was significant, πœ† = . 3 7 2 , πœ’ 2 = 2 0 2 . 9 6 4 , 𝑃 < . 0 0 1 , indicating that the model using 𝐡 3 alone significantly discriminate between the two groups. The boundary value for the discriminant function is  .672. Discriminant function yielded the following coefficients: 𝑦 = βˆ’ 4 . 9 2 3 + 0 . 8 2 7 βˆ— 𝐡 3 . The results show that the discriminant function correctly predicts 100% of healthy participants and 100% of MS patients. The quality of the discrimination was assessed for the cluster of lower memory functioning (low 𝐡 3 values) using 𝐡 3 for healthy females. The values of the discriminant function for each of the 22 cases were less than 0.672 indicating that all cases were correctly classified. Next, the quality of discrimination was assessed with the cluster of higher memory functioning (high 𝐡 3 values) for MS female patients. Here, only three patients with 𝐡 3 values 6.67, 6.69, and 6.76, respectively, were classified as belonging to MS group, but 111 out of 114 patients were correctly classified as belonging to the healthy group. Thus, 97.4% were correctly classified.

3.8.2. Discriminant Analysis Using 𝐡 4 Only

Discriminant analysis was conducted to assess whether coefficient 𝐡 4 alone could distinguish the cluster of higher memory functioning for healthy female participants from the cluster of lower memory functioning for MS female patients. Wilk’s lambda was significant, πœ† = . 3 0 1 , πœ’ 2 = 2 1 7 . 8 9 0 , 𝑃 < . 0 0 1 , which indicated that 𝐡 4 alone was able to significantly discriminate between the two groups. The boundary value for the discriminant function is  .285. The discriminant function yielded the following coefficients: 𝑦 = βˆ’ 8 . 3 9 9 + 0 . 6 7 9 βˆ— 𝐡 4 . The results show that the discriminant function correctly predicts 98.7% of healthy participants (74 out of 75) and 99.1% of MS patients (108 out of 109). Thus, 98.9% of original grouped cases are correctly classified.

The quality of the discrimination was assessed with the cluster of lower memory functioning (low 𝐡 4 values) using 𝐡 4 for healthy female participants. A discriminant function value for each of the three cases was less than 0.285 indicating 100% correct classification. Next, the quality of the discrimination was assessed with the cluster of higher memory functioning (high 𝐡 4 values) for MS female patients. All 157 of 157 patients were classified correctly as belonging to healthy group using coefficient 𝐡 4 only.

4. Discussion

The identification of memory impairment in multiple sclerosis is important in clinical practice. The CVLT-II is useful for assessing auditory/verbal memory defects in MS patients [15, 16]. We focused on validation of CVLT-II learning trialsβ€”trials 1–5β€”because it requires less time than the full test. We found that these CVLT-II measures contained sufficient information on short-term and long-term memory status and could be used for identification of overall memory impairment in patients with multiple sclerosis. However, adequacy of discriminant analysis depends on some factors that must be taken into account.

4.1. Dividing Participants and Patients by Gender

Our results confirm that females outperform males in the number of recalled correct words from the CVLT-II List A. In the group of healthy participants mean value at each trial is higher in females ( 𝑃 ≀ . 0 0 1 ). In the group of MS patients, mean value at each trial is also higher in females ( 𝑃 ≀ . 0 0 2 ) . Modeling of the learning curve revealed that readiness to learn ( 𝐡 3 ) was higher in healthy female participants. Moreover, both readiness to learn ( 𝐡 3 ) and ability to learn ( 𝐡 4 ) were significantly higher in MS female patients in comparison with MS male patients.

Women score significantly higher than men do on all immediate free recall trials, having recalled an average of approximately one word per trial more than men. Women use active, semantically mediated strategies during encoding, whereas men prefer to recall words in the order in which they were presented, which is a less effective organizational strategy [29].

4.2. Dividing Participants and Patients by High and Low Recall Scores

Our results confirm previous findings that cognitive impairment affects about half of all patients with MS [4–6, 15, 35]. Cluster analysis revealed that 46 out of 99 MS male patients and 133 out of 266 MS female patients fit the cluster associated with lower memory functioning. This result shows that grouping MS patients into two clustersβ€”one that refers to lower memory functioning, and the other associated with higher memory functioningβ€”is ideal for further calculating valid discriminant functions. Thus, the cluster of lower memory functioning should be used for further discriminant analysis.

We also confirmed our previous results that a sample of healthy participant might be nonhomogeneous [18, 26]. A control sample of healthy participants must first be separated into clusters prior to performing discriminant analysis. In this study, 9 out of 20 healthy male participants as well as 38 out of 78 healthy female participants fit the cluster associated with lower memory functioning. Thus, the cluster of healthy participants with higher memory functioning should be used for further discriminant analysis.

4.3. Separate Discrimination by Readiness to Learn and Ability to Learn

Our results show that first dividing participants and patients by gender and then further dividing them by high versus low recall using standard CVLT-II measures trials 1–5 substantially improves classification of memory impairment in patients with multiple sclerosis. Correct classification reached 100% cases. However, a serious shortcoming still remains regarding the separate assessment of short-term or long-term memory impairment. Impairment of cognitive functioning is correlated with brain atrophy revealed with MRI [7, 8]. Memory has multiple regional atrophy correlates, including deep gray matter [9], cerebral cortex volume [8] and medial temporal lobe volume [10, 11]. Therefore, it is reasonable to hypothesize that MS differentially affects multiple aspects of memory, including attention-mediated or executive aspects such as encoding and retrieval as well as consolidation and perhaps even recognition in some circumstances [10–12]. That is why we hypothesize that, depending on the affected MS brain region, short-term memory, or long-term memory might be impaired predominantly and independently. Therefore, 𝐡 3 , reflecting short-term memory, and 𝐡 4 , reflecting long-term memory, should be used as separate independent variables.

4.4. Discriminant Analysis Using 𝐡 3 or 𝐡 4

Prior to cluster analysis, we modeled each individual learning curve and formed samples of 𝐡 3 and 𝐡 4 . Next, we clustered male and female healthy participants and MS patients by 𝐡 3 or 𝐡 4 . Then, clusters with high 𝐡 3 and 𝐡 4 values from healthy participants were discriminated against clusters with low 𝐡 3 and 𝐡 4 values from MS patients. Our results show that clusterization of 𝐡 3 and 𝐡 4 samples instead of clusterization standard CVLT-II measures not only gives 100% correct classification as standard Trial 1–Trial 5 measures can do but allows separate assessments of readiness to learn ( 𝐡 3 ) and ability to learn ( 𝐡 4 ) that the standard Trial 1–Trial 5 measures cannot do.

Thus, if an MS male patient achieved 𝐡 3 < 5 . 8 4 , it suggests an impairment in readiness to learn and, if 𝐡 4 < 1 2 . 0 3 , it indicates an impairment in ability to learn. For females, if an MS patient achieved 𝐡 3 < 6 . 7 6 , it indicates an impairment in readiness to learn and, if 𝐡 4 < 1 2 . 7 9 , it indicates an impairment in ability to learn. By adjusting the sample by gender and overall CVLT-II performance, our method of mathematical modeling correctly classified 100% of cases.

5. Conclusions

The CVLT-II learning trials 1–5 allow using the learning curve for assessment of possible learning and memory impairment. The CVLT-II provides standardized scores for each of the five learning trials, so that the clinician can compare a patient’s raw trials 1–5 scores with standardized scores. However, frequently a patient’s raw scores fluctuate due to different factors, so that some of the raw scores are close to the corresponding standardized scores, but others are below normal values. The CVLT-II does not offer any other methods for classifying a patient’s learning and memory status based on the learning curve.

The main objective of our research was to illustrate that discriminant analysis is the method of choice, especially if suitable predictor variables are selected. Our results showed that the best predictor variables are coefficients 𝐡 3 and 𝐡 4 of our mathematical model because discriminant functions, calculated separately for 𝐡 3 and 𝐡 4 , allow nearly 100% precision of classification. These predictors also allow identification of separate impairment of readiness to learn or ability to learn, or both.

We believe that modeling of the learning curve offers a quick and effective method for assessing overall learning and memory impairment. The use of our mathematical model not only provides clinicians and researchers with additional information not possible with standard measures, it also provides a useful and rapid screening method for detection of early signs of memory impairment in MS patients.

Disclosure

The information in this paper and the paper itself has never been published either electronically or in print and the authors have no conflict of interests.

Acknowledgments

The authors wish to thank their colleagues Shane Stegen, Diane Cookfair, Eben Schwartz, David Hojnacki, and Bianca Weinstock-Guttman for taking part in some aspects of this work.