Abstract

Background. It is well known that laboratory markers could help in identifying risk factors of severe illness and predicting outcomes of diseases. Here, we performed a retrospective modeling study of severity and mortality predictors of hematological and biochemical laboratory parameters in Iranian COVID-19 patients. Methods. Data were obtained retrospectively from medical records of 564 confirmed Iranian COVID-19 cases. According to the disease severity, the patients were categorized into two groups (severe or nonsevere), and based on the outcome of the disease, patients were divided into two groups (recovered or deceased). Demographic and laboratory data were compared between groups, and statistical analyses were performed to define predictors of disease severity and mortality in the patients. Results. The study identified a panel of hematological and biochemical markers associated with the severe outcome of COVID-19 and constructed different predictive models for severity and mortality. The disease severity and mortality rate were significantly higher in elderly inpatients, whereas gender was not a determining factor of the clinical outcome. Age-adjusted white blood cells (WBC), platelet cells (PLT), neutrophil-to-lymphocyte ratio (NLR), red blood cells (RBC), hemoglobin (HGB), hematocrit (HCT), erythrocyte sedimentation rate (ESR), mean corpuscular hemoglobin (MCHC), blood urea nitrogen (BUN), and creatinine (Cr) also showed high accuracy in predicting severe cases at the time of hospitalization, and logistic regression analysis suggested grouped hematological parameters (age, WBC, NLR, PLT, HGB, and international normalized ratio (INR)) and biochemical markers (age, BUN, and lactate dehydrogenase (LDH)) as the best models of combined laboratory predictors for severity and mortality. Conclusion. The findings suggest that a panel of several routine laboratory parameters recorded on admission could be helpful for clinicians to predict and evaluate the risk of disease severity and mortality in COVID-19 patients.

1. Introduction

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), which is responsible for the 2019 coronavirus disease (COVID-19), has been recognized worldwide and is referred to as the global pandemic of the century [1, 2]. As of January 2023, at the time of writing this article, about 665 million cases with different clinical symptoms, from mild to moderate to severe to critical, have been diagnosed, and more than 6.7 million deaths from COVID-19 have been reported worldwide [3]. Amidst the critical topics discussed during the COVID-19 pandemic is how to provide an early diagnosis and management of patients to control the outbreak efficiently. Early stages of the disease usually exhibit a mild clinical spectrum; thereby, timely detection, accurate screening, and isolation of infected individuals are vital in minimizing the risk of transmission and managing the disease [4].

Among laboratory diagnostic tests, real-time reverse transcription-polymerase chain reaction (rRT-PCR) is currently considered the most popular test for diagnosing COVID-19 patients [5, 6] Nevertheless, due to its unsatisfactory test sensitivity, rRT-PCR is prone to false negative results in low viral load samples, primarily in patients representing mild disease manifestations [4]. False-negative and false-positive results could lead to improper disease management and negative impacts on a pandemic containment program [7]. Several studies have shown that analyzing clinical and laboratory parameters in diseases assists in identifying risk factors of severe illness, predicting outcomes [8], and allocating proper medical resources at all levels of care to discount morbidity and mortality [9, 10]. Furthermore, considering laboratory parameters could be a cost-efficient and rapid diagnostic strategy in the later stages of an outbreak [11].

Laboratory hematological and biochemical markers may help to predict COVID-19 prognosis [12]. Many studies have pinpointed various prognostic markers, including D-dimer, C-reactive protein (CRP), lactate dehydrogenase (LDH), and high-sensitivity cardiac troponin, in serum of COVID-19 patients with poor outcomes [8, 9, 13, 14]. Deep analysis of abnormal levels of such factors and the interface between their functions in body organs and mechanisms of viral infection can provide the basis for first-line diagnosis as an efficient screening tool [9]. Further knowledge requires sufficient sampling from different countries and populations to power the research, advance health informatics, and turn the data into clinically valuable conclusions. In the present study, we assessed multiple laboratory hematological and biochemical markers of COVID-19 patients in Iran.

2. Methods

This retrospective case series was performed on data collected from 564 confirmed COVID-19 cases (based on RT-PCR test) admitted to the referral hospital, 5 Azar Hospital, Gorgan, north of Iran, during February and November 2020. A logistic regression was conducted to examine the odds of death based on independent variables, employing a sample size of 546 observations. The analysis achieved 80% power at a significance level of 0.05, enabling the detection of a change in the probability of from the baseline value of 0.2 to 0.31034. This change corresponds to an odds ratio of 1.8. The analysis utilized a two-sided Wald test. Demographic (sex and age) and laboratory data (hematological and biochemical findings) were obtained from patients’ electronic records and case record forms at admission. The patients were classified into severe or nonsevere groups based on the level of care received, intensive care unit or not. The patients were also classified into recovered or deceased groups based on their disease outcome. All the patients were from Golestan Province and had the same geographical origin. The study was approved by the Research Ethics Commission of Golestan University of Medical Sciences (IR.GOUMS.REC.1401.070).

Statistical analyses were performed using SPSS 22.0 software. Descriptive analysis variables were expressed as median (interquartile range (IQR)), , or number (%) when appropriate. The Kolmogorov-Smirnov test was performed to check the normality of the variables. Demographic and laboratory data were compared between groups (severity and outcome) using Student’s -test for normal distribution; otherwise, the Mann–Whitney test was used. Sex was compared using the test, and age was shown as deviation.

To include appropriate parameters in the logistic regression model, we first checked the univariate relationships between the parameters and severity and mortality, and then variables were included in the multiple models. Binary logistic regression analysis was also performed to define independent predictors of disease severity and mortality in COVID-19 patients. To remove the effects of potential confounding factors, i.e., the age parameter in the model, we used the enter method with parameters of a value greater than 0.05 removed for the final model with the forward Wald test. The Hosmer-Lemeshow goodness-of-fit test was used to evaluate how well the model fit with data, reflecting the association between predicted and observed risk. A receiver operating characteristic (ROC) curve was utilized for the predictive performance of disease severity and mortality. values less than 0.05 were regarded as statistically significant. Data for the Bonferroni correction analyses are available for each hypothesis separately.

3. Results

In this study, among 564 COVID-19 patients admitted to the referral hospital, 5 Azar Hospital, Gorgan, northern Iran, from February to November 2020, 298 cases were men (52.8%), and 266 cases were women (47.2%). The average age of the patients was years. Among 564 cases, 97 (17.2%) were deceased, and 467 (82.8%) recovered and could be discharged from hospital. One hundred four (18.5%) and 460 (81.5%) were in the severe and nonsevere groups, respectively. In terms of age, the severe and deceased groups were significantly older than the nonsevere and recovered groups. There was no significant relationship concerning sex between the studied groups (Table 1).

As shown in Table 2, the levels of several hematological parameters, namely, WBC, LYM, NEU, NLR, EOS, RBC, HGB, PLT, HCT, MCHC, ESR, INR, and PT, are significantly different in those who died during hospitalization due to COVID-19 disease compared to those who recovered. Further, the levels of WBC, LYM, NEU, NLR, MO, EOS, RBC, HGB, PLT, HCT, MCHC, INR, and PT were different in severely versus nonseverely diseased groups. Some other parameters did not differ between deceased and recovered groups or those who needed intensive care (severe disease) or were not severely affected.

Table 3 shows significant differences in the level of biochemical parameters such as BUN, Cr, Na, K, Ca, P, LDH, AST, ALP, CPK, D-dimer, D-bili, T-bili, UA, and albumin in deceased compared to recovered groups. There were also significant differences in the level of BUN, Cr, Na, K, Ca, P, LDH, ALP, CK-MB, D-dimer, D-bili, T-Bili, CRP, and albumin between severe and nonsevere groups.

The results of multiple logistic regression models for predictive factors of disease severity and mortality are shown in Table 4. All regression models were age-adjusted to avoid confounding effects on the relationship between severity and mortality. Gender did not significantly differ between groups and was not included in the model as a confounder. Based on the Pearson correlation coefficient, NLR was significantly correlated to lymphocyte and neutrophil counts and percentage and was included in the regression analysis.

In the age-adjusted multiple model, parameters WBC, NLR, ESR, INR, PT, creatinine, BUN, Na, K, P, LDH, ALP, CRP, and D-dimer were independent risk factors of mortality. PLT, RBC, HGB, MCHC, HCT, and albumin were independently protective factors for mortality (Table 4). Moreover, WBC, NLR, INR, PT, BUN, creatinine, K, P, LDH, D-dimer, and ALP were all independent risk factors for the severity of COVID-19 disease. In addition, PLT, RBC, HGB, HCT, MCHC, and albumin were independent protective factors for COVID-19 severity (Table 4).

We then assessed the discriminative power of age-adjusted laboratory variables in predicting the risk of disease severity and mortality in COVID-19 patients by calculating the area under the receiver operating characteristic (ROC) curves (AUC). The results are shown in Table 5.

Logistic regression analysis revealed the best models for combined laboratory predictors for severity and mortality. Model 1 contained a combination of five common laboratory predictors and age variables among hematological parameters (AUC 0.87; 95% confidence interval (CI: 0.83-0.91)). Model 2 included a combination of two common laboratory predictors, age and biochemical parameters (AUC 0.85; 95% CI: 0.80-0.91), which showed a better ability to predict mortality than model 1 (Table 5). Logistic regression found model 3 to be the best model for the hematological parameters (AUC 0.86; 95% confidence interval (CI: 0.81-0.90)), whereas model 4 was best for biochemical parameters to predict disease severity (AUC 0.85; 95% confidence interval (CI: 0.80-0.91)) (Figures 1 and 2).

4. Discussion

Early identification of patients with a potential to develop severe or critical COVID-19 helps to reduce fatality rates and efficient utilization of limited medical resources [15]. Determining predictive laboratory variables can assist physicians in treating and managing patients in time. Previous studies showed that severe COVID-19 patients had more comorbidities, higher levels of LDH, D-dimer, CRP, leukocytes, and neutrophils, as well as lower levels of albumin, platelet, and lymphocyte counts [1618]. While these parameters are typically normal at admission, they usually worsen with the course of the disease [19]. Moreover, it is demonstrated that although COVID-19 affects people at any age, more serious consequences may occur among the elderly [20]. In this study, we identified a panel of hematological and biochemical markers associated with the severe outcome of COVID-19 and constructed different predictive models for severity and mortality. Based on the results, the disease severity and mortality rate were significantly higher in elderly inpatients, whereas gender was not a determining factor of the clinical outcome. In this study, age-adjusted WBC, PLT, NLR, RBC, HGB, HCT, ESR, MCHC, BUN, and creatinine also showed high accuracy in predicting severe cases at the time of hospitalization, and logistic regression analysis suggested grouped hematological parameters (age, WBC, NLR, PLT, HGB, and INR) and biochemical markers (age, BUN, and LDH) as the best models of combined laboratory predictors for severity and mortality.

The virus-induced cytokine storm changes the immune cells’ homeostasis, resulting in severe outcomes with leukocytosis and neutrophilia and susceptibility to bacterial infections [21]. The systemic inflammation caused by the infection, especially in severe and critical cases, leads to an accelerated migration of lymphocytes from the peripheral blood to the lungs, which could be a reason for subsequent lymphopenia [22]. Results of previous studies are consistent with ours in that patients deceased from COVID-19 showed higher numbers of leukocytes and neutrophils but lower platelets, lymphocytes, and eosinophils [23, 24]. Neutrophil counts were significantly higher in the studied patients, and neutrophilia is shown to be associated with poor outcomes such as acute respiratory distress syndrome, intensive care, and mortality. Lymphocytes, the cells that immune responses to viral infections primarily rely on, have been reported with low numbers in severe cases of COVID-19. Accordingly, levels of neutrophil to lymphocyte ratio could be a useful index for early diagnosis of severe COVID-19 [25]. In the study, in agreement with another study [12], we observed high levels of NLR correlate with disease severity and mortality with an odds ratio of 1.1. NLR also had the best single-parameter differential diagnostic effectiveness (NLR ), suggesting it is a promising predictive index. Altogether, while leukocytosis, lymphopenia, and neutrophilia were found as important predictive risk indicators for disease severity and mortality, eosinophils and monocytes did not represent a strong correlation with COVID-19 outcomes, according to the present study.

Our study aligns with another study [26], demonstrating a significant decrease in RBC, HGB, HCT, and MCHC values among critically ill and deceased patients, but not in healthier patients. Although HGB level varies in different studies [2628], the lower levels might be due to the intense inflammation disrupting progenitor cell and erythrocyte functions. Viral inflammation also affects other hematological parameters. Several studies have reported significant alterations in specific coagulation indices, such as the level of D-dimer, prothrombin time, INR, and the number of platelets [29]. A considerably higher incidence of disseminated intravascular coagulation (DIC) has been reported among COVID-19 patients who died from the severe disease compared with those who survived (71.4% vs. 0.6%) [30]. Coagulation cascade activation contributes to higher mortality and is possibly related to the changes in platelet production and destruction in viral infections. This results in coagulation imbalance and microthrombosis in the lungs and other organs [8, 31]. The resulting thrombocytopenia (platelets less than ) seems to be an important indicator of COVID-19 severity and fatal outcomes [31]. In this study, the platelet levels were significantly lower in the severe and deceased groups compared to the nonsevere and recovered groups, although the average platelet level was generally within normal ranges. Data regarding PTT in COVID-19 patients varied in previous studies [30, 32], which may be an effect of therapies and comorbidities. Nevertheless, it is reported that higher levels of D-dimer and prolonged PT are associated with a poor prognosis in COVID-19 patients [30, 31, 33, 34]. Such reports are in line with our observations on D-dimer, PT, and INR, all of which are known to be independent risk markers for COVID-19 severity and subsequent fatal outcomes.

Among other inflammatory markers, CRP and ESR are the two most frequently used in some settings, with the former being a direct indicator of the acute phase inflammation and the latter providing an indirect measure. Many studies have found significant correlations between higher CRP levels and complications such as acute lung injury (ALI), acute respiratory disorder stress (ARDS), and mortality in COVID-19 patients [3537]. Elevated CRP could be due to bacterial coinfection that may occur in severe COVID-19 [38, 39]. Moreover, ESR has been reported as an effective predictor of pneumonia, drug-induced pulmonary toxicity, and coronary heart disease [4042]. Nonetheless, the relationship between levels of ESR and COVID-19 severity is reported controversially in various studies. While many findings indicate a positive correlation between increased ESR levels and COVID-19 severity in hospitalized patients [43, 44], other studies, including ours, showed no significant relationship [45]. In addition, ferritin, an important inflammatory protein associated with thromboembolic events, ARDS, and COVID-19 severity [37, 46], tended to be higher in severe cases in our study, but probably due to the small number of samples not statistically different.

Nonrespiratory presentations of COVID-19 include laboratory abnormalities in different biochemical parameters. Liver, heart, and renal dysfunctions are among the fatal complications contributing to disease mortality [47]. According to a systematic review [48], the most common laboratory abnormality mentioned in COVID-19 patients was hypoalbuminemia, followed by an increase in AST, ALT, GGT, ALP, and total bilirubin [48, 49], all of which are related to kidney, liver, and bile damage. Our results, however, did not show significant evidence of changes in albumin perhaps, possibly because of the small size of the tested population. Noteworthy, higher AST levels were more prevalent than ALT in severe cases of COVID-19, in line with other reports [48]. Based on the present and previous studies, ALP is at higher levels in COVID-19 patients, in whom the raised LDH could be observed in severe cases where a large number of cells may undergo necrosis due to subclinical tissue damage [26, 50]. The pathogenesis of liver damage in SARS-CoV-2 infection may be due to exacerbation of the underlying liver disease, cytopathic effects caused by the virus, hypoxemia-induced ischemia, drug damage, immune system disorder, and cytokine storm [26, 51, 52].

It is worth mentioning that elevated aminotransferases could be related to myositis rather than liver damage in COVID-19. AST levels can be high along with CK-MB, LDH, and troponin, indicating cardiac problems associated with COVID-19 [49]. In our study, we also report a significant increase in CK-MB levels in severe cases compared to nonsevere patients as well as an increase in creatinine kinase in deceased compared to recovered patients. In addition, acute kidney injury (AKI) and electrolyte disturbances are important complications of hospitalized COVID-19 patients. AKI is thought to occur due to multiple pathophysiological mechanisms, such as multiple organ dysfunction, direct viral entry into the renal tubules, and cytokine release syndrome [53]. BUN is reported in many cases of AKI and was significantly higher in severe cases of COVID-19 [54]. Our findings also suggest that enhanced creatinine levels correlate with a two-fold increased risk of COVID-19 severity and fatal outcomes. Furthermore, findings on the relationship between uric acid levels and disease severity in COVID-19 vary in different studies [5557]. In this study, we show hyperuricemia in COVID-19 patients and a significant correlation to the outcome of death. Hyperuricemia might be due to direct pathophysiological effects from inflammation, endothelial dysfunction, activation of the renin-angiotensin-aldosterone system, oxidative stress, and insulin resistance, whereas hypouricemia reported in some studies may be due to the effects of cytokine storm on urate transporters [58].

Electrolyte imbalances like hypocalcemia, hypokalemia, and hyponatremia are common manifestations in hospitalized COVID-19 cases. Hypocalcemia is the most common electrolyte disturbance in COVID-19 disease, resulting from viral effects on calcium signaling pathways, vitamin D deficiency, parathyroid imbalance, malnutrition during critical illness, and unsaturated fatty acids in inflammatory responses [29]. In the context of SARS-CoV-2 infection, hypokalemia may result from hyperactivity of the renin-angiotensin-aldosterone system (RAAS), gastrointestinal losses, anorexia due to concurrent illness, or tubular damage caused by ischemia or nephrotoxic agents [59]. Moreover, the occurrence of COVID-19 symptoms, including anorexia, sweating, diarrhea, and vomiting, may lead to excessive sodium excretion and cause hyponatremia. Again, there are contrasting reports on the relationship between electrolyte imbalance and COVID-19 severity. Our study, however, agrees with Martha et al. [60] regarding the association between hypocalcemia and extended hospital and ICU duration in COVID-19, as well as enhanced levels of sodium and potassium positively correlating with disease severity and mortality. This association is true for levels of phosphorus in severe and deceased patients compared to nonsevere and recovered cases. Our OR analysis shows that phosphate levels were higher than reference values accompanied with a higher probability of death, 1.7-fold, in agreement with Hadavi et al. (1.73-fold) [61]. The proven hyperphosphatemia in COVID-19 may be due to cytokines’ damaging effects on cells and the subsequent release of phosphate ions [62].

So far, several approaches have been used to predict a poor prognosis in COVID-19 patients, including scoring systems based on physiological factors to assess major organ function [63] and analysis of complex medical datasets based on machine learning studies [64]. The studies showed that scoring systems are not specifically designed for COVID-19, and because of limited clinical data, they may not be applicable for nonsevere patients at the time of admission. Also, there are contradictory results of the SOFA (sequential organ failure assessment) score in predicting the severity of patient mortality [65]. Studies have shown an ability of machine learning methods (logistic regression (LR), support vector machine (SVM), decision tree (DT), random forest (RF), and a deep learning-based method (to improve the accuracy and early screening of COVID-19 diagnosis. The higher AUC of our LR-based predictive model makes it a more conducive method for assisting COVID-19 diagnosis [64]. The findings of this study and previous studies with demographics and existing laboratory measurements are routinely available and useful prognostic markers when the patient is admitted to the hospital. However, risk prediction models for COVID-19 require external validation before widespread clinical use, as severity and prognosis depend on the care of settings and hospital systems.

The current study has some limitations. As a single-center, retrospective study on hospitalized patients, results cannot be generalized to the entire population. The sample size is relatively small, and all 46 laboratory tests were not performed for all patients. This may negatively affect the reproducibility of our data, and we may have underestimated the role of some tests in predicting the severity and mortality. Moreover, information about concomitant diseases is not included in this study due to the lack of access to the patients’ personal medical history.

The current study has some limitations. As a single-center, retrospective study on hospitalized patients, its results cannot be generalized to the entire population. The sample size is relatively small, and all 46 tests were not performed for all patients. This issue may negatively affect the reproducibility of our data and have underestimated the role of some tests in predicting the severity and mortality of the disease. Moreover, information about concomitant diseases is not included in this study due to the lack of access to the patients’ personal medical history.

5. Conclusion

In conclusion, we suggest age, WBC, NLR, PLT, HGB, and INR and age, BUN, and LDH models as potential indicators for severe disease or mortality outcome. Considering the potential of routine hematological and biochemical tests in predicting the course of the COVID-19 disease, we believe that results in various studies using different methods, statistical approaches, population characteristics, and geographical locations, along with our study, may provide the best models for predicting severity. This will improve the careful monitoring of severity predictors and facilitate to enable early clinical intervention for patients, thereby reducing the mortality rate of COVID-19 patients and hopefully helping to control and prevent future epidemics and pandemics.

Abbreviations

SARS-CoV-2:Severe acute respiratory syndrome coronavirus-2
COVID-19:Coronavirus disease
rRT-PCR:Real-time reverse transcription-polymerase chain reaction
CRP:C-reactive protein
LDH:Lactate dehydrogenase
IQR:Interquartile range
ROC:Receiver operating characteristic
NLR:Neutrophil to lymphocyte ratio
ALI:Acute lung injury
Acute ARDS:Respiratory disorder stress
AKI:Acute kidney injury
LR:Logistic regression
SVM:Support vector machine
DT:Decision tree
RF:Random forest
WBC:White blood cells
LYM:Lymphocytes
NEU:Neutrophil
MO:Monocyte
EOS:Eosinophil
RBC:Red blood cells
HGB:Hemoglobin
PLT:Platelet cells
HCT:Hematocrit
MCV:Mean corpuscular volume
MCH:Mean corpuscular hemoglobin
MCHC:Mean corpuscular hemoglobin concentration
ESR:Erythrocyte sedimentation rate
INR:International normalized ratio
PT:Prothrombin time
PTT:Partial thromboplastin time
BUN:Blood urea nitrogen
Cr:Creatinine
Na:Sodium
K:Potassium
Ca:Calcium
Mg:Magnesium
P:Phosphorus
LDH:Lactate dehydrogenase
AST:Aspartate aminotransferase
ALT:Alanine transaminase
ALP:Alkaline phosphatase
CPK:Creatine phosphokinase
CK-MB:Creatine kinase-MB
TG:Triglyceride
FBS:Fasting blood sugar
BS:Blood sugar
HbA1C:Hemoglobin A1C
TIBC:Total iron-binding capacity
UA:Uric acid.

Data Availability

The datasets during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Disclosure

This manuscript was extracted from a general physician (GP) thesis.

Conflicts of Interest

The authors report that there are no conflicts of interest.

Authors’ Contributions

A Tahamtan and Britt Nakstad conceptualized and designed the study. G Alizad, AA Ayatollahi, S Samadizadeh, A Shariati Samani, and B Aghcheli collected the data. A Rajabi and G Alizad analyzed the collected data. G Alizad, AA Ayatollahi, S Samadizadeh, and B Aghcheli drafted the manuscript. All authors evaluated and edited the manuscript and have read and approved the final manuscript.

Acknowledgments

The authors appreciate the financial support of the Research Deputy at Golestan Medical University (111699).