Research Article

Identifying Glucose Metabolism Status in Nondiabetic Japanese Adults Using Machine Learning Model with Simple Questionnaire

Table 1

Review of the recent and important studies on prediabetes screening.

Ref. no.Screening targetFactorsModelsTool challenges

[14]FPG 100–125 mg/dL, 120 mPG 100–125 mg/dL, or HbA1c 5.7–6.4%25 of socioeconomic, clinical, and biochemical factorsRF, GBM, LR, and ANNInvasive measurement factors were required for screening
[16]FPG ≥100 mg/dLGlobal diet quality score, age, smoking, alcohol drinking, unable to walk, use of rations card, time spent in sedentary activitiesRF, GLMM, LASSO, and ENWell-trained interviewers were needed to obtain dietary information
[17]HbA1c 5.7–6.4%Age, sex, BMI, waist circumference, and blood pressureRF, GBM, XGB, LR, and DLLack of individuals with high blood glucose levels from screening targets
Some of factors could not be answered on the spot and may require the linkage of the laboratory data
[18]FPG 110–125 mg/dL or HbA1c 5.7–6.4%Age, BMI, waist-to-hip ratio, systolic blood pressure, waist circumference, sleep duration, smoking status, and vigorous recreational activity time per weekXGB and LRLack of individuals with hyperglycemia after glucose loading from screening targets
Some of factors could not be answered on the spot and may require the linkage of the laboratory data
[19]FPG ≥110 mg/dL or 120 mPG ≥140 mg/dLAge, sex, BMI, smoking, FPG, fasting plasma triglyceride level, and history of high FPGLRResearch participants were limited to staffs in an oil field in China invasive measurement factors were required for screening
[20]FPG 100–125 mg/dL, HbA1c 5.7–6.4%, or 120 mPG 140–199 mg/dLSemiquantitative food frequency questionnaire answers and clinical and anthropometric measurements scoresLRWell-trained interviewers were needed to obtain dietary information
Invasive measurement factors were required for screening

Abbreviation: FPG: fasting plasma glucose level; 120 mPG: 120-min postload plasma glucose level during OGTT; HbA1c: hemoglobin A1c; BMI: body mass index; RF: random forest; GBM: gradient boosting machine; LR: logistic regression; ANN: artificial neural network; GLMM: generalized linear mixed model; LASSO: least absolute shrinkage and selection operator; EN: elastic net; XGB: XGBoost; DL: deep learning.