Research Article

BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information

Table 4

Hyperparameter optimization results of the stacking ensemble classifier.

| Classifier | Hyperparameter | Meaning | Search range | Optimal value |
|---|---|---|---|---|
| Base classifier: LR | C1 | Reciprocal of the regularization coefficient | (1, 50) | 0.0181 |
| Base classifier: LightGBM | learning_rate | Learning rate | (0.01, 1.0) | 0.2533 |
| | max_depth | Maximum depth of the tree | (1, 50) | 12 |
| | max_bin | Maximum number of bins into which feature values are bucketed | (10, 100) | 84 |
| | boosting_type | Training method | gbdt; goss; dart | gbdt |
| | num_leaves | Number of leaf nodes | (1, 50) | 10 |
| | n_estimators | Number of iterations | (100, 600) | 255 |
| Base classifier: SVM | C2 | Regularization constant that determines the penalty on estimation errors | (1, 50) | 1.1322 |
| | Kernel | Kernel function used to realize the nonlinear mapping from the raw feature space to a high-dimensional feature space | linear; sigmoid; poly; rbf | rbf |
| Metaclassifier: LR | C3 | Reciprocal of the regularization coefficient | (1, 50) | 35.5133 |
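
To make the table concrete, the optimal values can be written as keyword arguments for a stacking ensemble. The following is a minimal sketch, not the authors' implementation: it assumes scikit-learn and the lightgbm package, and it maps C1, C2, and C3 to the `C` argument of the respective estimators. The function name `build_stacking_classifier` and the settings `max_iter=1000`, `cv=5`, `probability=True`, and `stack_method="predict_proba"` are illustrative assumptions that do not appear in Table 4.

```python
# Optimal values transcribed from Table 4. Keys follow scikit-learn / LightGBM
# argument names, which is an assumption about the libraries used.
BASE_LR = {"C": 0.0181}  # C1, copied as listed in the table
BASE_LGBM = {
    "learning_rate": 0.2533,
    "max_depth": 12,
    "max_bin": 84,
    "boosting_type": "gbdt",
    "num_leaves": 10,
    "n_estimators": 255,
}
BASE_SVM = {"C": 1.1322, "kernel": "rbf"}  # C2 and Kernel
META_LR = {"C": 35.5133}  # C3


def build_stacking_classifier():
    """Assemble LR + LightGBM + SVM base learners under an LR metaclassifier.

    Imports are local so the parameter dictionaries above remain usable
    even when scikit-learn or lightgbm is not installed.
    """
    from lightgbm import LGBMClassifier
    from sklearn.ensemble import StackingClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import SVC

    base_estimators = [
        # max_iter is raised only to help convergence (assumption).
        ("lr", LogisticRegression(max_iter=1000, **BASE_LR)),
        ("lgbm", LGBMClassifier(**BASE_LGBM)),
        # probability=True lets the SVM supply class probabilities to the
        # metaclassifier (assumption).
        ("svm", SVC(probability=True, **BASE_SVM)),
    ]
    return StackingClassifier(
        estimators=base_estimators,
        final_estimator=LogisticRegression(max_iter=1000, **META_LR),
        stack_method="predict_proba",  # assumption, not stated in Table 4
        cv=5,  # assumption, not stated in Table 4
    )
```

Calling `build_stacking_classifier().fit(X, y)` on a feature matrix `X` and label vector `y` would train the base classifiers with internal cross-validation and then fit the metaclassifier on their out-of-fold probability outputs.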