Total number of samples in the datasets

Training dataset

Testing dataset

Dataset

Positive samples

Negative samples

Total samples

Positive samples

Negative samples

Total samples

HA5449526110710134413572701
M15479081455135219354
M264410381682178268446
NA39454315826096310512014
NP114821403288282537819
NS11706294046464187481166
NS247511571632133246379
PA2135406762025739971570
PB11995318951845047971301
PB1-F272222062928167588755
PB22157332754845658601425
Combined3272392371957999891788



10-fold cross-validation performance

Model

Accuracy

Sensitivity

Specificity

AUC

MCC

HA98.620.9860.9930.9980.972
M197.660.9770.9870.9850.950
M296.730.9670.9730.9890.931
NA98.350.9840.9910.9960.967
NP97.510.9750.9790.9920.945
NS197.480.9750.9810.9920.946
NS296.570.9660.9710.9800.916
PA98.210.9820.9920.9950.960
PB197.260.9730.9900.9920.942
PB1-F297.990.9800.9870.9920.945
PB298.290.9830.9920.9950.964
Combined99.720.9970.9990.9990.994



Independent testing dataset performance

Model

Accuracy

Sensitivity

Specificity

AUC

MCC

HA98.780.9880.9920.9910.976
M197.180.9720.9840.9840.940
M297.090.9710.9710.9930.939
NA98.560.9860.9870.9980.971
NP97.560.9760.9650.9910.946
NS197.860.9790.9760.9940.953
NS297.630.9761.0000.9760.948
PA97.520.9750.9910.9950.947
PB197.230.9720.9880.9940.942
PB1-F298.540.9850.9880.9940.957
PB297.890.9790.9910.9960.956
Combined99.830.9981.0000.9980.997