Figures (6)  Tables (1)
    • Figure 1. 

      Flowchart showing that SCCP database was created in this study as a dataset for training of machine learning algorithms.

    • Figure 2. 

      Using Violin plots to compare the number of SCCGs among different plant categories. (a) Comparison of SCCGs number between Fruit trees and medicinal plants, ornamental plants, and vegetables. (b) Comparison of SCCGs number between higher plants and lower plants. (c) Comparison of SCCGs number between dicots, monocots, and other higher plant species.

    • Figure 3. 

      The top 15 GO enrichment items of genes related to sulfur-containing compounds in B. rapa.

    • Figure 4. 

      Comparative analysis of sulfur compound-related genes unearthed by Blast+ and SCCGs_Prediction tools. (a) Common and differential genes detected by Blast+ and SCCGs_Prediction tools. (b) KEGG enrichment analysis of 501 specifically identified by SCCGs_Prediction tool.

    • Figure 5. 

      Home page of the SCCP website.

    • Figure 6. 

      The SCCGs_Prediction tool page of the SCCP website.

    • MethodsNumber of
      features
      F1scoreACCAUC
      SVM-ACC1000.9040.9060.895
      SVM-Kmer4000.9450.9380.936
      SVM-PC-PseAAC250.8080.8310.911
      SVM-Kmer-ACC5000.9220.9230.910
      SVM-Kmer-PC-PseAAC4250.9440.9350.933
      SVM-ACC-PC-PseAAC1250.9160.9170.907
      SVM-ACC-Kmer-PC-PseAAC5250.9210.9230.911

      Table 1. 

      The prediction performance of the SVM model.