2024 Volume 17
Article Contents
About this article
ARTICLE   Open Access    

Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein

More Information
  • Received: 19 August 2024
    Revised: 01 September 2024
    Accepted: 04 September 2024
    Published online: 19 September 2024
    Epigenetics Insights  17 Article number: e001 (2024)  |  Cite this article
  • Honeybees use royal jelly-controlled DNMT3-mediated epigenetic mechanisms to produce two distinct female castes, a long-lived fertile queen and a short-lived sterile worker. DNMT3 inhibition in larvae mimics the effect of royal jelly in terms of phenotypic changes that occur in adult female bees. A key question to be addressed in the honeybee genome is to identify epigenetically active compounds in royal jelly that inhibit DNMT3 and thereby determine developmental fate. Molecular docking, MMGBSA analysis, and MD simulation were performed to identify the lead candidate polyphenolic compounds from royal jelly that inhibit DNMT3. Thirteen polyphenolic compounds were docked to DNMT3 and two basic metrics, XP GScore and MMGBSA dG Bind, were used to evaluate the binding affinity. The highest binding affinity was observed for luteolin 7-O-glucoside with a docking score of −10.3 and kaempferol 3-O-glucoside with −8.9. Furthermore, the two compounds exhibited high total binding energies of −52.8 and −64.85 kJ/mol, respectively. MD simulations show that, unlike kaempferol 3-O-glucoside, luteolin-7-O-glucoside maintains a consistent interaction with the DNMT3 throughout the simulation period. These results suggest that of the 13 polyphenolic compounds in royal jelly, luteolin-7-O-glucoside is the most promising candidate to be the component responsible for most of the DNMT3 inhibitory activity in this diet.
  • Columnar cacti are plants of the Cactaceae family distributed across arid and semi-arid regions of America, with ecological, economic, and cultural value[1]. One trait that makes it possible for the columnar cactus to survive in the desert ecosystem is its thick epidermis covered by a hydrophobic cuticle, which limits water loss in dry conditions[1]. The cuticle is the external layer that covers the non-woody aerial organs of land plants. The careful control of cuticle biosynthesis could produce drought stress tolerance in relevant crop plants[2]. In fleshy fruits, the cuticle maintains adequate water content during fruit development on the plant and reduces water loss in fruit during postharvest[3]. Efforts to elucidate the molecular pathway of cuticle biosynthesis have been carried out for fleshy fruits such as tomato (Solanum lycopersicum)[4], apple (Malus domestica)[5], sweet cherry (Prunus avium)[6], mango (Mangifera indica)[7], and pear (Pyrus 'Yuluxiang')[8].

    The plant cuticle is formed by the two main layers cutin and cuticular waxes[3]. Cutin is composed mainly of oxygenated long-chain (LC) fatty acids (FA), which are synthesized by cytochrome p450 (CYP) enzymes. CYP family 86 subfamily A (CYP86A) enzymes carry out the terminal (ω) oxidation of LC-FA[9]. Then, CYP77A carries out the mid-chain oxidation to synthesize the main cutin monomers. In Arabidopsis, AtCYP77A4 and AtCYP77A6 carry out the synthesis of mid-chain epoxy and mid-chain dihydroxy LC-FA, respectively[10,11]. AtCYP77A6 is required for the cutin biosynthesis and the correct formation of floral surfaces[10]. The expression of CYP77A19 (KF410855) and CYP77A20 (KF410856) from potato (Solanum tuberosum) restored the petal cuticular impermeability in Arabidopsis null mutant cyp77a6-1, tentatively by the synthesis of cutin monomers[12]. In eggplant (Solanum torvum), the over-expression of StoCYP77A2 leads to resistance to Verticillium dahlia infection in tobacco plants[13]. Although the function of CYP77A2 in cutin biosynthesis has not yet been tested, gene expression analysis suggests that CaCYP77A2 (A0A1U8GYB0) could play a role in cutin biosynthesis during pepper fruit development[14].

    It has been hypothesized that the export of cuticle precursors is carried out by ATP binding cassette subfamily G (ABCG) transporters. ABCG11/WBC11, ABCG12, and ABCG13 are required for the load of cuticle lipids in Arabidopsis[1517], but ABCG13 function appears to be specific to the flower epidermis[18]. The overexpression of TsABCG11 (JQ389853) from Thellungiella salsugineum increases cuticle amounts and promotes tolerance to different abiotic stresses in Arabidopsis[19].

    Once exported, the cutin monomers are polymerized on the surface of epidermal cells. CD1 code for a Gly-Asp-Ser-Leu motif lipase/esterase (GDSL) from tomato required for the cutin formation through 2-mono(10,16-dihydroxyhexadecanoyl)glycerol esterification[20]. GDSL1 from tomato carries out the ester bond cross-links of cutin monomers located at the cuticle layers and is required for cuticle deposition in tomato fruits[21]. It has been shown that the transcription factor MIXTA-like reduces water loss in tomato fruits through the positive regulation of the expression of CYP77A2, ABCG11, and GDSL1[22]. Despite the relevant role of cuticles in maintaining cactus homeostasis in desert environments[1], the molecular mechanism of cuticle biosynthesis has yet to be described for cactus fruits.

    Stenocereus thurberi is a columnar cactus endemic from the Sonoran desert (Mexico), which produces an ovoid-globose fleshy fruit named sweet pitaya[23]. In its mature state, the pulp of sweet pitaya contains around 86% water with a high content of antioxidants and natural pigments such as betalains and phenolic compounds, which have nutraceutical and industrial relevance[23]. Due to the arid environment in which pitaya fruit grows, studying its molecular mechanism of cuticle biosynthesis can generate new insights into understanding species' adaptation mechanisms to arid environments. Nevertheless, sequences of transcripts from S. thurberi in public databases are scarce.

    RNA-sequencing technology (RNA-seq) allows the massive generation of almost all the transcripts from non-model plants, even if no complete assembled genome is available[24]. Recent advances in bioinformatic tools has improved our capacity to identify long non-coding RNA (lncRNA), which have been showed to play regulatory roles in relevant biological processes, such as the regulation of drought stress tolerance in plants[25], fruit development, and ripening[2629].

    In this study, RNA-seq data were obtained for the de novo assembly and characterization of the S. thurberi fruit peel transcriptome. As a first approach, three transcripts, StCYP77A, StABCG11, and StGDSL1, tentatively involved in cuticle biosynthesis, were identified and quantified during sweet pitaya fruit development. Due to no gene expression analysis having been carried out yet for S. thurberi, stably expressed constitutive genes were identified for the first time.

    Sweet pitaya fruits (S. thurberi) without physical damage were hand harvested from plants in a native conditions field located at Carbó, Sonora, México. They were collocated in a cooler containing dry ice and transported immediately to the laboratory. The superficial part of the peels (~1 mm deep) was removed carefully from the fruits using a scalpel. Peel samples from three fruits were pooled according to their tentative stage of development defined by their visual characteristics, frozen in liquid nitrogen, and pulverized to create a single biological replicate. Four samples belonging to four different plants were analyzed. All fruits harvested were close to the ripening stage. Samples named M1 and M2 were turning from green to ripe [~35−40 Days After Flowering (DAF)], whereas samples M3 and M4 were turning from ripe to overripe (~40−45 DAF).

    Total RNA was isolated from the peels through the Hot Borate method[30]. The concentration and purity of RNA were determined in a spectrophotometer Nanodrop 2000 (Thermo Fisher) by measuring the 260/280 and 260/230 absorbance ratios. RNA integrity was evaluated through electrophoresis in agarose gel 1% and a Bioanalyzer 2100 (Agilent). Pure RNA was sequenced in the paired-end mode in an Illumina NextSeq 500 platform at the University of Arizona Genetics Core Facility. Four RNA-seq libraries, each of them from each sample, were obtained, which include a total of 288,199,704 short reads with a length of 150 base pairs (bp). The resulting sequence data can be accessed at the Sequence Read Archive (SRA) repository of the NCBI through the BioProject ID PRJNA1030439. Libraries are named corresponding to the names of samples M1, M2, M3, and M4.

    FastQC software (www.bioinformatics.babraham.ac.uk/projects/fastqc) was used for short reads quality analysis. Short reads with poor quality were trimmed or eliminated by Trimmomatic (www.usadellab.org/cms/?page=trimmomatic) with a trailing and leading of 25, a sliding window of 4:25, and a minimum read length of 80 bp. A total of 243,194,888 reads with at least a 25 quality score on the Phred scale were used to carry out the de novo assembly by Trinity (https://github.com/trinityrnaseq/trinityrnaseq/wiki) with the following parameters: minimal k-mer coverage of 1, normalization of 50, and minimal transcript length of 200 bp.

    Removal of contaminating sequences and ribosomal RNA (rRNA) was carried out through SeqClean. To remove redundancy, transcripts with equal or more than 90% of identity were merged through CD-hit (www.bioinformatics.org/cd-hit/). Alignment and quantification in terms of transcripts per million (TPM) were carried out through Bowtie (https://bowtie-bio.sourceforge.net/index.shtml) and RSEM (https://github.com/deweylab/RSEM), respectively. Transcripts showing a low expression (TPM < 0.01) were discarded. Assembly quality was evaluated by calculating the parameters N50 value, mean transcript length, TransRate score, and completeness. The statistics of the transcriptome were determined by TrinityStats and TransRate (https://hibberdlab.com/transrate/). The transcriptome completeness was determined through a BLASTn alignment (E value < 1 × 10−3) by BUSCO (https://busco.ezlab.org/) against the database of conserved orthologous genes from Embryophyte.

    To predict the proteins tentatively coded in the S. thurberi transcriptome, the best homology match of the assembled transcripts was found by alignment to the Swiss-Prot, RefSeq, nr-NCBI, PlantTFDB, iTAK, TAIR, and ITAG databases using the BLAST algorithm with an E value threshold of 1 × 10−10 for the nr-NCBI database and of 1 × 10−5 for the others[3134]. An additional alignment was carried out to the protein databases of commercial fruits Persea americana, Prunus persica, Fragaria vesca, Citrus cinensis, and Vitis vinifera to proteins of the cactus Opuntia streptacantha, and the transcriptomes of the cactus Hylocereus polyrhizus, Pachycereus pringlei, and Selenicereus undatus. The list of all databases and the database websites of commercial fruits and cactus are provided in Supplementary Tables S1 & S2. The open reading frame (ORF) of the transcripts and the protein sequences tentative coded from the sweet pitaya transcriptome was predicted by TransDecoder (https://github.com/TransDecoder/TransDecoder/wiki), considering a minimal ORF length of 75 amino acids (aa). The search for protein domains was carried out by the InterPro database (www.ebi.ac.uk/interpro). Functional categorization was carried out by Blast2GO based on GO terms and KEGG metabolic pathways[35].

    LncRNA were identified based on the methods reported in previous studies[25,29,36]. Transcripts without homology to any protein from Swiss-Prot, RefSeq, nr-NCBI, PlantTFDB, iTAK, TAIR, ITAG, P. americana, P. persica, F. vesca, C. cinensis, V. vinifera, and O. streptacantha databases, without a predicted ORF longer than 75 aa, and without protein domains in the InterPro database were selected to identify tentative lncRNA.

    Transcripts coding for signal peptide or transmembrane helices were identified by SignalP (https://services.healthtech.dtu.dk/services/SignalP-6.0/) and TMHMM (https://services.healthtech.dtu.dk/services/TMHMM-2.0/), respectively, and discarded. Further, transcripts corresponding to other non-coding RNAs (ribosomal RNA and transfer RNA) were identified through Infernal by using the Rfam database[37] and discarded. The remaining transcripts were analyzed by CPC[38], and CPC2[39] to calculate their coding potential. Transcripts with a coding potential score lower than −1 for CPC and a coding probability lower than 0.1 for CPC2 were considered lncRNA. To characterize the identified lncRNA, the length and abundance of coding and lncRNA were calculated. Bowtie and RSEM were used to align and quantify raw counts, respectively. The edgeR package[40] was used to normalize raw count data in terms of counts per million (CPM) for both coding and lncRNA.

    To obtain the transcript's expression, the aligning of short reads and quantifying of transcripts were carried out through Bowtie and RSEM software, respectively. A differential expression analysis was carried out between the four libraries by edgeR package in R Studio. Only the transcripts with a count equal to or higher than 0.5 in at least one sample were retained for the analysis. Transcripts with log2 Fold Change (log2FC) between +1 and −1 and with a False Discovery Rate (FDR) lower than 0.05 were taken as not differentially expressed (NDE).

    For the identification of the tentative reference genes two strategies were carried out as described below: i) The NDE transcripts were aligned by BLASTn (E value < 1 × 10−5) to 43 constitutive genes previously reported in fruits from the cactus H. polyrhizus, S. monacanthus, and S. undatus[4143] to identify possible homologous constitutive genes in S. thurberi. Then, the homologous transcripts with the minimal coefficient of variation (CV) were selected; ii) For all the NDE transcripts, the percentile 95 value of the mean CPM and the percentile 5 value of the CV were used as filters to recover the most stably expressed transcripts, based on previous studies[44]. Finally, transcripts to be tested by quantitative reverse transcription polymerase chain reaction (qRT-PCR) were selected based on their homology and tentative biological function.

    The fruit harvesting was carried out as described above. Sweet pitaya fruit takes about 43 d to ripen, therefore, open flowers were tagged, and fruits with 10, 20, 30, 35, and 40 DAF were collected to cover the pitaya fruit development process (Supplementary Fig. S1). The superficial part of the peels (~1 mm deep) was removed carefully from the fruits using a scalpel. Peel samples from three fruits were pooled according to their stage of development defined by their DAF, frozen in liquid nitrogen, and pulverized to create a single biological replicate. One biological replicate consisted of peels from three fruits belonging to the same plant. Two to three biological replicates were evaluated for each developmental stage. Two technical replicates were analyzed for each biological replicate. RNA extraction, quantification, RNA purity, and RNA integrity analysis were carried out as described above.

    cDNA was synthesized from 100 ng of RNA by QuantiTect Reverse Transcription Kit (QIAGEN). Primers were designed using the PrimerQuest™, UNAFold, and OligoAnalyzer™ tools from Integrated DNA Technologies (www.idtdna.com/pages) and following the method proposed by Thornton & Basu[45]. Transcripts quantification was carried out in a QIAquant 96 5 plex according to the PowerUp™ SYBR™ Green Master Mix protocol (Applied Biosystems), with a first denaturation step for 2 min at 95 °C, followed by 40 cycles of denaturation step at 95 °C for 15 s, annealing and extension steps for 30 s at 60 °C.

    The Cycle threshold (Ct) values obtained from the qRT-PCR were analyzed through the algorithms BestKeeper, geNorm, NormFinder, and the delta Ct method[46]. RefFinder (www.ciidirsinaloa.com.mx/RefFinder-master/) was used to integrate the stability results and to find the most stable expressed transcripts in sweet pitaya fruit peel during development. The pairwise variation value (Vn/Vn + 1) was calculated through the geNorm algorithm in R Studio software[47].

    An alignment of 17 reported cuticle biosynthesis genes from model plants were carried out by BLASTx against the predicted proteins from sweet pitaya. Two additional alignments of 17 charaterized cuticle biosynthesis proteins from model plants against the transcripts and predicted proteins of sweet pitaya were carried out by tBLASTn and BLASTp, respectively. An E value threshold of 1 × 10−5 was used, and the unique best hits were recovered for all three alignments. The sequences of the 17 characterized cuticle biosynthesis genes and proteins from model plants are showed in Supplementary Table S3. The specific parameters and the unique best hits for all the alignments carried out are shown in Supplementary Tables S4S8.

    Cuticle biosynthesis-related transcripts tentatively coding for a cytochrome p450 family 77 subfamily A (CYP77A), a Gly-Asp-Ser-Leu motif lipase/esterase 1 (GDSL1), and an ATP binding cassette transporter subfamily G member 11 (ABCG11) were identified by best bi-directional hit according to the functional annotation described above. Protein-conserved domains, signal peptide, and transmembrane helix were predicted through InterProScan, SignalP 6.0, and TMHMM, respectively. Alignment of the protein sequences to tentative orthologous of other plant species was carried out by the MUSCLE algorithm[48]. A neighbor-joining (NJ) phylogenetic tree with a bootstrap of 1,000 replications was constructed by MEGA11[49].

    Fruit sampling, primer design, RNA extraction, cDNA synthesis, and transcript quantification were performed as described above. Relative expression was calculated according to the 2−ΔΔCᴛ method[50]. The sample corresponding to 10 DAF was used as the calibrator. The transcripts StEF1a, StTUA, StUBQ3, and StEF1a + StTUA were used as normalizer genes.

    Normality was assessed according to the Shapiro-Wilk test. Significant differences in the expression of the cuticle biosynthesis-related transcripts between fruit developmental stages were determined by one-way ANOVA based on a completely randomized sampling design and a Tukey honestly significant difference (HSD) test, considering a p-value < 0.05 as significant. Statistical analysis was carried out through the stats package in R Studio.

    RNA was extracted from the peels of ripe sweet pitaya fruits (S. thurberi) from plants located in the Sonoran Desert, Mexico. Four cDNA libraries were sequenced in an Illumina NextSeq 500 platform at the University of Arizona Genetics Core Facility. A total of 288,199,704 reads with 150 base pairs (bp) in length were sequenced in paired-end mode. After trimming, 243,194,888 (84.38%) cleaned short reads with at least 29 mean quality scores per read in the Phred scale and between 80 to 150 bp in length were obtained to carry out the assembly. After removing contaminating sequences, redundancy, and low-expressed transcripts, the assembly included 174,449 transcripts with an N50 value of 2,110 bp. Table 1 shows the different quality variables of the S. thurberi fruit peel transcriptome. BUSCO score showed that 85.4% are completed transcripts, although out of these, 37.2% were found to be duplicated. The resulting sequence data can be accessed at the SRA repository of the NCBI through the BioProject ID PRJNA1030439.

    Table 1.  Quality metrics of the Stenocereus thurberi fruit peel transcriptome.
    Metric Data
    Total transcripts 174,449
    N50 2,110
    Smallest transcript length (bp) 200
    Largest transcript length (bp) 19,114
    Mean transcript length (bp) 1,198.69
    GC (%) 41.33
    Total assembled bases 209,110,524
    TransRate score 0.05
    BUSCO score (%) C: 85.38 (S:48.22, D:37.16),
    F: 10.69, M: 3.93.
    Values were calculated through the TrinityStats function of Trinity and TransRate software. Completeness analysis was carried out through BUSCO by aligning the transcriptome to the Embryophyte database through BLAST with an E value threshold of 1 × 10−3. Complete (C), single (S), duplicated (D), fragmented (F), missing (M).
     | Show Table
    DownLoad: CSV

    A summary of the homology search in the main public protein database for the S. thurberi transcriptome is shown in Supplementary Table S1. From these databases, the higher homologous transcripts were found in RefSeq with 93,993 (53.87 %). Based on the E value distribution, for 41,685 (44%) and 68,853 (49%) of the hits, it was found a strong homology (E value lower than 1 × 10−50) to proteins in the Swiss-Prot and RefSeq databases, respectively (Supplementary Fig. S2a & b). On the other hand, 56,539 (52.34%) and 99,599 (71.11%) of the matches showed a percentage of identity higher than 60% in the Swiss-Prot and RefSeq databases, respectively (Supplementary Fig. S2c & d).

    Figure 1 shows the homology between transcripts from S. thurberi and proteins of commercial fruits, as well as proteins and transcripts of cacti. Transcripts from S. thurberi homologous to proteins from fruits of commercial interest avocado (P. americana), peach (P. persica), strawberry (F. vesca), orange (C. sinensis), and grapefruit (V. vinifera) ranged from 77,285 (44.30%) to 85,421 (48.96%), with 70,802 transcripts homologous to all the five fruit protein databases (Fig. 1a).

    Transcripts homologous to transcripts or proteins from the cactus dragon fruit (H. polyrhizus), prickly pear cactus (O. streptacantha), Mexican giant cardon (P. pringlei), and pitahaya (S. undatus) ranged from 76,238 (43.70%) to 114,933 (65.88%), with 64,009 transcripts homologous to all the four cactus databases (Fig. 1b). Further, out of the total of transcripts, 44,040 transcripts (25.25%) showed homology only to sequences from cactus, but not for model plants Arabidopsis, tomato, or the commercial fruits included in this study (Fig. 1c).

    Figure 1.  Venn diagram of the homology search results against model plants databases, commercial fruits, and cactus. The number in the diagram corresponds to the number of transcripts from S. thurberi homologous to sequences from that plant species. (a) Homologous to sequences from Fragaria vesca (Fa), Persea americana (Pa), Prunus persica (Pp), Vitis vinifera (Vv), and Citrus sinensis (Cs). (b) Homologous to sequences from Opuntia streptacantha (Of), Selenicereus undatus (Su), Hylocereus polyrhizus (Hp), and Pachycereus pringlei (Pap). (c) Homologous to sequences from Solanum lycopersicum (Sl), Arabidopsis thaliana (At), from the commercial fruits (Fa, Pa, Pp, Vv, and Cs), or the cactus included in this study (Of, Su, Hp, and Pap). Homologous searching was carried out by BLAST alignment (E value < 1 × 10−5). The Venn diagrams were drawn by ggVennDiagram in R Studio.

    A total of 45,970 (26.35%), 58,704 (33.65%), and 48,186 (27.65%) transcripts showed homology to transcription factors, transcriptional regulators, and protein kinases in the PlantTFDB, iTAK-TR, and iTAK-PK databases, respectively (Supplementary Tables S1, S9S11). For the PlantTFDB, the homologous transcripts belong to 57 transcriptional factors (TF) families (Fig. 2 & Supplementary Table S9), from which, the most frequent were the basic-helix-loop-helix (bHLH), myeloblastosis-related (MYB-related), NAM, ATAF, and CUC (NAC), ethylene responsive factor (ERF), and the WRKY domain families (WRKY) (Fig. 2).

    Figure 2.  Transcription factor (TF) families distribution of S. thurberi fruit peel transcriptome. The X-axis indicates the number of transcripts with hits to each TF family. Alignment to the PlantTFDB database by BLASTx was carried out with an E value threshold of 1 × 10−5. The bar graph was drawn by ggplot2 in R Studio.

    Based on the homology found and the functional domain searches, gene ontology terms (GO) were assigned to 68,559 transcripts (Supplementary Table S12). Figure 3 shows the top 20 GO terms assigned to the S. thurberi transcriptome, corresponding to the Biological Processes (BP) and Molecular Function (MF) categories. For BP, organic substance metabolic processes, primary metabolic processes, and cellular metabolic processes showed a higher number of transcripts (Supplementary Table S13). Further, for MF, organic cyclic compound binding, heterocyclic compound binding, and ion binding were the processes with the higher number of transcripts. S. thurberi transcripts were classified into 142 metabolic pathways from the KEGG database (Supplementary Table S14). The pathways with the higher number of transcripts recorded were pyruvate metabolism, glycerophospholipid metabolism, glycolysis/gluconeogenesis, and citrate cycle. Further, among the top 20 KEEG pathways, the cutin, suberin, and wax biosynthesis include more than 30 transcripts (Fig. 4).

    Figure 3.  Top 20 Gene Ontology (GO) terms assigned to the S. thurberi fruit peel transcriptome. Bars indicate the number of transcripts assigned to each GO term. Assignment of GO terms was carried out by Blast2GO with default parameters. BP and MF mean Biological Processes and Molecular Functions GO categories, respectively. The graph was drawn by ggplot2 in R Studio.
    Figure 4.  Top 20 KEGG metabolic pathways distribution in the S. thurberi fruit peel transcriptome. Bars indicate the number of transcripts assigned to each KEGG pathway. Assignment of KEGG pathways was carried out in the Blast2GO suite. The bar graph was drawn by ggplot2 in R Studio.

    Out of the total of transcripts, 43,391 (24.87%) were classified as lncRNA (Supplementary Tables S15 & S16). Figure 5 shows a comparison of the length (Fig. 5a) and expression (Fig. 5b) of lncRNA and coding RNA. Both length and expression values were higher in coding RNA than in lncRNA. In general, coding RNA ranged from 201 to 18,629 bp with a mean length of 1,507.18, whereas lncRNA ranged from 200 to 5,198 bp with a mean length of 481.51 (Fig. 5a). The higher expression values recorded from coding RNA and lncRNA were 12.83 and 9.45 log2(CPM), respectively (Fig. 5b).

    Figure 5.  Comparison of coding RNA and long non-coding RNA (lncRNA) from S. thurberi transcriptome. (a) Box plot of transcript length distribution. The Y-axis indicates the length of each transcript in base pairs. (b) Box plot of expression levels. The Y-axis indicates the log2 of the count per million of reads (log2(CPM)) recorded for each transcript. Expression levels were calculated by the edgeR package in R studio. (a), (b) The lines inside the boxes indicate the median. The higher and lower box limits represent the 75th and 25th percentiles, respectively. The box plots were drawn by ggplot2 in R Studio.

    To identify the transcripts without significant changes in expression between the four RNA-seq libraries, a differential expression analysis was carried out. Of the total of transcripts, 4,980 were not differentially expressed (NDE) at least in one paired comparison between the libraries (Supplementary Tables S17S20). Mean counts per million of reads (CPM) and coefficient of variation (CV)[44] were calculated for these NDE transcripts. Transcripts with a CV value lower than 0.113, corresponding with the percentile 5 of the CV, and a mean CPM higher than 1,138.06, corresponding with the percentile 95 of the mean CPM were used as filters to identify the most stably expressed transcripts (Supplementary Table S21). Based on its homology and its tentative biological function, five transcripts were selected to be tested as tentative reference genes. Besides, three NDE transcripts homologous to previously identified stable expressed reference genes in other species of cactus fruit[4143] were selected (Supplementary Table S22). Homology metrics for the eight tentative reference genes selected are shown in Supplementary Table S23. The primer sequences used to amplify the transcripts by qRT-PCR and their nucleotide sequence are shown in Supplementary Tables S24 & S25, respectively.

    The amplification specificity of the eight candidate reference genes determined by melting curves analysis is shown in Supplementary Fig. S3. For the eight tentative reference transcripts selected, the cycle threshold (Ct) values were recorded during sweet pitaya fruit development by qRT-PCR (Supplementary Table S26). The Ct values obtained ranged from 16.85 to 30.26 (Fig. 6a). Plastidic ATP/ADP-transporter (StTLC1) showed the highest Ct values with a mean of 27.34 (Supplementary Table S26). Polyubiquitin 3 (StUBQ3) showed the lowest Ct values in all five sweet pitaya fruit developmental stages (Fig. 6a).

    Figure 6.  Expression stability analysis of tentative reference genes. (a) Box plot of cycle threshold (Ct) distribution of candidate reference genes during sweet pitaya fruit development (10, 20, 30, 35, and 40 d after flowering). The black line inside the box indicates the median. The higher and lower box limits represent the 75th and 25th percentiles, respectively. (b) Bar chart of the geometric mean (geomean) of ranking values calculated by RefFinder for each tentative reference gene (X-axis). The lowest values indicate the best reference genes. (c) Bar chart of the pairwise variation analysis and determination of the optimal number of reference genes by the geNorm algorithm. A pairwise variation value lower than 0.15 indicates that the use of Vn/Vn + 1 reference genes is reliable for the accurate normalization of qRT-PCR data. The Ct data used in the analysis were calculated by qRT-PCR in a QIAquant 96 5 plex (QIAGEN) according to the manufacturer's protocol. The box plot and the bar graphs were drawn by ggplot2 and Excel programs, respectively. Abbreviations: Actin 7 (StACT7), alpha-tubulin (StTUA), elongation factor 1-alpha (StEF1a), COP1-interactive protein 1 (StCIP1), plasma membrane ATPase 4 (StPMA4), BEL1-like homeodomain protein 1 (StBLH1), polyubiquitin 3 (StUBQ3), and plastidic ATP/ADP-transporter (StTLC1).

    The best stability values calculated by NormFinder were 0.45, 0.51, 0.97, and 0.99, corresponding to the transcripts elongation factor 1-alpha (StEF1a), alpha-tubulin (StTUA), plastidic ATP/ADP-transporter (StTLC1), and actin 7 (StACT7), respectively (Supplementary Table S27). For BestKeeper, the most stable expressed transcripts were StUBQ3, StTUA, and StEF1a, with values of 0.72, 0.75, and 0.87, respectively. In the case of the delta Ct method[51], the transcripts StEF1a, StTUA, and StTLC1 showed the best stability.

    According to geNorm analysis, the most stable expressed transcripts were StTUA, StEF1a, StUBQ3, and StACT7, with values of 0.74, 0.74, 0.82, and 0.96, respectively. All the pairwise variation values (Vn/Vn + 1) were lower than 0.15, ranging from 0.019 for V2/V3 to 0.01 for V6/V7 (Fig. 6c). The V value of 0.019 obtained for V2/V3 indicates that the use of the best two reference genes StTUA and StEF1a is reliable enough for the accurate normalization of qRT-PCR data, therefore no third reference gene is required[47]. Except for BestKeeper analysis, StEF1a and StTUA were the most stable transcripts for all of the methods carried out in this study (Supplementary Table S27). The comprehensive ranking analysis indicates that StEF1a, followed by StTUA and StUBQ3, are the most stable expressed genes and are stable enough to be used as reference genes in qRT-PCR analysis during sweet pitaya fruit development (Fig. 6b).

    Three cuticle biosynthesis-related transcripts TRINITY_DN17030_c0_g1_i2, TRINITY_DN15394_c0_g1_i1, and TRINITY_DN23528_c1_g1_i1 tentatively coding for the enzymes cytochrome p450 family 77 subfamily A (CYP77A), Gly-Asp-Ser-Leu motif lipase/esterase 1 (GDSL1), and an ATP binding cassette transporter subfamily G member 11 (ABCG11/WBC11), respectively, were identified and quantified. The nucleotide sequence and predicted amino acid sequences of the three transcripts are shown in Supplementary File 1. The best homology match for StCYP77A (TRINITY_DN17030_c0_g1_i2) was for AtCYP77A4 (AT5G04660) from Arabidopsis and SmCYP77A2 (P37124) from eggplant (Solanum melongena) in the TAIR and Swiss-Prot databases, respectively (Supplementary Table S23).

    TransDecoder, InterPro, and TMHMM analysis showed that StCYP77A codes a polypeptide of 518 amino acids (aa) in length that comprises a cytochrome P450 E-class domain (IPR002401) and a transmembrane region (residues 10 to 32). The phylogenetic tree constructed showed that StCYP77A is grouped in a cluster with all the CYP77A2 proteins included in this analysis, being closer to CYP77A2 (XP_010694692) from B. vulgaris and Cgig2_012892 (KAJ8441854) from Carnegiea gigantean (Supplementary Fig. S4).

    StGDSL1 (TRINITY_DN15394_c0_g1_i1) alignment showed that it is homologous to a GDSL esterase/lipase from Arabidopsis (Q9LU14) and tomato (Solyc03g121180) (Supplementary Table S23). TransDecoder, InterPro, and SignalP analysis showed that StGDSL1 codes a polypeptide of 354 aa in length that comprises a GDSL lipase/esterase domain IPR001087 and a signal peptide with a cleavage site between position 25 and 26 (Supplementary Fig. S5).

    Supplementary Figure S6 shows the analysis carried out on the predicted amino acid sequence of StABCG11 (TRINITY_DN23528_c1_g1_i1). The phylogenetic tree constructed shows three clades corresponding to the ABCG13, ABCG12, and ABCG11 protein classes with bootstrap support ranging from 40% to 100% (Supplementary Fig. S6a). StABCG11 is grouped with all the ABCG11 transporters included in this study in a well-separated clade, being closely related to its tentative ortholog from C. gigantean Cgig2_004465 (KAJ8441854). InterPro and TMHMM results showed that the StABCG11 sequence contains an ABC-2 type transporter transmembrane domain (IPR013525; PF01061.27) with six transmembrane helices (Supplementary Fig. S6b).

    The predicted protein sequence of StABCG11 is 710 aa in length, holding the ATP binding domain (IPR003439; PF00005.30) and the P-loop containing nucleoside triphosphate hydrolase domain (IPR043926; PF19055.3) of the ABC transporters of the G family. Multiple sequence alignment shows that the Walker A and B motif sequence and the ABC signature[15] are conserved between the ABCG11 transporters from Arabidopsis, tomato, S. thurberi, and C. gigantean (Supplementary Fig. S6c).

    According to the results of the expression stability analysis (Fig. 6), four normalization strategies were tested to quantify the three cuticle biosynthesis-related transcripts during sweet pitaya fruit development. The four strategies consist of normalizing by StEF1a, StTUA, StUBQ3, or StEF1a+StTUA. Primer sequences used to quantify the transcripts StCYP77A (TRINITY_DN17030_c0_g1_i2), StGDSL1 (TRINITY_DN15394_c0_g1_i1), and StABCG11 (TRINITY_DN23528_c1_g1_i1) by qRT-PCR during sweet pitaya fruit development are shown in Supplementary Table S24.

    The three cuticle biosynthesis-related transcripts showed differences in expression during sweet pitaya fruit development (Supplementary Table S28). The same expression pattern was recorded for the three cuticle biosynthesis transcripts when normalization was carried out by StEF1a, StTUA, StUBQ3, or StEF1a + StTUA (Fig. 7). A higher expression of StCYP77A and StGDSL1 are shown at the 10 and 20 DAF, showing a decrease at 30, 35, and 40 DAF. StABCG11 showed a similar behavior, with a higher expression at 10 and 20 DAF and a reduction at 30 and 35 DAF. Nevertheless, unlike StCYP77A and StGDSL1, a significant increase at 40 DAF, reaching the same expression as compared with 10 DAF, is shown for StABCG11 (Fig. 7).

    Figure 7.  Expression analysis of cuticle biosynthesis-related transcripts StCYP77A, StGDSL1, and StABCG11 during sweet pitaya (Stenocereus thurberi) fruit development. Relative expression was calculated through the 2−ΔΔCᴛ method using elongation factor 1-alpha (StEF1a), alpha-tubulin (StTUA), polyubiquitin 3 (StUBQ3), or StEF1a + StTUA as normalizing genes at 10, 20, 30, 35, and 40 d after flowering (DAF). The Y-axis and error bars represent the mean of the relative expression ± standard error (n = 4−6) for each developmental stage in DAF. The Ct data for the analysis was recorded by qRT-PCR in a QIAquant 96 5 plex (QIAGEN) according to the manufacturer's protocol. The graph line was drawn by ggplot2 in R Studio. Abbreviations: cytochrome p450 family 77 subfamily A (StCYP77A), Gly-Asp-Ser-Leu motif lipase/esterase 1 (StGDSL1), and ATP binding cassette transporter subfamily G member 11 (StABCG11).

    Characteristics of a well-assembled transcriptome include an N50 value closer to 2,000 bp, a high percentage of conserved transcripts completely assembled (> 80%), and a high proportion of reads mapping back to the assembled transcripts[52]. In the present study, the first collection of 174,449 transcripts from S. thurberi fruit peel are reported. The generated transcriptome showed an N50 value of 2,110 bp, a TransRate score of 0.05, and a GC percentage of 41.33 (Table 1), similar to that reported for other de novo plant transcriptome assemblies[53]. According to BUSCO, 85.4% of the orthologous genes from the Embryophyta databases completely matched the S. thurberi transcriptome, and only 3.9% were missing (Table 1). These results show that the S. thurberi transcriptome generated is not fragmented, and it is helpful in predicting the sequence of almost all the transcripts expressed in sweet pitaya fruit peel[24].

    The percentage of transcripts homologous found, E values, and identity distribution (Supplementary Tables S1 & S2; Supplementary Fig. S2) were similar to that reported in the de novo transcriptome assembly for non-model plants and other cactus fruits[4143,54] and further suggests that the transcriptome assembled of S. thurberi peel is robust[52]. Of the total of transcripts, 70,802 were common to all the five commercial fruit protein databases included in this study, which is helpful for the search for conserved orthologous involved in fruit development and ripening (Fig. 2a). A total of 34,513 transcripts (20%) show homology only to sequences in the cactus's databases, but not in the others (Supplementary Tables S1 & S2; Fig. 1c). This could suggest that a significant conservation of sequences among plants of the Cactaceae family exists which most likely are to have a function in this species adaptation to desert ecosystems.

    To infer the biological functionality represented by the S. thurberi fruit peel transcriptome, gene ontology (GO) terms and KEGG pathways were assigned. Of the main metabolic pathways assigned, 'glycerolipid metabolism' and 'cutin, suberine, and wax biosynthesis' suggests an active cuticle biosynthesis in pitaya fruit peel (Fig. 4). In agreement with the above, the main GO terms assigned for the molecular function (MF) category were 'organic cyclic compound binding', 'transmembrane transporter activity', and 'lipid binding' (Fig. 3). For the biological processes (BP) category, the critical GO terms for the present research are 'cellular response to stimulus', 'response to stress', 'anatomical structure development', and 'transmembrane transport', which could suggest the active development of the fruit epidermis and cuticle biosynthesis for protection to stress.

    The most frequent transcription factors (TF) families found in S. thurberi transcriptome were NAC, WRKY, bHLH, ERF, and MYB-related (Fig. 2), which had been reported to play a function in the tolerance to abiotic stress in plants[55,56]. Although the role of NAC, WRKY, bHLH, ERF, and MYB TF in improving drought tolerance in relevant crop plants has been widely documented[57,58], their contribution to the adaptation of cactus to arid ecosystems has not yet been elucidated and further experimental pieces of evidence are needed.

    It has been reported that the heterologous expression of ERF TF from Medicago truncatula induces drought tolerance and cuticle wax biosynthesis in Arabidopsis leaf[59]. In tomato fruits, the gene SlMIXTA-like which encodes a MYB transcription factor avoids water loss through the positive regulation of genes related to the biosynthesis and transport of cuticle compounds[22]. Despite the relevant role of cuticles in maintaining cactus physiology in desert environments, experimental evidence showing the role of the different TF-inducing cuticle biosynthesis has yet to be reported for cactus fruits.

    Out of the transcripts, 43,391 were classified as lncRNA (Supplementary Tables S15 & S16). This is the first report of lncRNA identification for the species S. thurberi. In fruits, 3,679 lncRNA has been identified from tomato[26], 3,330 from peach (P. persica)[29], 3,857 from melon (Cucumis melo)[28], 2,505 from hot pepper (Capsicum annuum)[27], and 3,194 from pomegranate (Punica granatum)[36]. Despite the stringent criteria to classify the lncRNA of sweet pitaya fruit (S. thurberi), a higher number of lncRNAs are shown when compared with previous reports. This finding is most likely due to the higher level of redundancy found during the transcriptome analysis. To reduce this redundancy, further efforts to achieve the complete genome assembly of S. thurberi are needed.

    Previous studies showed that lncRNA is shorter and has lower expression levels than coding RNA[6062]. In agreement with those findings, both the length and expression values of lncRNA from S. thurberi were lower than coding RNA (Fig. 5). It has been suggested that lncRNA could be involved in the biosynthesis of cuticle components in cabbage[61] and pomegranate[36] and that they could be involved in the tolerance to water deficit through the regulation of cuticle biosynthesis in wild banana[60]. Nevertheless, the molecular mechanism by which lncRNA may regulate the cuticle biosynthesis in S. thurberi fruits has not yet been elucidated.

    A relatively constant level of expression characterizes housekeeping genes because they are involved in essential cellular functions. These genes are not induced under specific conditions such as biotic or abiotic stress. Because of this, they are very useful as internal reference genes for qRT-PCR data normalization[63]. Nevertheless, their expression could change depending on plant species, developmental stages, and experimental conditions[64]. Reliable reference genes for a specific experiment in a given species must be identified to carry out an accurate qRT-PCR data normalization[63]. An initial screening of the transcript expression pattern through RNA-seq improves the identification of stably expressed transcripts by qRT-PCR[44,64].

    Identification of stable expressed reference transcripts during fruit development has been carried out in blueberry (Vaccinium bracteatum)[65], kiwifruit (Actinidia chinensis)[66], peach (P. persica)[67], apple (Malus domestica)[68], and soursop (Annona muricata)[69]. These studies include the expression stability analysis through geNorm, NormFinder, and BestKeeper algorithms[68,69], some of which are supported in RNA-seq data[65,66]. Improvement of expression stability analysis by RNA-seq had led to the identification of non-previously reported reference genes with a more stable expression during fruit development than commonly known housekeeping genes in grapevine (V. vinifera)[44], pear (Pyrus pyrifolia and P. calleryana)[64], and pepper (C. annuum)[70].

    For fruits of the Cactaceae family, only a few studies identifying reliable reference genes have been reported[4143]. Mainly because gene expression analysis has not been carried out previously for sweet pitaya (S. thurberi), the RNA-seq data generated in this work along with geNorm, NormFinder, BestKeeper, and RefFinder algorithms were used to identify reliable reference genes. The comprehensive ranking analysis showed that out of the eight candidate genes tested, StEF1a followed by StTUA and StUBQ3 were the most stable (Fig. 6b). All the pairwise variation values (Vn/Vn + 1) were lower than 0.15 (Fig. 6c), which indicates that StEF1a, StTUA, and StUBQ3 alone or the use of StEF1a and StTUA together are reliable enough to normalize the gene expression data generated by qRT-PCR.

    The genes StEF1a, StTUA, and StUBQ3 are homologous to transcripts found in the cactus species known as dragonfruit (Hylocereus monacanthus and H. undatus)[41], which have been tested as tentative reference genes during fruit development. EF1a has been proposed as a reliable reference gene in the analysis of changes in gene expression of dragon fruit (H. monacanthus and H. undatus)[41], peach (P. persica)[67], apple (M. domestica)[68], and soursop (A. muricata)[69]. According to the expression stability analysis carried out in the present study (Fig. 6) four normalization strategies were designed. The same gene expression pattern was recorded for the three target transcripts evaluated when normalization was carried out by the genes StEF1a, StTUA, StUBQ3, or StEF1a + StTUA (Fig. 7). Further, these data indicates that these reference genes are reliable enough to be used in qRT-PCR experiments during fruit development of S. thurberi.

    The plant cuticle is formed by two main layers: the cutin, composed mainly of mid-chain oxygenated LC fatty acids, and the cuticular wax, composed mainly of very long-chain (VLC) fatty acids, and their derivates VLC alkanes, VLC primary alcohols, VLC ketones, VLC aldehydes, and VLC esters[3]. In Arabidopsis CYP77A4 and CYP77A6 catalyze the synthesis of midchain epoxy and hydroxy ω-OH long-chain fatty acids, respectively[10,11], which are the main components of fleshy fruit cuticle[3].

    The functional domain search carried out in the present study showed that StCYP77A comprises a cytochrome P450 E-class domain (IPR002401) and a membrane-spanning region from residues 10 to 32 (Supplementary Fig. S4). This membrane-spanning region has been previously characterized in CYP77A enzymes from A. thaliana and Brassica napus[11,71]. It suggests that the protein coded by StCYP77A could catalyze the oxidation of fatty acids embedded in the endoplasmic reticulum membrane of the epidermal cells of S. thurberi fruit. Phylogenetic analysis showed that StCYP77A was closer to proteins from its phylogenetic-related species B. vulgaris (BvCYP772; XP_010694692) and C. gigantea (Cgig2_012892) (Supplementary Fig. S4). StCYP77A, BvCYP77A2, and Cgig2_012892 were closer to SlCYP77A2 and SmCYP77A2 than to CYP77A4 and CYP77A6 proteins, suggesting that StCYP77A (TRINITY_DN17030_c0_g1_i2) could correspond to a CYP77A2 protein.

    Five CYP77A are present in the Arabidopsis genome, named CYP77A4, CYP77A5, CYP77A6, CYP77A7, and CYP77A9, but their role in cuticle biosynthesis has only been reported for CYP77A4 and CYP77A6[72]. It has been suggested that CYP77A2 from eggplant (S. torvum) could contribute to the defense against fungal phytopathogen infection by the synthesis of specific compounds[13]. In pepper fruit (C. annuum), the expression pattern of CYP77A2 (A0A1U8GYB0) and ABCG11 (LOC107862760) suggests a role of CYP77A2 and ABCG11 in cutin biosynthesis at the early stages of pepper fruit development[14].

    In the case of the protein encoded by StGDSL1 (354 aa), the length found in this work is similar to the length of its homologous from Arabidopsis (AT3G16370) and tomato (Solyc03g121180) (Supplementary Fig. S5). A GDSL1 protein named CD1 polymerizes midchain oxygenated ω-OH long-chain fatty acids to form the cutin polyester in the extracellular space of tomato fruit peel[20,21]. It has been suggested that the 25-amino acid N-signal peptide found in StGDSL1 (Supplementary Fig. S5), previously reported in GDSL1 from Arabidopsis, B. napus, and tomato, plays a role during the protein exportation to the extracellular space[21,73].

    A higher expression of StCYP77A, StGDSL1, and StABCG11 is shown at the 10 and 20 DAF of sweet pitaya fruit development (Fig. 7), suggesting the active cuticle biosynthesis at the early stages of sweet pitaya fruit development. In agreement with that, two genes coding for GDSL lipase/hydrolases from tomato named SGN-U583101 and SGN-U579520 are highly expressed in the early stages and during the expansion stages of tomato fruit development, but their expression decreases in later stages[74]. It has been shown that the expression of GDSL genes, like CD1 from tomato, is higher in growing fruit[20,21]. Like tomato, the increase in expression of StCYP77A and StGDSL1 shown in pitaya fruit development could be due to an increase in cuticle deposition during the expansion of the fruit epidermis[20].

    The phylogenetic analysis, the functional domains, and the six transmembrane helices found in the StABCG11 predicted protein (Supplementary Fig. S6), suggests that it is an ABCG plasma membrane transporter of sweet pitaya[15]. Indeed, an increased expression of StABCG11 at 40 DAF was recorded in the present study (Fig. 7). Further, this data strongly suggests that it could be playing a relevant role in the transport of cuticle components at the beginning and during sweet pitaya fruit ripening.

    In Arabidopsis, ABCG11 (WBC11) exports cuticular wax and cutin compounds from the plasma membrane[15,75]. It has been reported that a high expression of the ABC plasma membrane transporter from mango MiWBC11 correlates with a higher cuticle deposition during fruit development[7]. The expression pattern for StABCG11, StCYP77A, and StGDSL1 suggests a role of StABCG11 as a cutin compound transporter in the earlier stages of sweet pitaya fruit development (Fig. 7). Further, its increase at 40 DAF suggests that it could be transporting cuticle compounds other than oxygenated long-chain fatty acids, or long-chain fatty acids that are not synthesized by StCYP77A and StGDSL1 in the later stages of fruit development.

    Like sweet pitaya, during sweet cherry fruit (Prunus avium) development, the expression of PaWCB11, homologous to AtABCG11 (AT1G17840), increases at the earlier stages of fruit development decreases at the intermediate stages, and increases again at the later stages[76]. PaWCB11 expression correlated with cuticle membrane deposition at the earlier and intermediate stages of sweet cherry fruit development but not at the later[76]. The increased expression of StABCG11 found in the present study could be due to the increased transport of cuticular wax compounds, such as VLC fatty acids and their derivates, in the later stages of sweet pitaya development[15,75].

    Cuticular waxes make up the smallest amount of the fruit cuticle. Even so, they mainly contribute to the impermeability of the fruit's epidermis[3]. An increase in the transport of cuticular waxes at the beginning of the ripening stage carried out by ABCG transporters could be due to a greater need to avoid water loss and to maintain an adequate amount of water during the ripening of the sweet pitaya fruit. Nevertheless, further expression analysis of cuticular wax biosynthesis-related genes, complemented with chemical composition analysis of cuticles could contribute to elucidating the molecular mechanism of cuticle biosynthesis in cacti and their physiological contribution during fruit development.

    In this study, the transcriptome of the sweet pitaya (S. thurberi) fruit peel was assembled for the first time. The reference genes found here are a helpful tool for further gene expression analysis in sweet pitaya fruit. Transcripts tentatively involved in cuticle compound biosynthesis and transport are reported for the first time in sweet pitaya. The results suggest a relevant role of cuticle compound biosynthesis and transport at the early and later stages of fruit development. The information generated will help to improve the elucidation of the molecular mechanism of cuticle biosynthesis in S. thurberi and other cactus species in the future. Understanding the cuticle's physiological function in the adaptation of the Cactaceae family to harsh environmental conditions could help design strategies to increase the resistance of other species to face the increase in water scarcity for agricultural production predicted for the following years.

    The authors confirm contribution to the paper as follows: study conception and design: Tiznado-Hernández ME, Tafolla-Arellano JC, García-Coronado H, Hernández-Oñate MÁ; data collection: Tiznado-Hernández ME, Tafolla-Arellano JC, García-Coronado H, Hernández-Oñate MÁ; analysis and interpretation of results: Tiznado-Hernández ME, García-Coronado H, Hernández-Oñate MÁ, Burgara-Estrella AJ; draft manuscript preparation: Tiznado-Hernández ME, García-Coronado H. All authors reviewed the results and approved the final version of the manuscript.

    All data generated or analyzed during this study are included in this published article and its supplementary information files. The sequence data can be accessed at the Sequence Read Archive (SRA) repository of the NCBI through the BioProject ID PRJNA1030439.

    The authors wish to acknowledge the financial support of Consejo Nacional de Humanidades, Ciencias y Tecnologías de México (CONAHCYT) through project number 579: Elucidación del Mecanismo Molecular de Biosíntesis de Cutícula Utilizando como Modelo Frutas Tropicales. We appreciate the University of Arizona Genetics Core and Illumina for providing reagents and equipment for library sequencing. The author, Heriberto García-Coronado (CVU 490952), thanks the CONAHCYT (acronym in Spanish) for the Ph.D. scholarship assigned (749341). The author, Heriberto García-Coronado, thanks Dr. Edmundo Domínguez-Rosas for the technical support in bioinformatics for identifying long non-coding RNA.

  • The authors declare that they have no conflict of interest.

  • [1]

    López-Gutiérrez N, Aguilera-Luiz MD, Romero-González R, Vidal JL, Garrido Frenich A. 2014. Fast analysis of polyphenols in royal jelly products using automated TurboFlow™-liquid chromatography-Orbitrap high resolution mass spectrometry. Journal of Chromatography B, Analytical Technologies in the Biomedical and Life Sciences 973:17−28

    doi: 10.1016/j.jchromb.2014.09.038

    CrossRef   Google Scholar

    [2]

    Pasupuleti VR, Sammugam L, Ramesh N, Gan SH. 2017. Honey, Propolis, and Royal Jelly: A Comprehensive Review of Their Biological Actions and Health Benefits. Oxidative Medicine and Cellular Longevity 2017:1259510

    doi: 10.1155/2017/1259510

    CrossRef   Google Scholar

    [3]

    Spannhoff A, Kim YK, Raynal NJM, Gharibyan V, Su MB, et al. 2011. Histone deacetylase inhibitor activity in royal jelly might facilitate caste switching in bees. EMBO Reports 12:238−43

    doi: 10.1038/embor.2011.9

    CrossRef   Google Scholar

    [4]

    Page RE, Jr., Peng CY. 2001. Aging and development in social insects with emphasis on the honey bee, Apis mellifera L. Experimental Gerontology 36:695−711

    doi: 10.1016/S0531-5565(00)00236-9

    CrossRef   Google Scholar

    [5]

    Drapeau MD, Albert S, Kucharski R, Prusko C, Maleszka R. 2006. Evolution of the Yellow/Major Royal Jelly Protein family and the emergence of social behavior in honey bees. Genome Research 16:1385−94

    doi: 10.1101/gr.5012006

    CrossRef   Google Scholar

    [6]

    Groh C, Rössler W. 2008. Caste-specific postembryonic development of primary and secondary olfactory centers in the female honeybee brain. Arthropod Structure & Development 37:459−68

    doi: 10.1016/j.asd.2008.04.001

    CrossRef   Google Scholar

    [7]

    Maleszka R. 2008. Epigenetic integration of environmental and genomic signals in honey bees: the critical interplay of nutritional, brain and reproductive networks. Epigenetics 3:188−92

    doi: 10.4161/epi.3.4.6697

    CrossRef   Google Scholar

    [8]

    Kucharski R, Maleszka J, Foret S, Maleszka R. 2008. Nutritional control of reproductive status in honeybees via DNA methylation. Science 319:1827−30

    doi: 10.1126/science.1153069

    CrossRef   Google Scholar

    [9]

    Alhosin M. 2023. Epigenetics mechanisms of honeybees: secrets of royal jelly. Epigenetics Insights 16:25168657231213717

    doi: 10.1177/25168657231213717

    CrossRef   Google Scholar

    [10]

    Botezan S, Baci GM, Bagameri L, Pașca C, Dezmirean DS. 2023. Current status of the bioactive properties of royal jelly: a comprehensive review with a focus on its anticancer, anti-inflammatory, and antioxidant effects. Molecules 28:1510

    doi: 10.3390/molecules28031510

    CrossRef   Google Scholar

    [11]

    Kang KA, Piao MJ, Hyun YJ, Zhen AX, Cho SJ, et al. 2019. Luteolin promotes apoptotic cell death via upregulation of Nrf2 expression by DNA demethylase and the interaction of Nrf2 with p53 in human colon cancer cells. Experimental & Molecular Medicine 51:1−14

    doi: 10.1038/s12276-019-0238-y

    CrossRef   Google Scholar

    [12]

    Pramodh S, Raina R, Hussain A, Bagabir SA, Haque S, et al. 2022. Luteolin causes 5'CpG demethylation of the promoters of TSGs and modulates the aberrant histone modifications, restoring the expression of TSGs in human cancer cells. International Journal of Molecular Sciences 23:4067

    doi: 10.3390/ijms23074067

    CrossRef   Google Scholar

    [13]

    Qiu W, Lin J, Zhu Y, Zhang J, Zeng L, et al. 2017. Kaempferol Modulates DNA Methylation and Downregulates DNMT3B in Bladder Cancer. Cellular Physiology and Biochemistry 41:1325−35

    doi: 10.1159/000464435

    CrossRef   Google Scholar

    [14]

    Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, et al. 2018. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research 46:W296−W303

    doi: 10.1093/nar/gky427

    CrossRef   Google Scholar

    [15]

    Altayb HN. 2022. Fludarabine, a potential DNA-dependent RNA polymerase inhibitor, as a prospective drug against monkeypox virus: a computational approach. Pharmaceuticals 15:1129

    doi: 10.3390/ph15091129

    CrossRef   Google Scholar

    [16]

    Ayipo YO, Alananzeh WA, Ahmad I, Patel H, Mordi MN. 2023. Structural modelling and in silico pharmacology of β-carboline alkaloids as potent 5-HT1A receptor antagonists and reuptake inhibitors. Journal of Biomolecular Structure & Dynamics 41:6219−35

    doi: 10.1080/07391102.2022.2104376

    CrossRef   Google Scholar

    [17]

    Zhang ZM, Lu R, Wang P, Yu Y, Chen D, et al. 2018. Structural basis for DNMT3A-mediated de novo DNA methylation. Nature 554:387−91

    doi: 10.1038/nature25477

    CrossRef   Google Scholar

    [18]

    Bronner C, Alhosin M, Hamiche A, Mousli M. 2019. Coordinated dialogue between UHRF1 and DNMT1 to ensure faithful inheritance of methylated DNA patterns. Genes 10:65

    doi: 10.3390/genes10010065

    CrossRef   Google Scholar

    [19]

    Wang Y, Jorda M, Jones PL, Maleszka R, Ling X, et al. 2006. Functional CpG methylation system in a social insect. Science 314:645−47

    doi: 10.1126/science.1135213

    CrossRef   Google Scholar

    [20]

    Goll MG, Bestor TH. 2005. Eukaryotic cytosine methyltransferases. Annu Rev Biochem 74:481−514

    doi: 10.1146/annurev.biochem.74.010904.153721

    CrossRef   Google Scholar

    [21]

    Chen Z, Zhang Y. 2020. Role of Mammalian DNA Methyltransferases in Development. Annual Review of Biochemistry 89:135−58

    doi: 10.1146/annurev-biochem-103019-102815

    CrossRef   Google Scholar

    [22]

    Smith ZD, Chan MM, Mikkelsen TS, Gu H, Gnirke A, et al. 2012. A unique regulatory phase of DNA methylation in the early mammalian embryo. Nature 484:339−44

    doi: 10.1038/nature10960

    CrossRef   Google Scholar

    [23]

    Shi YY, Huang ZY, Zeng ZJ, Wang ZL, Wu XB, et al. 2011. Diet and cell size both affect queen-worker differentiation through DNA methylation in honey bees (Apis mellifera, Apidae). PLoS One 6:e18808

    doi: 10.1371/journal.pone.0018808

    CrossRef   Google Scholar

    [24]

    Lu Q, Shu Y, Wang L, Li G, Zhang S, et al. 2021. The protective effect of Veronica ciliata Fisch. Extracts on relieving oxidative stress-induced liver injury via activating AMPK/p62/Nrf2 pathway. Journal of Ethnopharmacology 270:113775

    doi: 10.1016/j.jep.2021.113775

    CrossRef   Google Scholar

    [25]

    De Leo E, Elmonem MA, Berlingerio SP, Berquez M, Festa BP, et al. 2020. Cell-based phenotypic drug screening identifies luteolin as candidate therapeutic for Nephropathic Cystinosis. Journal of the American Society of Nephrology 31:1522−37

    doi: 10.1681/ASN.2019090956

    CrossRef   Google Scholar

    [26]

    Liu YS, Yang Q, Li S, Luo L, Liu HY, et al. 2021. Luteolin attenuates angiotensin II-induced renal damage in apolipoprotein E-deficient mice. Molecular Medicine Reports 23:157

    doi: 10.3892/mmr.2020.11796

    CrossRef   Google Scholar

    [27]

    Gao L, Emperle M, Guo Y, Grimm SA, Ren W, et al. 2020. Comprehensive structure-function characterization of DNMT3B and DNMT3A reveals distinctive de novo DNA methylation mechanisms. Nature Communications 11:3355

    doi: 10.1038/s41467-020-17109-4

    CrossRef   Google Scholar

    [28]

    Aoki A, Suetake I, Miyagawa J, Fujio T, Chijiwa T, et al. 2001. Enzymatic properties of de novo-type mouse DNA (cytosine-5) methyltransferases. Nucleic Acids Research 29:3506−12

    doi: 10.1093/nar/29.17.3506

    CrossRef   Google Scholar

    [29]

    Handa V, Jeltsch A. 2005. Profound flanking sequence preference of Dnmt3a and Dnmt3b mammalian DNA methyltransferases shape the human epigenome. Journal of Molecular Biology 348:1103−12

    doi: 10.1016/j.jmb.2005.02.044

    CrossRef   Google Scholar

    [30]

    Shivanika C, Deepak Kumar S, Ragunathan V, Tiwari P, Sumitha A, et al. 2022. Molecular docking, validation, dynamics simulations, and pharmacokinetic prediction of natural compounds against the SARS-CoV-2 main-protease. Journal of Biomolecular Structure and Dynamics 40:585−611

    doi: 10.1080/07391102.2020.1815584

    CrossRef   Google Scholar

    [31]

    Suetake I, Mishima Y, Kimura H, Lee YH, Goto Y, et al. 2011. Characterization of DNA-binding activity in the N-terminal domain of the DNA methyltransferase Dnmt3a. The Biochemical Journal 437:141−48

    doi: 10.1042/BJ20110241

    CrossRef   Google Scholar

    [32]

    Ren W, Gao L, Song J. 2018. Structural Basis of DNMT1 and DNMT3A-Mediated DNA Methylation. Genes 9:620

    doi: 10.3390/genes9120620

    CrossRef   Google Scholar

    [33]

    Ghanbari E, Khazaei MR, Khazaei M, Nejati V. 2018. Royal Jelly Promotes Ovarian Follicles Growth and Increases Steroid Hormones in Immature Rats. International Journal of Fertility & Sterility 11:263−69

    doi: 10.22074/ijfs.2018.5156

    CrossRef   Google Scholar

    [34]

    Khazaei F, Ghanbari E, Khazaei M. 2021. Improved hormonal and oxidative changes by Royal Jelly in the rat model of PCOS: An experimental study. International Journal of Reproductive Biomedicine 19:515−24

    doi: 10.18502/ijrm.v19i6.9373

    CrossRef   Google Scholar

  • Cite this article

    Alhosin M. 2024. Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein. Epigenetics Insights 17: e001 doi: 10.48130/epi-0024-0001
    Alhosin M. 2024. Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein. Epigenetics Insights 17: e001 doi: 10.48130/epi-0024-0001

Figures(5)  /  Tables(2)

Article Metrics

Article views(1117) PDF downloads(215)

Other Articles By Authors

ARTICLE   Open Access    

Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein

Epigenetics Insights  17 Article number: e001  (2024)  |  Cite this article

Abstract: Honeybees use royal jelly-controlled DNMT3-mediated epigenetic mechanisms to produce two distinct female castes, a long-lived fertile queen and a short-lived sterile worker. DNMT3 inhibition in larvae mimics the effect of royal jelly in terms of phenotypic changes that occur in adult female bees. A key question to be addressed in the honeybee genome is to identify epigenetically active compounds in royal jelly that inhibit DNMT3 and thereby determine developmental fate. Molecular docking, MMGBSA analysis, and MD simulation were performed to identify the lead candidate polyphenolic compounds from royal jelly that inhibit DNMT3. Thirteen polyphenolic compounds were docked to DNMT3 and two basic metrics, XP GScore and MMGBSA dG Bind, were used to evaluate the binding affinity. The highest binding affinity was observed for luteolin 7-O-glucoside with a docking score of −10.3 and kaempferol 3-O-glucoside with −8.9. Furthermore, the two compounds exhibited high total binding energies of −52.8 and −64.85 kJ/mol, respectively. MD simulations show that, unlike kaempferol 3-O-glucoside, luteolin-7-O-glucoside maintains a consistent interaction with the DNMT3 throughout the simulation period. These results suggest that of the 13 polyphenolic compounds in royal jelly, luteolin-7-O-glucoside is the most promising candidate to be the component responsible for most of the DNMT3 inhibitory activity in this diet.

    • Both larvae and queens are fed royal jelly, which is produced by the hypopharyngeal and mandibular glands of young honeybees (Apis mellifera) in the colony[1,2]. Royal jelly is fed to all larvae during the first three days of their development. After this short period, worker bees switch to their special diet of pollen, honey, and nectar, while queen larvae continue to consume large amounts of royal jelly throughout their adult lives[3,4]. This differential feeding produces two different female castes, a long-lived queen and a short-lived worker. Interestingly, the worker bees are smaller and functionally sterile, whereas the queen is the largest member of the colony and has fully developed ovaries[57]. This phenotypic polymorphism in female honeybees is generated from two identical genomes by diet-controlled epigenetic mechanisms, mainly DNMT3-mediated DNA methylation[3,8,9].

      Inhibition of DNMT3 in larvae resulted in 72% of adult bees becoming queens with fully developed ovaries, similar to those of queens reared on pure royal jelly in the hive, suggesting that DNMT3 inhibition induces royal jelly-like effects on the caste phenotype of honeybees[8]. This suggests that one or more of the biologically active components of royal jelly may specifically inhibit DNMT3. Thus, one of the key questions to be addressed in the honeybee genome is to identify the epigenetically active compounds in royal jelly that inhibit DNMT3 and thereby determine developmental fate.

      In addition to proteins, vitamins, mineral salts, lipids, enzymes, and carbohydrates, royal jelly contains small amounts of polyphenols, including derivatives of luteolin and kaempferol (Table 1), ranging from 14 to 18,936 μg/kg[1,10]. DNMT3 is a target of luteolin in its mechanistic action against human cancer cells[11,12] and of kaempferol in a mouse model of bladder cancer cells[13], suggesting that such an effect (i.e. DNMT3 inhibition) may also occur in honeybees. Therefore, inhibition of the Apis mellifera DNMT3 activity and/or expression by one or more of royal jelly’s polyphenols would regulate the expression of key genes for larval development.

      Table 1.  Molecular docking of 13 polyphenolic compounds of royal jelly with the Apis mellifera DNMT3 protein.

      Product PubChem ID mol MW XP GScore MMGBSA dG Bind (kcal/mol)
      1 Luteolin-7-O-glucoside 5280637 448.382 −10.39 −52.8
      2 Luteolin-4-O-glucoside 12304737 448.382 −10.27 −47.9
      3 Kaempferol 3-O-glucoside 5282102 448.382 −8.9 −64.85
      4 Isorhamnetin 5281654 316.267 −7.42 −43.55
      5 Hesperetin 72281 302.283 −7.42 −43.55
      6 Quercetin 5280343 302.24 −7.1 −43.56
      7 Pinobanksin 73202 272.257 −5.9 −37.06
      8 Sakuranetin 73571 286.284 −5.76 −40.08
      9 Chrysin 5281607 254.242 −5.7 −42.8
      10 Naringenin 439246 272.257 −5.7 −41.05
      11 Coumestrol 5281707 268.225 −5.68 −41.93
      12 Genistein 5280961 270.241 −5.28 −35.7
      13 Acacetin 5280442 284.268 −5.11 −37.58

      Molecular docking, MMGBSA analysis, and MD simulation were carried out to identify the lead candidate polyphenolic compounds from royal jelly that can inhibit the DNMT3 protein. The binding affinity of 13 polyphenolic compounds in royal jelly for the Apis mellifera DNMT3 was evaluated using two basic metrics, XP GScore and MMGBSA dG Bind. The highest binding affinity was observed for luteolin-7-O-glucoside with a docking score of −10.3 and kaempferol-3-O-glucoside with −8.9. Furthermore, luteolin-7-O-glucoside and kaempferol-3-O-glucoside showed high total binding energies of −52.8 and −64.85 kJ/mol, respectively. MD simulations show that luteolin-7-O-glucoside maintains a consistent interaction with the DNMT3 protein throughout the simulation period. The compound luteolin-7-O-glucoside stands out as the most promising candidate and is likely to be the polyphenolic component of royal jelly responsible for most of the Apis mellifera DNMT3 inhibitory activity in this diet.

    • Homology modeling was conducted using the SWISS-MODEL server[14]. The sequences of DNA methyltransferase 3 [Apis mellifera] (ID: ADH84015.1) were obtained from NCBI. The quality of the generated models was evaluated using SAVESv6.1 − Structure Validation Server (https://saves.mbi.ucla.edu/).

    • The DNMT3 protein structure and compounds were prepared for the docking process by atom bonds assignment, the addition of hydrogen, and energy minimization. The active site was then determined using SiteMap module (Schrodinger suite). Extra Precision docking protocol of Maestro (Schrödinger, LLC, New York, NY, 2020) was used to study the possible interaction between compounds and proteins.

    • The MMGBSA dG Bind (Molecular Mechanics Generalized Born Surface Area) is another method used to predict the free energy of binding of ligands to their target proteins. It provides a more refined estimate of binding free energies by incorporating solvation effects and entropic contributions. The MMGBSA module of Maestro (Schrödinger, LLC, New York, NY, 2020) was used for the calculations, with poses generated from XP docking used as input.

    • Molecular dynamic simulation was used to study the stability of the best compounds with DNMT3, using the Maestro Desmond module, 50 ns run time. More details on MD simulation methods as in previous published work[15].

    • The binding affinity of 13 polyphenolic compounds in royal jelly for the Apis mellifera DNMT3 was evaluated using two basic metrics, the docking score XP GScore and MMGBSA dG Bind to find the most promising compounds acting as inhibitors of the DNMT3 protein in honeybees. More negative values of both XP GScore and MMGBSA dG Bind indicates higher binding affinity[16]. The XP GScore values generated from the DNMT3 protein docking ranged from −10.39 to −4.2 (Table 1). With an XP GScore of −10.39, compound luteolin-7-O-glucoside has the highest binding affinity to DNMT3, indicating that compound luteolin-7-O-glucoside has a much higher binding affinity to DNMT3 than the other compounds investigated (Table 1). The MMGBSA dG Bind values varied from −64.85 to −23.33 kcal/mol, demonstrating a wide range of binding affinities. Luteolin-7-O-glucoside has ranked second among the 13 polyphenolic compounds in royal jelly in terms of the binding free energy (MMGBSA dG Bind of −52.8 kcal/mol) with DNMT3, consistent with its highest XP GScore of −10.39 (Table 1). Figure 1 shows the 3D interaction of DNMT3 with luteolin-4-O-glucoside during the induced fit docking process. MD simulations reveal the interactions of DNMT3 with luteolin-7-O-glucoside throughout 50 ns, providing insight into the binding dynamics and stability of the complex (Fig. 2). The RMSD plot provides a clear representation of the root mean square deviation (RMSD) for both the ligand (luteolin-7-O-glucoside) and the protein backbone (DNMT3) throughout the simulation (Fig. 2). The blue line, representing the protein backbone, initially shows fluctuations but gradually stabilizes around the 30 ns mark, suggesting that the protein conformation reaches a relatively stable state. Similarly, the red line, corresponding to the ligand, shows fluctuations indicative of interaction dynamics and eventually stabilizes in tandem with the protein backbone. This stabilization indicates that the luteolin-7-O-glucoside maintains a consistent interaction with the DNMT3 protein throughout the simulation period (Fig. 2).

      Figure 1. 

      3D interaction of DNMT3 with luteolin-4-O-glucoside. The compound is shown in the center with balls in different colors.

      Figure 2. 

      MD simulation result of the interaction of DNMT3 and luteolin-4-O-glucoside during 50 ns simulation period. The RMSD showing the interaction of the ligand (red) with the protein backbone (blue).

      The interaction histogram provides a quantitative summary of the percentage of interactions between luteolin-7-O-glucoside and the different residues of the DNMT3 protein (Fig. 3a). Key residues such as TYR 11, ILE 13, GLU 15, PHE 30, ASP 86, PHE 90, and TYR 93 show high interaction percentages, indicating their important role in DNMT3 binding to luteolin-7-O-glucoside. The histogram categorizes these interactions, with color coding to distinguish types such as hydrogen bonds and hydrophobic interactions. Green bars indicate hydrogen bonds, purple bars hydrophobic interactions and blue water bridges. This visual representation highlights the importance of specific residues in maintaining the binding affinity of luteolin-7-O-glucoside to DNMT3 (Fig. 3a).

      Figure 3. 

      MD simulation result of the interaction of DNMT3 and luteolin-4-O-glucoside. (a) Histogram showing the percentage of interacted residues with the compound, green bars indicate hydrogen bonds, purple bars hydrophobic interactions and blue water bridges. (b) 2D interaction of luteolin-4-O-glucoside with DNMT3 protein residues, residues colored according to charge, hydrogen bonds in violet and hydrophobic bonds in green.

      The 2D interaction diagram further illustrates the specific interactions between DNMT3 residues and luteolin-7-O-glucoside (Fig. 3b). Residues are color-coded according to their charge properties, providing a clear visual distinction. Hydrogen bonds, shown in violet, highlight significant binding interactions with residues such as TYR 11, TYR 93, ASP 86, and GLU 15 of the DNMT3 protein, emphasizing their essential role in binding to luteolin-7-O-glucoside. As shown in Fig. 2c, TYR11 forms a hydrogen bond 52% of the time, ILE13 67%, TYR93 80%, ASP 86 60%, GLU15 58%, and PHE30 73% of the time. Hydrophobic interactions, shown in green, reveal interactions with residues such as PHE 90, indicating areas where the ligand interacts with non-polar regions of the DNMT3 protein (Fig. 3b). This detailed diagram provides a comprehensive view of the binding interface, showing the different types of interactions that stabilize the ligand-protein complex. They highlight the stability of the complex and the specific residues involved in maintaining this stability, providing valuable insights into the molecular interactions involved.

    • In the present study, the compound kaempferol-3-O-glucoside has ranked second in terms of binding affinity with an XP GScore = −8.9 and first in terms of the binding free energy (MMGBSA dG Bind = −64.85 kcal/mol) (Table 1). Figure 4 shows detailed interactions between DNMT3 and the kaempferol-3-O-glucoside compound during the simulation period. Initially, from 0 to 10 ns, the RMSD gradually increases, indicating that the protein is undergoing some structural adjustments as it stabilizes. Between 10 and 30 ns, the RMSD values stabilize around 3−4 Å, indicating that the protein has reached a relatively stable conformation. In the final phase, from 30 to 50 ns, there are slight fluctuations around 4−5 Å, showing that the protein retains some conformational flexibility even after stabilization. For the ligand backbones, residues remained with the protein backbones, indicating the stability of the ligand within the binding pocket of the protein (Fig. 4).

      Figure 4. 

      MD simulation result of the interaction of DNMT3 and kaempferol-3-O-glucoside during 50 ns simulation period. The RMSD showing the interaction of the ligand (red) with the protein backbone (blue).

      The histogram in Fig. 5a provides further details on the types of interactions that occur between the DNMT3 protein and kaempferol-3-O-glucoside. Key residues such as TRP12, ALA49, and ILE50 show significant hydrophobic interactions. Residues such as TYR 11, ILE 13. ARG37 and TYR 93 are predominantly involved in hydrogen bonding. This diversity of interactions highlights the complex nature of the binding affinity between DNMT3 and kaempferol-3-O-glucoside. Figure 5b details the percentage of residues associated with ligand binding during the simulation period. For example, TYR11 forms a hydrogen bond 76% of the time, ILE13 52%, ARG37 60%, and TYR93 56% of the time. These hydrogen bonds are crucial for the stability and specificity of the ligand binding. In addition, residues such as ILE50, ALA49, and TRP12 are involved in significant hydrophobic interactions that contribute to the overall stability of the ligand within the binding pocket.

      Figure 5. 

      MD simulation result of the interaction of DNMT3 and kaempferol-3-O-glucoside. (a) Histogram showing the percentage of interacted residues with the compound, green bars indicate hydrogen bonds, purple bars hydrophobic interactions, and blue water bridges. (b) 2D interaction of luteolin-4-O-glucoside with DNMT3 protein residues, residues colored according to charge, hydrogen bonds in violet and hydrophobic bonds in green

    • Hemi-methylated DNA produced during DNA replication is specifically targeted by DNMT1 to maintain genomic methylation, while DNMT3A and DNMT3B methylate the cytosine of unmethylated CpG sites on both DNA strands to perform de novo DNA methylation[17,18]. DNMT3-mediated DNA methylation is required for development[17,1922] and is also essential for phenotypic changes in adult female bees in response to nutritional input (i.e. royal jelly)[8].

      Experimentally, inhibiting DNMT3 has provided important clues to understanding its physiological and pathophysiological roles. When DNMT3 is inhibited with siRNA in larvae, 72% of adult bees become queens with fully developed ovaries identical to those of queens reared on pure royal jelly in the hive[8], suggesting that DNMT3 inhibition mimics the effect of royal jelly on caste phenotype. The present study aimed to identify the lead candidate polyphenolic compounds from royal jelly that can inhibit the Apis mellifera DNMT3 protein. The two basic metrics, XP GScore and MMGBSA dG Bind, were used to assess binding affinity. Of the13 polyphenolic compounds in royal jelly docked to DNMT3 protein, the compounds luteolin-7-O-glucoside and kaempferol-3-O-glucoside appear to be promising candidates for inhibition of DNMT3 activity.

      The differential feeding with royal jelly for genetically identical larvae generated two distinct female castes, fertile queens and sterile workers[15,7]. Interestingly, silencing DNMT3 expression in newly emerged larvae had a royal jelly-like effect on larval development, with most DNMT3-depleted individuals emerging as queens with fully developed ovaries[8]. These observations are an indication that royal jelly has biologically active compounds that specifically inhibit DNMT3.

      Royal jelly contains small amounts of polyphenols (Table 1), ranging from 14 to 18,936 μg/kg[1]. Of the 13 polyphenolic compounds in royal jelly docked to the Apis mellifera DNMT3, luteolin-7-O-glucoside and kaempferol-3-O-glucoside were the highest in terms of binding affinity and total binding energy (Table 1), indicating that the two compounds could be promising inhibitors of the DNMT3 protein. In support of this, luteolin was shown to decrease the expression of DNMT3A and DNMT3B proteins in human colon cancer cells[11], and in Hela cells[12].

      A major target of DNMT3-mediated DNA methylation in honeybees is the dynactin p62 gene[8,23]. The larvae fed royal jelly for long periods showed reduced activity and expression of DNMT3, together with reduced overall methylation of dynactin p62[23]. Interestingly, as a result of dynactin p62-related downstream molecular events, all emerging adults were queens, suggesting an important role for DNMT3-mediated dynactin p62 methylation in larval development. This also suggests that one or more epigenetically active polyphenols in royal jelly modulate dynactin p62 methylation. In support of this idea, luteolin has been shown to target the dynactin p62 gene in several experimental models[2426].

      The present study showed that kaempferol-3-O-glucoside is also a promising candidate for inhibition of the Apis mellifera DNMT3. Supporting this conclusion, kaempferol was shown to specifically inhibit and degrade DNMT3B protein in mouse model of bladder cancer without affecting DNMT3A or DNMT1 expression, suggesting that kaempferol is a specific inhibitor of DNMT3B[13]. Interestingly, the specific inhibition of DNMT3B by kaempferol resulted in the modulation of DNA methylation at specific regions[13]. Considering that DNMT3A and DNMT3B have different preferences for flanking sequences of CpG target sites[2729], the selective inactivation of Apis mellifera DNMT3B by kaempferol may result in different DNA methylation patterns, further enhancing the effects of luteolin in establishing an epigenetic state necessary for larval development into a queen.

      Binding efficiency and inhibition increased with increasing the number of hydrogen bonds formed between the ligand and the target protein[30]. Luteolin 7-O-glucoside formed six hydrogen bonds with residues namely TYR 11, ILE 13, GLU 15, PHE 30, ASP 86, and TYR 93 (Fig. 3b), whereas kaempferol 3-O-glucoside formed four hydrogen bonds with residues TYR 11, ILE 13. ARG37, and TYR 93 (Fig. 5b; Table 2).

      Table 2.  Interactions and binding energies of luteolin-4-O-glucoside and kaempferol-3-O-glucoside with the Apis mellifera DNMT3.

      Product Structure No. of hydrogen bonds XP GScore MMGBSA dG Bind (kcal/mol) Hydrogen bond interactions
      Luteolin-7-O-glucoside 6 −10.39 −52.8 TYR 11, ILE 13, GLU 15, PHE 30,
      ASP 86, and TYR 93
      Kaempferol 3-O-glucoside 4 −8.9 −64.85 TYR 11, ILE 13. ARG37, and TYR 93

      The honeybee genome encodes only one DNMT3 protein, consisting of 758 amino acids, whose catalytic domains have a high similarity to human DNMT3A and human DNMT3B, reaching 61% and 66% respectively[19]. The present study showed that luteolin-7-O-glucoside and kaempferol-3-O-glucoside bind to several residues located in the N-terminal domain of the Apis mellifera DNMT3, which contains the DNA-binding domain[19]. The binding of the polyphenolic compounds to the N-terminal domain of the DNMT3 could lead to a decrease in its DNA methyltransferase activity. This conclusion is supported by the fact that the DNA-binding activity of the N-terminal domain of human DNMT3A contributes to the DNA methyltransferase activity of this enzyme[31,32]. Interestingly, human DNMT3A showed high DNA binding and DNA methylation activities, while no such activities were observed with the other isoform, DNMT3A2, which is also encoded by the DNMT3A gene but lacks the N-terminal 219 amino acid residues[31].

      Luteolin-7-O-glucoside and kaempferol-3-O-glucoside showed the highest binding affinity and the highest total binding energies among the 13 polyphenolic compounds in royal jelly docked to the DNMT3 protein. The compound luteolin-7-O-glucoside appears to be the most promising candidate for inhibiting DNMT3 activity in honeybees. This could be attributed to 1) its highest docking score (XP GScore −10.39), 2) the increase in its hydrogen bonding with DNMT3 (six bonds), 3) the maintenance of a consistent interaction with the DNMT3 protein throughout the simulation period, and 4) a high binding free energy, second only to kaempferol-3-O-glucoside (MMGBSA dG Bind = −52.8 kcal/mol).

    • The production of queens with fully developed ovaries when DNMT3 is inhibited in the larvae provides strong evidence that royal jelly contains epigenetically active compounds that act as inhibitors of DNMT3 to create and maintain the epigenetic state necessary in the developing larvae to produce a fertile queen. To date, the epigenetically active compounds in royal jelly with inhibitory effects on DNMT3 are unknown. The present study was designed to identify the lead candidate polyphenolic compounds from royal jelly that can inhibit the DNMT3 protein. Thirteen polyphenolic compounds in royal jelly were docked to the Apis mellifera DNMT3 protein. Of the 13 compounds docked, the top two compounds with high binding energies were luteolin-7-O-glucoside and kaempferol-3-O-glucoside. Luteolin-7-O-glucoside stands out as the most promising candidate and is likely to be the polyphenolic component of royal jelly responsible for most of the DNMT3 inhibitory activity in this diet, thereby determining developmental fate. To confirm these predictions, the effects of a special diet consisting of worker jelly rich in luteolin-7-O-glucoside on the development of larvae into adult bees need to be studied to elucidate whether such a diet rich in luteolin-7-O-glucoside mimics the effect of royal jelly in terms of DNMT3-related phenotypic changes that occur in adult female bees.

      Royal jelly is widely used as a dietary supplement in alternative medicine for the treatment of various conditions, including infertility. Some animal studies have shown that royal jelly may affect the female reproductive system[33,34]. It will therefore be of great interest to evaluate whether the well-documented role of royal jelly in the development of larvae into queens and the beneficial effects of this diet on the reproductive system in female animals also applies to humans, particularly in the context of drug treatment of female infertility.

    • Not applicable.

    • The author confirms sole responsibility for the following: study conception and design, data collection, analysis and interpretation of results, and manuscript preparation.

    • The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

      • The author declares that there is no conflict of interest.

      • © 2024 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (5)  Table (2) References (34)
  • About this article
    Cite this article
    Alhosin M. 2024. Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein. Epigenetics Insights 17: e001 doi: 10.48130/epi-0024-0001
    Alhosin M. 2024. Luteolin-7-O-glucoside and kaempferol 3-O-glucoside are candidate inhibitors of the Apis mellifera DNMT3 protein. Epigenetics Insights 17: e001 doi: 10.48130/epi-0024-0001

Catalog

  • About this article

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return