The MADS-box gene family has undergone extensive examination in various plant species, including Arabidopsis, rice, longan, lychee, and sugarcane[2,5,7,8,36], as advancements in sequencing technologies have led to the sequencing of an increasing number of plant genomes. Previous investigations have highlighted the diverse roles played by MADS-box transcription factors in plant growth, development, and responses to environmental stresses. However, limited information exists regarding the characterization of MADS-box genes in rambutan. Therefore, conducting a comprehensive analysis of NlMADS gene family members and their expression patterns across various tissues and developmental stages would prove invaluable for furthering our understanding of the molecular mechanisms governing rambutan growth and development, as well as for exploring their potential applications in rambutan breeding efforts.
In this study, 75 NlMADS genes were identified in rambutan for the first time. These genes were classified into two types: Type I, comprising Mα (24), Mβ (10), and Mγ (16) genes, and Type II, consisting of MIKC* (6) and MIKCC (19) genes. The MIKCC-type genes were further divided into 10 subfamilies based on phylogenetic relationships to shed light on the functional evolution of NlMADS genes. Although the number of NlMADS genes in rambutan is lower than in certain species like Arabidopsis (107), longan (114), lychee (94), and sugarcane (182), it is higher than in many other species such as pineapple (48) and cucumber (43)[4,37]. This disparity may be due to events like genome or gene duplication, which can lead to neofunctionalization, subfunctionalization, or nonfunctionalization[38]. Notably, the number of Type I genes surpassed that of Type II, aligning with observations in other species such as Arabidopsis (Type I: 68, Type II: 39), longan (Type I: 63, Type II: 51), and lychee (Type I: 56, Type II: 37).
However, gene structure analysis revealed that Type II genes possess a higher number of exons and introns compared to Type I genes, which aligns with findings in Arabidopsis, rice, longan, and lychee[2,5,7,8]. In rambutan, Type I NlMADS genes typically exhibit a simple structure with one or two exons, while Type II NlMADS genes are more complex, containing 2 to 17 exons. Introns play crucial roles in various stages of mRNA processing, including transcription regulation, genome organization, and alternative splicing, thereby influencing gene expression and contributing to the maintenance and regulation of gene function[39]. Introns experience weak selection pressure, potentially leading to rapid evolution in genes lacking introns, whereas genes containing larger or more introns may contribute to the acquisition of new functions during evolution[40]. Consequently, the regulatory mechanisms and functions of Type II genes may exhibit greater variability and complexity compared to Type I genes.
Gene duplications play a crucial role in the evolution of genomes and genetic systems[41]. The analysis of duplicated NlMADS genes in the rambutan genome revealed 14 pairs with Ka/Ks ratios < 1, indicating the importance of purifying selection during NlMADS gene duplication. Cis-acting elements play essential roles in plant development and physiology by regulating gene expression, and their divergence often contributes to evolutionary changes[42]. In this study, the promoter sequences of the NlMADS gene were analyzed to predict several cis-acting elements involved in plant growth and development, phytohormone response, and stress response. This finding is consistent with previous reports in the literature[7,8], indicating a certain level of conservation of cis-acting regulatory elements among different species. Several studies have highlighted the significance of hormones such as MeJA and abscisic acid in plant adaptation to biotic and abiotic stresses[43]. In the study of cis-acting elements, investigating hormone-related cis-regulatory elements (CREs) is crucial. Hormone signaling pathways, mediated by CREs play a pivotal role in regulating various aspects of plant growth and development. For instance, the abscisic acid (ABA) responsive element (ABRE) is critical for the regulation of genes involved in seed dormancy and stress responses[44]. Cis-elements such as the TCA-element (related to SA) and the TGACG motif (related to JA) are important for the activation of defense genes involved in plants' immune responses[45]. By studying the hormone-related CREs, we will have a better understanding of the regulatory networks that govern plant physiology and adaptation to their environment. Taken together, the diverse cis-elements discovered upstream of these NlMADS genes suggest their involvement in plant responses and various biological processes. These elements likely play crucial roles in ensuring the normal growth and development of rambutan, as well as enhancing its ability to adapt to diverse natural environments.
Previous studies have demonstrated considerable variability in the expression of MADS-box family genes, both across species and within species. For example, in pineapple, the majority of MADS-box family genes show heightened expression levels in flowers, while certain genes display differential expression patterns between photosynthetic and non-photosynthetic leaf tissues[4]. Similarly, in longan, MADS-box family genes such as DlAP1, DlFUL, and DlMADS109 exhibit diverse expression levels across various tested tissues[7]. Analysis of Carica papaya, encompassing 152 samples unveiled distinct expression profiles for different subclasses of MADS-box family genes across various tissues[46]. Through expression analysis of rambutan MADS-box family genes, we aimed to decipher the molecular mechanisms underlying biological development. Our findings revealed that NlMADS genes are primarily active during flower and aril development. Notably, most genes in the Type I and MIKC* subgroups showed negligible expression, leaving their functions elusive. It is plausible that they are expressed solely in specific cells or under particular conditions. Conversely, certain NlMADS genes displayed tissue-specific expression, particularly those belonging to the MIKCC-type.
The ABCDE model stands as the foremost and classical model elucidating flower development in plants, with MADS-box family genes assuming pivotal roles, extensively studied across various model plants. In this model, A-class genes (AP1, CAL, FUL, and AGL79) dictate floral organ identity, fostering petal and sepal development while delineating floral meristem identity[47]. B-class genes (AP3 and PI) intricately specify petal and stamen structures. C/D class genes (AG, SHP1, SHP2, and STK) primarily orchestrates stamen, carpel, ovule, and fruit development[48−50]. Furthermore, E-class genes (SEP) exhibit partially redundant functions throughout flower development. Broadly, in Arabidopsis and other hermaphrodite species, sepals are determined by A- and E-class genes, petals by A-, B-, and E-class genes, stamens by B-, C-, and E-class genes, and carpels by C-, D-, and E-class genes[51]. Our investigation unveiled several NlMADS genes, namely NlMADS6 (NlAGL6), NlMADS28 (NlAP3), NlMADS43 (NlAGL4), NlMADS56 (NlAGL2), NlMADS62 (NlAP3), NlMADS64 (NlAP1), and NlMADS69 (NlAG), preferentially expressed in flowers. This finding underscores their potential contributions to flower differentiation and development, in line with the roles delineated in the ABCDE model.
Additionally, NlMADS4 (NlSHP1), NlMADS30 (NlSTK), NlMADS62 (NlAP3), NlMADS63 (NlTT16), NlMADS69 (NlAG), and NlMADS56 (NlAGL2) demonstrated notably elevated expression levels in the aril, with upregulation observed during the initial three stages of aril development. This pattern suggests their potential involvement in fleshy fruit development, particularly in aril formation. The AGAMOUS (AG), SEEDSTICK (STK), SHATTERPROOF (SHP1), and SHP2 genes, members of the AG clade, primarily contribute to the development of reproductive organs, fruit ripening, and seed dispersal[50,52]. In Arabidopsis, these genes function redundantly and are indispensable for specifying ovule integument identity and its subsequent development[53,54]. Notably, the rice STK ortholog, OsMADS13, exhibits confined expression within ovules and plays a crucial role in determining floral meristem fate, facilitating the transformation of ovules into carpelloid structures[55].
Previous research on the rambutan genome has underscored the significant involvement of D-class genes, such as STK and SHP1, in aril development, likely attributable to the aril's developmental origin from the ovule stalk[1]. Moreover, transcription factors within the ABCDE model of flower development genes also govern aril development in Ginkgo and Taxus. This encompasses B-class genes (APETALA3 and PISTILLATA), C-class genes (AGAMOUS), D-class genes (SEEDSTICK and SHATTERPROOF), and E-class genes (AGL6 and SEPALLATA), all contributing to the formation of fleshy structures[16,56]. Additionally, prior research has indicated that orthologs of AGL20, such as AtAGL20, OsMADS50, and GmSOC1-like, regulate flowering time in Arabidopsis, rice, and soybean, respectively[57−59]. However, the present findings reveal that NlMADS5 (NlAGL20) exhibits high expression exclusively in rambutan leaves and stems, with no detectable expression in reproductive organs, suggesting that NlMADS5 may have distinct roles in rambutan development compared to other plant species. In summary, these discoveries underscore the significance of MIKCC-type genes in rambutan flower and aril development. Furthermore, the identified NlMADS genes may serve crucial functions in aril development, presenting themselves as potential candidates for further functional characterization. Analysis of the protein-protein interaction network of the NlMADS genes unveiled various interactions among them, indicating their potential collective regulation of rambutan development and stress response through the formation of heterologous complexes. Particularly, NlSHP1 (NlMADS4), NlAG (NlMADS69), and NlSTK (NlMADS30) emerge as key nodes in the protein interaction network, underscoring once again their crucial role in regulating rambutan development.
In conclusion, this study provides the first comprehensive genome-wide characterization of the MADS-box gene family in rambutan. These findings shed light on the potential functional roles of NlMADS genes in regulating the rambutan development process.