-
Twenty-eight putative MdCYP86As, ranged from MdCYP86A1 to MdCYP86A28 were identified through analyzing and screening by HMMER software (Supplemental Table S2). The physicochemical properties of the MdCYP86As gene family were revealed. The physicochemical properties of the MdCYP86As family proteins were predicted using Expasy software. These result showed that the number of amino acids ranged from 423−753, the molecular weights ranged from 48.12−87.4 KDa, and the theoretical isoelectric points (PIs) ranged from 5.86−9.16. They were acidic proteins and their theoretical isoelectric points were less than 7, 23 proteins were alkaline proteins and their theoretical isoelectric points were greater than 7. Among all the proteins, the instability index of 12 proteins were less than 40 and 16 proteins were more than 40. It means that most of the MdCYP86As proteins are stable. The aliphatic index of all proteins were less than 100, and the total average hydrophobicity index was less than 0, which suggests that MdCYP86As proteins are hydrophilic proteins. The above analyses showed that most of the MdCYP86A family proteins were unstable hydrophilic alkaline protein (Table 1).
Table 1. The physicochemical properties of the MdCYP86As family proteins.
Sequence ID Number of amino acids Molecular weight Theoretical pI Instability index Aliphatic index Grand average of hydropathicity MdCYP86A1 553 62,140.98 5.86 51.14 96.47 −0.029 MdCYP86A2 594 68,147.91 9.05 33.81 94.6 −0.183 MdCYP86A3 508 58,180.4 8.77 30.04 95.63 −0.131 MdCYP86A4 503 58,126.88 8.86 33.51 85.31 −0.213 MdCYP86A5 585 67,211.62 8.19 36.64 89.85 −0.221 MdCYP86A6 753 87,408.56 6.3 41.04 87.78 −0.232 MdCYP86A7 512 59,231.46 8.31 38.68 89.28 −0.138 MdCYP86A8 522 58,977.08 9.16 41.38 90.65 −0.161 MdCYP86A9 524 59,217.2 8.85 39.72 90.5 −0.124 MdCYP86A10 508 57,785.03 8.2 40.48 84.45 −0.306 MdCYP86A11 535 60,662.44 8.15 43.34 94.64 −0.081 MdCYP86A12 513 58,460.27 8.84 40.19 94.56 −0.066 MdCYP86A13 516 59,081.04 7.97 42.69 89.59 −0.178 MdCYP86A14 525 59,381.74 6.17 50.18 97.89 −0.042 MdCYP86A15 423 48,120.41 6.35 35.13 92.25 −0.194 MdCYP86A16 514 58,923.46 7.95 44.63 89.32 −0.131 MdCYP86A17 506 57,795.41 8.9 41.48 86.5 −0.234 MdCYP86A18 506 57,541.1 9.03 41.64 85.18 −0.236 MdCYP86A19 522 59,333.65 8.59 30.91 94.16 −0.104 MdCYP86A20 544 62,600.47 8.45 37.62 88.88 −0.176 MdCYP86A21 574 65,958.23 8.81 39.96 89.51 −0.229 MdCYP86A22 521 59,981.17 8.65 38.03 94.45 0.034 MdCYP86A23 534 60,600.33 8.68 40.6 95.51 −0.098 MdCYP86A24 512 58,494.1 9.02 41.24 92.48 −0.088 MdCYP86A25 520 59,435.27 8.3 41.37 88.71 −0.192 MdCYP86A26 523 60,084.98 6.38 46.06 92.1 −0.152 MdCYP86A27 507 58,722.86 8.55 35.93 91.68 −0.113 MdCYP86A28 506 57,839.15 7.55 47.57 89.98 −0.092 Localization distribution and the inter-species covariance of MdCYP86A family genes
-
Twenty-eight MdCYP86A genes chromosomes were found to be unevenly distributed on the 13 chromosomes of apple with some distributional preferences. Chromosome 2 contained the largest quantities of MdCYP86A family members. MdCYP86A4, MdCYP86A5, MdCYP86A6, and MdCYP86A7 were closely arranged on the chromosomes, which suggests that they may have undergone tandem duplication events during the long period of gene amplification and evolution (Fig. 1a & Supplemental Table S3). The gene amplification enhanced their expression level, improved the function of genes, and facilitated the evolution and adaptation of organisms. These results suggested that gene duplication event displays function in the amplification and evolution of the MdCYP86A family.
Figure 1.
Localization distribution and the inter-species covariance of MdCYP86A family genes. (a) Distribution of MdCYP86A family genes on 13 apple chromosomes. (b) Analysis of covariance in Apple, Arabidopsis, Tobacco, Rice.
An interspecies covariance analysis between the CYP86A family genes in Apple and the homologous genes in Arabidopsis, Rice, and Tobacco, respectively. We found that the gene duplication frequency between Apple and Arabidopsis was the highest, with 31 covariances. This was followed by Rice with nine covariates. These indicated that the CYP86A family genes in Apple and Arabidopsis have the highest homology and the two species are most closely related to each other (Fig. 1b).
Phylogenetic and gene structure of MdCYP86As
-
To realize the evolutionary relationship of CYP86As in multispecies, CYP86As from Apple, Arabidopsis thaliana, Rice, and Tobacco were jointly constructed into a phylogenetic evolutionary tree. They were divided into three branches I, II, and III (Supplemental Table S4). Among which 18 MdCYP86As proteins belonged to the I branch and 10 MdCYP86As proteins belonged to the II branch. The similar gene structure between gene family members suggests that they may have similar transcriptional regulatory and splicing machines, which in turn led to the understanding of gene family regulation and expression patterns (Fig. 2a). In the gene structure analysis of 28 MdCYP86A proteins showed that each protein sequence contained exons, numbering from 1−9. Among them, MdCYP86A2 has the maximum number of exons. These exons were longer than the introns and distributed at both ends of the sequence. MdCYP86As exons and introns have similar distribution patterns, which shows members are closely evolutionarily related and have similar expression patterns (Fig. 2b).
Figure 2.
Phylogenetic and gene structure of MdCYP86As. (a) Phylogenetic tree construction in MEGA software for CYP86A family genes in four species: Arabidopsis thaliana, Rice, Tobacco, and Apple. (b) Gene structure analysis of MdCYP86A subfamily genes in apple, including both CDS and UTR regions.
Evolutionary and conserved motif analysis of MdCYP86A gene in apple
-
To study the evolutionary relationships of MdCYP86A genes, the evolutionary tree of the family was drawn using MEGA11, and found that 28 family genes were closely related to each other. In addition, most of the evolutionary distances greater than 80, which indicated that the family was highly conserved (Fig. 3a). Based on the family evolutionary tree, the MEME website was used to analyze the conserved motifs, and more than 10 motifs were screened (Fig. 3b). Although the number of motifs in each sequence varied, each gene possessed more than eight motifs, and the distribution of motifs in the sequences was generally consistent, indicating that these 10 motifs were highly conserved in this family (Supplemental Fig. S1). The conserved structural domains of MdCYP86As were also batch screend in NCBI, and found that all 28 protein sequences contained CYP86A structural domains, which all contained specific binding sites (Fig. 3c). However, MdCYP86A6 contains not only the CYP86A structural domain but also the cytochrome P450 subfamily structural domain, which contains a non-specific binding site. It was speculated that this gene is likely to bind some specific endogenous and exogenous redox compounds of the substrate, thus catalyzing the redox reaction of the substrate (Fig 3c). Because MdCYP86A21 has the greatest variety of stress-related response elements, including ARE, STRE, LTR, MRE and so on. The protein 3D structure of MdCYP86A21 was predicted. It showed that the MdCYP86As family proteins have high similarity in 3D structure (Fig. 3d & Supplemental Fig. S2).
Figure 3.
Evolutionary and conserved motif analysis of MdCYP86As gene in apple. (a) Construction of MdCYP86A subfamily evolutionary tree in MEGA11 software, the numbers next to the branches represent the evolutionary proximity between genes. (b) MdCYP86A subfamily motif analysis, on the left side is the gene ID of the family, and different colored boxes represent different motifs, with sequence length scale bar underneath. (c) The MdCYP86A subfamily conserved structural domains are analyzed, with gene ID on the left, structural domain on the right, and sequence length scale bar below. (d) Protein 3D structure image of MdCYP86A21.
Cis-acting element analysis
-
Upstream cis-acting elements of promoters are important for the selection of gene transcription start sites and transcription rates. By analyzing the types and numbers of cis-acting elements upstream in promoters, the regulatory mechanisms of gene family members can be revealed, which leads to a deeper understanding of the functions and roles of the family members. To identify 28 conserved cis-acting elements within the promoter region of the MdCYP86As, the 2,000 bp sequences upstream of the ATG start codon were submitted to the PlantCARE database for promoter prediction analysis. These cis-acting elements were classified into four categories, including growth-associated, light-associated, stress-associated and hormone-associated. Among the growth and development-related cis-acting elements, the main elements are involved in endosperm expression (AAGAA-motif), shoot expression (As-1), correlation with meristematic tissue expression (CAT-box), endosperm-specific expression element (GCN4-motif), and senescence expression (W-box), which are contained in the first 2,000 bp upstream of most the MdCYP86A gene promoter cis-acting elements with bud expression (As-1) and had the highest proportion (34%) with the number of 41. In the light response element category, TCCC-motif had the largest proportion. In the stress response-related category, they were associated with anaerobic induction (ARE, GC-motif) and resist stress (STRE, TC-rich repeats), trauma (WRE3, WUN-motif), metal-response (MRE), cold-response (LTR) and drought-response (MBS), with the highest number of defense stress (STRE). And in the hormone response element category included abscisic acid (ABRE, ABRE3a, ABRE4), growth hormone (TGA-element), gibberellin (P-box, TATC-box), jasmone acid (CGTCA-motif, TGACG-motif, MYB, MYC), salicylic acid (TCA), and ethylene-responsive (ERE) response elements, with the largest proportion of cis-acting elements responding to jasmone acid (MeJA). Since hormone-signaling substances have more important roles in plants with biotic and abiotic stresses, the elements involved in the stress response and hormone response may be involved in the regulation of expression in apple interaction with adversity. Because hormonal signaling closely related with responding to stress, hormone-related elements in MdCYP86As promoter suggests that this family of genes may be involved in the regulation of expression in apple and adversity interactions (Fig. 4). These showed that the promoters of the MdCYP86As contain a variety of response elements that can be recognized and bound by different transcription factors involved in plant growth, development, and stress response.
Figure 4.
Cis-acting elements analysis of the MdCYP86A subfamily. The classification of the different types of elements on the left, the gene ID below, and the name of the element on the right. The number representing the number of elements, and a larger number representing higher numbers of that element.
Expression patterns of MdCYP86A21 in different tissues and abiotic stress
-
Previous analyses of cis-acting elements showed that the MdCYP86A family genes all contain different numbers of promoter elements under abiotic stress, we examined whether MdCYP86A21 is differentially expressed under PEG, NaCl, and ABA abiotic stresses. Compared to the control, NaCl treatment resulted in the highest expression at 3 h, followed by a decrease reaching a minimum at 12 h, and then increasing again at 24 h. The expression of PEG treatment was lower than the control in the early stage. Until 24 h, the expression reached about eight times the control, then decreased. The expression was elevated at 3 h, and then peaked at 12 h, which was about 27 times of the control, and then decreased to about six times of the control in ABA treatment. The above results suggest that MdCYP86A21 has a relative function in the response to abiotic stresses (Fig. 5a).
Figure 5.
Expression patterns of MdCYP86A21 in different tissues and abiotic stress. (a) Expression pattern of MdCYP86A21 in response to three abiotic stresses, including PEG, NaCl and ABA. There are three biological replicates of the data, p < 0.05. (b) Expression levels of MdCYP86A21 in root, stem, leave, flower and fruit, there are three biological replicates of the data, p < 0.05.
Tissue-specific expression patterns revealed that MdCYP86A21 was mainly expressed in flowers, followed in by root and fruit, with the lowest expression in leaves, MdCYP86A21 function primarily in flowers (Fig. 5b). 35S::MdCYP86A21-RFP was constructed using the full sequence of MdCYP86A21. RFP empty vector was also injected into tobacco. The empty vector was observed to fluoresce in both the nucleus and the cell membrane under laser confocal microscopy. 35S::MdCYP86A21-RFP appeared net structure and overlapped with the endoplasmic reticulum marker (PAL2259). From the results, it could be determined that MdCYP86A21 is a membrane protein which localizes endoplasmic reticulum (Fig. 6).
MdCYP86A21 improves the tolerance to PEG and NaCl
-
To further investigate the role of MdCYP86A21 to abiotic stresses in apple, overexpressed (MdCYP86A21-OE) was obtained and suppressed (antiMdCYP86A21-PRI) transgenic calli were obtained to detect the resistance of abiotic stresses (Supplemental Fig. S3). The results showed that there was no significant difference in the fresh weight of WT, MdCYP86A21-OE, and antiMdCYP86A21-PRI in MS. But in MS with 4% PEG and 100 mM NaCl, the fresh weight of MdCYP86A21-OE was significantly increased and antiMdCYP86A21-PRI was significantly decreased compared to the wild type (Fig. 7a & b). The content of MDA was significantly decreased in MdCYP86A21-OE than wild type. MdCYP86A21 improves apple calli resistance to PEG and NaCl (Fig. 7c). These results suggested that MdCYP86A21 responded to PEG and NaCl stresses, and improved drought and salt resistance.
Figure 7.
MdCYP86A21 improves the tolerance to PEG and NaCl. (a) MdCYP86A21 apple calli treated on MS, PEG, NaCl medium for 15 d, the first row is wild type, the second is overexpression, and the third is suppression, five replicates per treatment. (b) Fresh weight of wild-type, overexpressed, and suppression calli were determined under different treatments. (c) Determination of MDA content in wild-type, overexpressed, and suppression calli under different treatments.
MdCYP86A21 interacting protein prediction
-
In order to further investigate the molecular mechanism of MdCYP86A21 involved in wax biosynthesis, MdCYP86A21 interacting proteins were predicted. In the predicted network, almost all proteins are related to fatty acid biosynthesis. Among them, ACOT is an acyl-coA thiesterase, which is involved in the hydrolysis of fatty acyl CoA. FAD is a fatty acid dehydrogenase involved in the synthesis of wax esters to promote oleaginous greasiness. CER2 and CYP86A7 are directly involved in wax biosynthesis (Fig. 8).
-
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
-
About this article
Cite this article
Lv HM, Qi RH, Yu ZH, Man YY, Lv YH, et al. 2024. Genome-wide identification and characterization of MdCYP86A family in apple. Fruit Research 4: e030 doi: 10.48130/frures-0024-0024
Genome-wide identification and characterization of MdCYP86A family in apple
- Received: 09 May 2024
- Revised: 28 June 2024
- Accepted: 02 July 2024
- Published online: 02 September 2024
Abstract: The cytochrome P450 enzyme is one of the largest protein families identified in cuticle biosynthesis, plant growth, development, and stress response. In the present study, 28 members of the MdCYP86A family were identified in apple (Malus × domestica Borkh.). Chromosomal localization analysis found that 28 genes were distributed on 13 chromosomes in apple. Phylogenetic and conserved motifs analysis showed that MdCYP86As in apple is evolutionarily most closely related to Arabidopsis thaliana. Each gene contains a cytochrome P450 domain, suggesting that MdCYP86As are highly conserved during the evolution of the species. Cis-acting element analysis of the promoter identified various elements associated with stress response. MdCYP86A21 was cloned in the apple genome to investigate the function of MdCYP86A family genes. MdCYP86A21 responded to abiotic stress, including PEG, NaCl, and ABA. The tissue-specific expression patterns revealed that the highest expression tissue was in flower by real-time quantitative reverse transcription (RT-qPCR). Subcellular localization showed that MdCYP86A21 was localized in the endoplasmic reticulum. Further, the function of MdCYP86A21 to enhance the resistance to PEG and NaCl stress in apple calli was verified. These results contribute to further study on the function of MdCYP86As and provide a molecular basis for the study of apple stress tolerance.
-
Key words:
- Apple /
- MdCYP86A gene family /
- MdCYP86A21 /
- Abiotic stress