Search
2021 Volume 1
Article Contents
ARTICLE   Open Access    

Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth

More Information
  • Lignin plays an important role in plant growth and development. It serves as a raw material for the manufacture of paper, animal feed, and chemical fertilizers. However, the regulation of lignin biosynthesis genes and the composition of the relevant gene families remain unclear in many plant species. Here, we identified and characterized 11 families of monolignol biosynthesis genes in Salix matsudana Koidz. Based on phylogenetic analysis of lignin biosynthesis genes from nine angiosperm species (Arabidopsis thaliana, Oryza sativa, Zea mays, Solanum lycopersicum, S. suchowensis, S. purpurea, Populus euphratica, P. trichocarpa, and S. matsudana), the 11 gene families could be divided into two classes that differed in their apparent evolutionary history. We compared the distribution of lignin biosynthesis genes between the two sub-genomes (At and Bt) of S. matsudana and found that more duplicated genes were present in the Bt sub-genome. We analyzed RNA sequencing data from two parents of contrasting height and two of their F1 progeny, and detected 23 differentially expressed genes (DEGs) that may regulate accelerated growth. We analyzed the promoter regions of the lignin-related DEGs and identified several hormone-related (auxin, ethylene, and cytokinin) transcription factor binding sites. These results provide an important foundation for future studies on the molecular mechanisms and genetic regulation of lignin biosynthesis and its relationship to accelerated growth in forest trees.
  • 加载中
  • Supplemental Table S1 Identified monolignol biosynthesis genes in nine species.
    Supplemental Table S2 Primers used in this study.
    Supplemental Table S3 Protein domain analysis of the 11 gene families.
    Supplemental Fig. S1 qRT-PCR analysis of four differentially expressed genes in ‘Yanjiang’, ‘FS’, ‘9901’, and ‘FH’. **p < 0.01 by t-test.
  • [1] Boerjan W, Ralph J, Baucher M. 2003. Lignin biosynthesis. Annual Review of Plant Biology 54:519−46 doi: 10.1146/annurev.arplant.54.031902.134938

    CrossRef   Google Scholar

    [2] Weng JK, Akiyama T, Bonawitz ND, Li X, Ralph J, et al. 2010. Convergent evolution of syringyl lignin biosynthesis via distinct pathways in the lycophyte Selaginella and flowering plants. The Plant Cell 22:1033−45 doi: 10.1105/tpc.109.073528

    CrossRef   Google Scholar

    [3] Poovaiah CR, Nageswara-Rao M, Soneji JR, Baxter HL, Stewart CN. 2014. Altered lignin biosynthesis using biotechnology to improve lignocellulosic biofuel feedstocks. Plant Biotechnology Journal 12:1163−73 doi: 10.1111/pbi.12225

    CrossRef   Google Scholar

    [4] Zhao Q, Dixon RA. 2014. Altering the cell wall and its impact on plant disease: from forage to bioenergy. Annual Review of Phytopathology 52:69−91 doi: 10.1146/annurev-phyto-082712-102237

    CrossRef   Google Scholar

    [5] Barros J, Serk H, Granlund I, Pesquet E. 2015. The cell biology of lignification in higher plants. Annals of Botany 115:1053−74 doi: 10.1093/aob/mcv046

    CrossRef   Google Scholar

    [6] Uzal EN, Gómez Ros LV, Pomar F, Bernal MA, Paradela A, et al. 2009. The presence of sinapyl lignin in Ginkgo biloba cell cultures changes our views of the evolution of lignin biosynthesis. Physiologia Plantarum 135:196−213 doi: 10.1111/j.1399-3054.2008.01185.x

    CrossRef   Google Scholar

    [7] Bonawitz ND, Chapple C. 2010. The Genetics of lignin biosynthesis: connecting genotype to phenotype. Annual Review of Genetics 44:337−63 doi: 10.1146/annurev-genet-102209-163508

    CrossRef   Google Scholar

    [8] Zhao Q. 2016. Lignification: flexibility, biosynthesis and regulation. Trends in Plant Science 21:713−21 doi: 10.1016/j.tplants.2016.04.006

    CrossRef   Google Scholar

    [9] Shi R, Sun YH, Li Q, Heber S, Sederoff R, et al. 2010. Towards a systems approach for lignin biosynthesis in Populus trichocarpa: transcript abundance and specificity of the monolignol biosynthetic genes. Plant and Cell Physiology 51:144−63 doi: 10.1093/pcp/pcp175

    CrossRef   Google Scholar

    [10] Weng JK, Chapple C. 2010. The origin and evolution of lignin biosynthesis. New Phytologist 187:273−85 doi: 10.1111/j.1469-8137.2010.03327.x

    CrossRef   Google Scholar

    [11] Wang JP, Liu B, Sun Y, Chiang VL, Sederoff RR. 2019. Enzyme-enzyme interactions in monolignol biosynthesis. Frontiers in Plant Science 9:1942 doi: 10.3389/fpls.2018.01942

    CrossRef   Google Scholar

    [12] Balmant KM, Noble JD, Alves FC, Dervinis C, Conde D, et al. 2020. Xylem systems genetics analysis reveals a key regulator of lignin biosynthesis in Populus deltoides. Genome Research 30:1131−43 doi: 10.1101/gr.261438.120

    CrossRef   Google Scholar

    [13] Zhang J, Yuan H, Li Y, Chen Y, Liu G, et al. 2020. Genome sequencing and phylogenetic analysis of allotetraploid Salix matsudana Koidz. Horticulture Research 7:201

    Google Scholar

    [14] Dai X, Hu Q, Cai Q, Feng K, Ye N, et al. 2014. The willow genome and divergent evolution from poplar after the common genome duplication. Cell Research 24:1274−7 doi: 10.1038/cr.2014.83

    CrossRef   Google Scholar

    [15] Chen Y, Jiang Y, Chen Y, Feng W, Liu G, et al. 2020. Uncovering candidate genes responsive to salt stress in Salix matsudana (Koidz) by transcriptomic analysis. PloS One 15:e0236129 doi: 10.1371/journal.pone.0236129

    CrossRef   Google Scholar

    [16] Wheeler TJ, Eddy SR. 2013. nhmmer: DNA homology search with profile HMMs. Bioinformatics 29:2487−9 doi: 10.1093/bioinformatics/btt403

    CrossRef   Google Scholar

    [17] Li X, Liu G, Geng Y, Wu M, Pei W, et al. 2017. A genome-wide analysis of the small auxin-up RNA (SAUR) gene family in cotton. BMC Genomics 18:815 doi: 10.1186/s12864-017-4224-2

    CrossRef   Google Scholar

    [18] Liu G, Liu J, Pei W, Li X, Wang N, et al. 2019. Analysis of the MIR160 gene family and the role of MIR160a_A05 in regulating fiber length in cotton. Planta 250:2147−58 doi: 10.1007/s00425-019-03271-7

    CrossRef   Google Scholar

    [19] Zhang J, Yuan H, Yang Q, Li M, Wang Y, et al. 2017. The genetic architecture of growth traits in Salix matsudana under salt stress. Horticulture Research 4:17024 doi: 10.1038/hortres.2017.24

    CrossRef   Google Scholar

    [20] Pertea M, Pertea GM, Antonescu CM, Chang T, Mendell JT, et al. 2015. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature Biotechnology 33:290−5 doi: 10.1038/nbt.3122

    CrossRef   Google Scholar

    [21] Young MD, Wakefield Mj, Smyth Gk, Oshlack A. 2010. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol 11:R14 doi: 10.1186/gb-2010-11-2-r14

    CrossRef   Google Scholar

    [22] Liu G, Wu M, Pei W, Li X, Wang N, et al. 2019. A comparative analysis of small RNAs between two Upland cotton backcross inbred lines with different fiber length: Expression and distribution. The Crop Journal 7:198−208 doi: 10.1016/j.cj.2018.08.004

    CrossRef   Google Scholar

    [23] Sundell D, Street NR, Kumar M, Mellerowicz EJ, Kucukoglu M, et al. 2017. AspWood: high-spatial-resolution transcriptome profiles reveal uncharacterized modularity of wood formation in Populus tremula. The Plant Cell 29:1585−604 doi: 10.1105/tpc.17.00153

    CrossRef   Google Scholar

    [24] Chow CN, Lee TY, Hung YC, Li GZ, Tseng KC, et al. 2019. PlantPAN3.0: a new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants. Nucleic Acids Research 47:D1155−D1163 doi: 10.1093/nar/gky1081

    CrossRef   Google Scholar

    [25] Gunasekara C, Subramanian A, Avvari JVRK, Li B, Chen S, et al. 2016. ExactSearch: a web-based plant motif search tool. Plant Methods 12:26 doi: 10.1186/s13007-016-0126-6

    CrossRef   Google Scholar

    [26] Kumari S, Nie J, Chen H, Ma H, Stewart R, et al. 2012. Evaluation of gene association methods for coexpression network construction and biological knowledge discovery. PLoS One 7:e50411 doi: 10.1371/journal.pone.0050411

    CrossRef   Google Scholar

    [27] Hefer CA, Mizrachi E, Myburg AA, Douglas CJ, Mansfield SD. 2015. Comparative interrogation of the developing xylem transcriptomes of two wood-forming species: Populus trichocarpa and Eucalyptus grandis. New Phytologist 206:1391−405 doi: 10.1111/nph.13277

    CrossRef   Google Scholar

    [28] Wang JP, Matthews ML, Williams CM, Shi R, Yang C, et al. 2018. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis. Nature Communications 9:1579 doi: 10.1038/s41467-018-03863-z

    CrossRef   Google Scholar

    [29] Shen H, Mazarei M, Hisano H, Escamilla-Trevino L, Fu C, et al. 2013. A genomics approach to deciphering lignin biosynthesis in switchgrass. The Plant Cell 25:4342−61 doi: 10.1105/tpc.113.118828

    CrossRef   Google Scholar

    [30] Seyfferth C, Wessels B, Jokipii-Lukkari S, Sundberg B, Delhomme N, et al. 2018. Ethylene-related gene expression networks in wood formation. Frontiers in Plant Science 9:272 doi: 10.3389/fpls.2018.00272

    CrossRef   Google Scholar

    [31] Immanen J, Nieminen K, Smolander OP, Kojima M, Alonso Serra J, et al. 2016. Cytokinin and auxin display distinct but interconnected distribution and signaling profiles to stimulate cambial activity. Current Biology 26:1990−7 doi: 10.1016/j.cub.2016.05.053

    CrossRef   Google Scholar

    [32] Bhalerao RP, Fischer U. 2014. Auxin gradients across wood - instructive or incidental? Physiologia Plantarum 151:43−51 doi: 10.1111/ppl.12134

    CrossRef   Google Scholar

    [33] Morreel K, Goeminne G, Storme V, Sterck L, Ralph J, et al. 2006. Genetical metabolomics of flavonoid biosynthesis in Populus: a case study. The Plant Journal 47:224−37 doi: 10.1111/j.1365-313X.2006.02786.x

    CrossRef   Google Scholar

    [34] Kuzmin E, VanderSluis B, Nguyen Ba AN, Wang W, Koch EN, et al. 2020. Exploring whole-genome duplicate gene retention with complex genetic interaction analysis. Science 368:eaaz5667 doi: 10.1126/science.aaz5667

    CrossRef   Google Scholar

    [35] Keane OM, Toft C, Carretero-Paulet L, Jones GW, Fares MA. 2014. Preservation of genetic and regulatory robustness in ancient gene duplicates of Saccharomyces cerevisiae. Genome Research 24:1830−41 doi: 10.1101/gr.176792.114

    CrossRef   Google Scholar

    [36] Włoch W, Wilczek A, Jura-Morawiec J, Kojs P, Iqbal M. 2013. Modelling for rearrangement of fusiform initials during radial growth of the vascular cambium in Pinus sylvestris L. Trees 27:879−93 doi: 10.1007/s00468-013-0842-8

    CrossRef   Google Scholar

    [37] Wilczek A, Jura-Morawiec J, Kojs P, Iqbal M, Włoch W. 2011. Correlation of intrusive growth of cambial initials to rearrangement of rays in the vascular cambium. IAWA J 32:313−31 doi: 10.1163/22941932-90000060

    CrossRef   Google Scholar

    [38] Oraby HF, Ramadan MF. 2015. Impact of suppressing the caffeic acidO-methyltransferase (COMT) gene on lignin, fiber, and seed oil composition in Brassica napus transgenic plants. European Food Research and Technology 240:931−8 doi: 10.1007/s00217-014-2397-3

    CrossRef   Google Scholar

    [39] Wang YJ, Sheng LP, Zhang HR, Du XP, An C, et al. 2017. CmMYB19 over-expression improves aphid tolerance in Chrysanthemum by promoting lignin synthesis. International Journal Of Molecular Sciences 18:619 doi: doi.org/10.3390/ijms18030619

    CrossRef   Google Scholar

    [40] Gui J, Lam PY, Tobimatsu Y, Sun J, Huang C, et al. 2020. Fibre-specific regulation of lignin biosynthesis improves biomass quality in Populus. New Phytologist 226:1074−87 doi: 10.1111/nph.16411

    CrossRef   Google Scholar

    [41] Peng X, Sun S, Wen J, Yin W, Sun R. 2014. Structural characterization of lignins from hydroxycinnamoyl transferase (HCT) down-regulated transgenic poplars. Fuel 134:485−92 doi: 10.1016/j.fuel.2014.05.069

    CrossRef   Google Scholar

    [42] Xia X, Tang Y, Wei M, Zhao D. 2018. Effect of paclobutrazol application on plant photosynthetic performance and leaf greenness of Herbaceous Peony. Horticulturae 4:5 doi: 10.3390/horticulturae4010005

    CrossRef   Google Scholar

    [43] Xie M, Zhang J, Tschaplinski TJ, Tuskan GA, Chen JG, et al. 2018. Regulation of lignin biosynthesis and its role in growth-defense tradeoffs. Frontiers in Plant Science 9:1427 doi: 10.3389/fpls.2018.01427

    CrossRef   Google Scholar

    [44] Ohtani M, Demura T. 2019. The quest for transcriptional hubs of lignin biosynthesis: beyond the NAC-MYB-gene regulatory network model. Current Opinion in Biotechnology 56:82−7 doi: 10.1016/j.copbio.2018.10.002

    CrossRef   Google Scholar

    [45] Zhang J, Gao G, Chen J, Taylor G, Cui K, et al. 2011. Molecular features of secondary vascular tissue regeneration after bark girdling in Populus. New Phytologist 192:869−84 doi: 10.1111/j.1469-8137.2011.03855.x

    CrossRef   Google Scholar

    [46] Pesquet E, Tuominen H. 2011. Ethylene stimulates tracheary element differentiation in Zinnia elegans cell cultures. New Phytologist 190:138−49 doi: 10.1111/j.1469-8137.2010.03600.x

    CrossRef   Google Scholar

    [47] Felten J, Vahala J, Love J, Gorzsás A, Rüggeberg M, et al. 2018. Ethylene signaling induces gelatinous layers with typical features of tension wood in hybrid aspen. New Phytologist 218:999−1014 doi: 10.1111/nph.15078

    CrossRef   Google Scholar

    [48] Harkey AF, Yoon GM, Seo DH, DeLong A, Muday GK. 2019. Light modulates ethylene synthesis, signaling, and downstream transcriptional networks to control plant development. Frontiers in Plant Science 10:1094 doi: 10.3389/fpls.2019.01094

    CrossRef   Google Scholar

    [49] Love J, Björklund S, Vahala J, Hertzberg M, Kangasjärvi J, et al. 2009. Ethylene is an endogenous stimulator of cell division in the cambial meristem of Populus. Proceedings Of The National Academy Of Sciences Of The United States Of America 106:5984−9 doi: 10.1073/pnas.0811660106

    CrossRef   Google Scholar

    [50] Andersson-Gunnerås S, Hellgren JM, Björklund S, Regan S, Moritz T, et al. 2003. Asymmetric expression of a poplar ACC oxidase controls ethylene production during gravitational induction of tension wood. The Plant Journal 34:339−49 doi: 10.1046/j.1365-313X.2003.01727.x

    CrossRef   Google Scholar

  • Cite this article

    Liu G, Li Y, Liu Y, Guo H, Guo J, et al. 2021. Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth. Forestry Research 1: 8 doi: 10.48130/FR-2021-0008
    Liu G, Li Y, Liu Y, Guo H, Guo J, et al. 2021. Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth. Forestry Research 1: 8 doi: 10.48130/FR-2021-0008

Figures(6)  /  Tables(2)

Article Metrics

Article views(5768) PDF downloads(851)

ARTICLE   Open Access    

Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth

Forestry Research  1 Article number: 8  (2021)  |  Cite this article

Abstract: Lignin plays an important role in plant growth and development. It serves as a raw material for the manufacture of paper, animal feed, and chemical fertilizers. However, the regulation of lignin biosynthesis genes and the composition of the relevant gene families remain unclear in many plant species. Here, we identified and characterized 11 families of monolignol biosynthesis genes in Salix matsudana Koidz. Based on phylogenetic analysis of lignin biosynthesis genes from nine angiosperm species (Arabidopsis thaliana, Oryza sativa, Zea mays, Solanum lycopersicum, S. suchowensis, S. purpurea, Populus euphratica, P. trichocarpa, and S. matsudana), the 11 gene families could be divided into two classes that differed in their apparent evolutionary history. We compared the distribution of lignin biosynthesis genes between the two sub-genomes (At and Bt) of S. matsudana and found that more duplicated genes were present in the Bt sub-genome. We analyzed RNA sequencing data from two parents of contrasting height and two of their F1 progeny, and detected 23 differentially expressed genes (DEGs) that may regulate accelerated growth. We analyzed the promoter regions of the lignin-related DEGs and identified several hormone-related (auxin, ethylene, and cytokinin) transcription factor binding sites. These results provide an important foundation for future studies on the molecular mechanisms and genetic regulation of lignin biosynthesis and its relationship to accelerated growth in forest trees.

    • Lignin is an aromatic phenolic compound found in the cell walls of vascular plants and accounts for 18–35% of total plant biomass[1]. Lignin is deposited in the walls and intermediate layers of some vascular plant cells, where it fills in the spaces of the cellulose microfibril framework. Lignin enhances the mechanical strength of plant cell walls, and its hydrophobicity enables the long-distance transport of water and nutrients in the xylem[2]. Lignin also serves as a defensive barrier against insect pests and can protect plants from bacterial infections[3]. It is used in the production of engineering plastics, thermoplastic elastomers, and polymer foams[4]. Lignin content is a critical factor that determines the application of specific woods. Wood with low lignin content is easier to degrade and is often used in the pulp and paper industries, whereas wood with high lignin content has stronger mechanical properties and is often used for construction and decoration[5].

      Lignin is polymerized from the three monolignols p-coumaryl alcohol, coniferyl alcohol and sinapyl alcohol, also called the H, G and S subunits. These aromatic alcohols are produced from phenylalanine and tyrosine via the phenylpropanoid pathway through a series of reactions that include deamination, hydroxylation, methylation, and reduction[1]. Enzymes involved in this process include 4-coumarate:CoA ligase (4CL), cinnamate-4-hydroxylase (C4H), phenylalanine ammonia lyase (PAL), caffeic acid 3-O-methyltransferase (COMT), hydroxycinnamoyl-CoA shikimate/quinate hydroxycinnamoyl transferase (HCT), cinnamate-3-hydroxylase (C3H), caffeoyl shikimate esterase (CSE), caffeoyl-CoA 3-O-methyltransferase (CCoAOMT), cinnamoyl-CoA reductase (CCR), cinnamyl alcohol dehydrogenase (CAD), and ferulate-5-hydroxylase (F5H). The resulting monolignols are polymerized by oxidases (peroxidases, POX) and laccases (LACs) to form lignins that contain different amounts of the three monolignols.

      Lignin content and composition differ among plant species. In general, herbaceous plant lignin contains G, S, and H subunits, hardwood lignin contains mainly G and S subunits, and coniferous wood lignin consists almost entirely of G subunits[6]. In angiosperms, lignin is composed mainly of G and S monolignols, along with much lower amounts of H subunits. Bonawitz and Chappie (2010) contend that S-type lignin provides greater flexibility and that this flexible polymer may be important for herbs that regrow their aboveground biomass each year[7].

      Key genes that encode essential enzymes for monolignol biosynthesis have been identified in a number of plant species[1,4,8]. As more plant genomes become available, genome-wide surveys enable a systematic characterization of key enzymes and their corresponding family members. Lignin plays a crucial role in the evolution of plant species from aquatic algae to terrestrial plants[9]. Additional research is needed to better understand the origin and biosynthesis of this important plant polymer[10]. Identification and functional characterization of lignin biosynthesis enzymes and their associated genes will provide a foundation for the systematic analysis of carbon flux through lignin metabolism. Such research can guide the genetic improvement of forestry species[11].

      Due to its strong tolerance to salt, heavy metal, and cold damage and its resistance to diseases and pests, Salix matsudana Koidz has a wide global distribution, especially in China. Willow produces large amounts of biomass, and it is easy to propagate and is rich in variety. It is widely used in commercial forestry, and its wood is an important raw material for paper, gunpowder, construction equipment, particleboard, and other industrial products. The release of the S. matsudana genome now enables the identification of key lignin biosynthesis genes and gene families. Although there have been a number of reports on monolignol biosynthesis genes in Populus[12], most studies have focused on one or two genes rather than considering whole gene families.

      The systematic identification and functional annotation of key monolignol biosynthesis genes in S. matsudana provide important background data for improving its lignin content and composition[13] to obtain fast-growing and high-quality wood. The results also provide insight into the evolution of the monolignol biosynthesis pathway at the molecular level.

    • The monolignol biosynthesis pathway (map00940) was retrieved from the KEGG pathway database (https://www.kegg.jp/kegg/pathway.html). Related genes from the reference genomes of A. thaliana, O. sativa, Z. mays, S. lycopersicum, P. euphratica, and P. trichocarpa were obtained from the Phytozome database (http://phytozome.jgi.doe.gov). Monolignol synthesis-related genes were identified in S. matsudana[13], S. suchowensis[14], and S. purpurea using genes from the six species above as queries in local BLAST searches (E-value < 1e−10) against the genomes of the three Salix species[15]. HMMER (http://www.hmmer.org) was used to determine whether the candidate genes contain the essential domains of each family[16]. The identified genes analyzed are listed in Supplemental Table S1.

    • Multiple protein sequences of the members in each gene family were aligned using ClustalW. A phylogenetic tree for each family was constructed using the Maximum-Likelihood method in MEGA X with the pairwise deletion option, the Poisson correction model[17], and 1000 bootstrap replicates. Different clades within each family are identified using different colors.

    • The chromosomal locations of all monolignol biosynthesis genes were obtained from the S. matsudana genome sequence database, and Mapchart 2.2 was used to generate a chromosomal location map[18]. Gene duplication events were defined based on two criteria: the alignment region covered more than 80% of the longer gene, and the identity of the aligned regions was greater than 80%. Regions that contained five or more lignin-related genes in less than 5 Mb were considered to be lignin biosynthesis hotspots[17].

    • We obtained cuttings from two S. matsudana parents with significantly different tree heights (TH) and diameters at breast height (tall and thick ‘9901’ and short and thin ‘Yanjiang’)[19] and from the tall and thick F1 progeny ‘FH’ and the short and thin F1 progeny ‘FS.’ The cuttings (10 cm in length and 1 cm thick) were placed into nutrient soil in three replications. Stem terminals (0–5 cm long) were collected from each replication after 4 months and used for RNA-seq to measure the expression levels of monolignol synthesis genes in the four genotypes. Total RNA was extracted from three biological replicates of each genotype using the Plant RNA Reagent kit (Tiangen, China) according to the manufacturer's instructions. RNA was quantified using a NanoDrop ND 2000 spectrophotometer (Thermo Fisher, USA) and stored at −80 °C prior to sequencing on the Illumina platform (Illumina, USA) by Majorbio (Shanghai, China). RSEM (http://deweylab.github.io/RSEM) was used to calculate gene expression level as the fragments per kilobase of transcript per million mapped reads (FPKM), thereby normalizing for transcript length and the total number of mapped reads[20]. qRT-PCR analysis of the candidate genes was conducted using the One-Step SYBR Primer Script Plus RT-PCR kit (Takara, China) according to the manufacturer’s instructions. The Actin gene was used as an internal control[15]. All primers are listed in Supplemental Table S2.

      Gene Ontology (GO) (http://www.geneontology.org) and KEGG pathway analyses were performed to assign potential functions to the assembled genes using Blast2GO[21]. The top three hits (P < 0.001) were used to provide an annotation for each target gene. Differentially expressed genes (DEGs) were defined based on two criteria using the DESeq2 package: (1) a two-fold difference in expression level (FPKM) between tall and short plants, and (2) a Benjamini–Hochberg FDR-adjusted P-value < 0.05[22]. The DEGs were visualized using heat maps constructed with the pheatmap2 function in R (version 4.0.1).

    • Cuttings of the taller ‘9901’ parent, the shorter ‘Yanjiang’ parent, the tall F1 progeny ‘FH,’ and the short F1 progeny ‘FS’ (10 cm in length and 1 cm thick) were placed into nutrient soil in three replications. Stem terminals (3–8 cm from the top) were collected from each replication after 4 months. The samples were dried to a constant weight at 65 °C, then crushed, filtered through a 300–500 μm sieve, and weighed (W, dry weight). We then used the dried tissue to perform acetylation reactions using a lignin content test kit (Solarbio, Beijing, China) according to the manufacturer's instructions. Finally, OD280 (A) was measured using a UV spectrophotometer. The total lignin content was calculated using the following formula: lignin content (mg/g) = 2.184 × ΔA/W.

    • For promoter analysis, 2000 bp of genomic DNA sequence upstream of the start codon (ATG) of each DEG was downloaded from the genome sequence database. Differentiating xylem–specific lignin biosynthesis genes were identified based on homology to P. trichocarpa differentiating xylem–specific genes in the AspWood database (http://aspwood.popgenie.org/aspwood-v3.0) using BLAST searches[23]. The PlantPAN 3.0 database (http://plantpan.itps.ncku.edu.tw)[24] and ExactSearch, a fast plant motif search tool (http://sys.bio.mtu.edu/motif)[25], were used to search for cis-acting regulatory elements in the putative promoter regions. A heatmap was drawn using the matrix2png interface (https://matrix2png.msl.ubc.ca)[26].

    • To compare monolignol biosynthesis genes in angiosperms, nine species (A. thaliana, O. sativa, Z. mays, S. lycopersicum, S. suchowensis, S. purpurea, P. euphratica, P. trichocarpa, and S. matsudana) were selected for phylogenetic analysis. These species could be divided into four groups: herbaceous monocots (O. sativa and Z. mays), herbaceous dicots (A. thaliana and S. lycopersicum), shrubs (S. suchowensis and S. purpurea), and arbors (P. euphratica, P. trichocarpa, and S. matsudana). The basic biosynthetic pathway of monolignol has been described previously[27,28]. Eleven gene families encoding enzymes of phenylpropanoid biosynthesis that participate in monolignol synthesis are as follows: PAL, C4H, 4CL, HCT, C3H, CSE, COMT, F5H, CCoAOMT, CCR, and CAD. These 11 gene families were used for phylogenetic analysis. Sequence searches at the Pfam database were used to confirm the presence of specific protein domains in the predicted proteins of these genes (Supplemental Table S3).

      The number of genes in each family differed markedly among the nine species (Table 1). The total number of monolignol biosynthesis genes in each species ranged from 44 to 192. A. thaliana had the lowest number, consistent with its small and simple genome. The total number of lignin-related genes in S. matsudana was nearly twice that of the Populus species or the shrub willows. This may be because S. matsudana is a tetraploid, and the other species are diploids. As expected, herbaceous plants contained fewer lignin biosynthesis genes than woody plants in the angiosperms. The total gene numbers in herbaceous monocots and herbaceous dicots were similar. It is not surprising that S. lycopersicum contained more lignin-related genes than Z. mays, O. sativa, and A. thaliana, as S. lycopersicum encompasses both herbaceous and woody varieties. However, the Populus species and the shrub willows contained similar numbers of genes, with S. purpurea containing more genes than the Populus species. These results indicate that woody plants contain substantially more lignin biosynthesis genes than herbaceous plants in the angiosperms, but there is little difference in lignin-related gene numbers between shrubs and arbors.

      Table 1.  Numbers of genes in 11 lignin biosynthesis gene families from nine plant species.

      SpeciesPALC4H4CLHCTC3HCSECOMTF5HCCoAOMTCCRCAD
      Zea mays947162312286
      Oryza sativa849515122126
      Arabidopsis thaliana418317124211
      Solanum lycopersicum10392355511329
      Salix suchowensis44101928843623
      Salix purpurea6411297128631026
      Populus euphratica54102227835428
      Populus trichocarpa531128281035733
      Salix matsudana99214841720891136
      Salix matsudana At2382226622320
      Salix matsudana Bt661121171246115

      We compared the average gene number in each family among herbaceous monocots, herbaceous dicots, shrubs, and arbors. To minimize the effect of tetraploidy in S. matsudana, we divided its genes into the At and Bt sub-genomes based on their homology to P. trichocarpa genes, as previously reported[13]. As expected, several families showed marked differences in gene number among the four plant groups (Fig. 1). The HCT, COMT, CAD, F5H, 4CL, and CSE families contained more members in woody plants than in herbaceous dicots. The COMT, HCT, CAD, F5H, 4CL, and CSE gene families may therefore have undergone expansion or contraction during the evolution of herbs and woody plants in angiosperms, whereas other families showed no such trends.

      Figure 1.  The average gene number in each lignin biosynthesis gene family in herbaceous monocots, herbaceous dicots, shrubs, and trees.

    • We performed phylogenetic analyses for each gene family to study the evolutionary relationships among lignin biosynthesis-related genes. Based on phylogenetic trees of sequences from herbaceous monocots, herbaceous dicots, shrubs, and arbors, the 11 gene families could be divided into two classes that differed in their apparent evolutionary histories (Fig. 2). Class I contained “differentiated gene families” and could be further divided into two sub-classes, Ia and Ib. Class Ia contained the PAL, C4H, and CCoAOMT families, and genes from monocots and dicots could be clearly distinguished. Genes in Class Ia appeared to have differentiated since the divergence of monocots and dicots. Class Ib contained the CCR and C3H gene families. Genes in this class were divided into three lineages belonging to the herbaceous monocots, the herbaceous dicots, and woody plants. These gene families appeared to have differentiated twice since the appearance of woody plants and monocots. Class II contained “expanded gene families,” as genes from the nine species could be detected in nearly all sub-groups of these gene families (Fig. 2). The HCT, 4CL, CSE, CAD, F5H, and COMT families belonged to this class. This prompted us to propose the following hypothesis: most duplication-induced new genes regulate lignin composition rather than lignin yield. As plants differentiated into herbaceous and woody plants, some lignin-related gene families may have expanded to produce a much more complex metabolic network.

      Figure 2.  Phylogenetic trees of lignin biosynthesis genes from A. thaliana, O. sativa, Z. mays, S. lycopersicum, S. suchowensis, S. purpurea, P. euphratica, P. trichocarpa, and S. matsudana. The trees were constructed using MEGA X with the neighbor-joining method and 1000 bootstrap replicates. Different colored lines indicate different groups of gene families.

      To further explore this hypothesis, we compared the functions of lignin biosynthesis genes in Classes I and II. Gene families in Classes Ia and Ib catalyze the conversion of phenylalanine to monolignol, whereas most gene families in Class II regulate the production of specific monolignol varieties such as hydroxy-coniferyl alcohol and sinapyl alcohol. These results indicate that the Class I gene families may contribute in ensuring the completion of the monolignol biosynthesis pathway. By contrast, the expanded gene families of Class II contribute to abundant lignin production. However, there are some exceptions: CCoAOMT in Class Ia and 4CL and CAD in Class II. A possible explanation is that other pressures may affect the evolution of these gene families. Given the low number of Class II genes in herbaceous plants, woody plants may produce more G-type and S-type lignin, consistent with a previous report that H-type lignin is more prevalent in herbaceous than woody plants in angiosperms[29].

      Two sub-classes could be defined within Class I. They differed in the position of the herbaceous dicot sequences, which were grouped with those from woody plants in Class Ia but grouped separately in Class Ib. It was clear that Class Ib further differentiated as dicots evolved. However, none of the 11 gene families could distinguish shrubs from arbors. This may be due to the lignin biosynthesis pathways in shrubs and arbors being similar.

    • We mapped all identified lignin-related genes onto the genome of S. matsudana. 166 of the 192 genes could be mapped onto 36 chromosomes, and none were located on chromosomes A07 and A13 (Fig. 3). Chromosomes B16 and A16 contained the largest number of lignin-related genes. The At and Bt sub-genomes contained 76 and 90 lignin-related genes, respectively. However, 26 lignin-related genes could not be mapped onto the 38 S. matsudana chromosomes. Most genes in the PAL, F5H, C4H, COMT, and CCoAOMT families were located in the Bt sub-genome rather than the At sub-genome, whereas the At sub-genome contained more CAD and CCR genes. Among the 11 gene families, the CAD and HCT families had the largest number of members. Most CAD and HCT genes were present as tandem duplicates: HCT duplicates were found on chromosomes A08, B08, and B18, and CAD duplicates were found on chromosomes A11 and B11. We also identified a COMT cluster on chromosome B16, which may explain the abundance of COMT genes in the Bt sub-genome. Most homologous chromosomes exhibited good collinearity, although a few genes did not have corresponding alleles on homologous chromosomes. For example, CCR and CSE genes were detected on chromosome A01 but not B01, whereas C4H and CCoAOMT genes were identified on chromosome B02 but not A02. 17 and 31 genes were unique to the At and Bt sub-genomes, respectively. Based on the phylogenetic analysis of S. suchowensis, S. purpurea, and S. matsudana, 18 of these 48 genes occurred as tandem duplicates, such as COMT on chromosome B16 and HCT on chromosome B18. However, due to a lack of genomic information about their diploid ancestors, we could not determine whether these genes appeared before or after the appearance of S. matsudana.

      Figure 3.  Distribution of lignin biosynthesis genes on S. matsudana chromosomes. The scale indicates megabases (Mb). Different gene families are indicated by different colors, and hotspots for lignin biosynthesis genes are marked in gray on the left of the chromosomes.

    • To further understand the relationship between lignin biosynthesis and tree height (TH). We performed RNA-seq on stem terminals (0–5 cm long) from two parents (‘Yanjiang’ and ‘9901’) and two F1 progeny (‘FS’ and ‘FH’) to quantify the expression of lignin-related genes during primary stem growth. The expression levels of all 192 lignin-related genes were normalized to FPKM values. 117 genes had an FPKM > 1.0 in at least one biological replicate, and these genes were used for subsequent analysis. 23 of the 117 genes (19.66%) were differentially expressed between tall and short plants.

      Fig. 4 presents the expression profiles of 11 lignin-related gene families in S. matsudana stem terminals, based on the reference lignin biosynthesis model of Arabidopsis and Populus (Fig. 4). Overall, higher levels of lignin-related gene expression were observed in short plants compared with tall plants of S. matsudana. We identified 5, 4, 2, 2, 4, 1, 1, and 4 DEGs in the 4CL, CAD, CCoAOMT, CCR, COMT, CSE, F5H, and HCT gene families, respectively. Four of the 23 DEGs were selected for qRT-PCR verification of differential gene expression, and the qRT-PCR results were consistent with the RNA-seq results (Supplemental Fig. S1). Four of the DEGs were expressed at a higher level in tall plants, and the other 19 DEGs were expressed at a higher level in short plants. With the exception of the HCT family, most gene families showed greater expression in short plants. We then measured the lignin content of the four genotypes. ‘9901’ and ‘FH’ contained less lignin than ‘Yanjiang’ and ‘FS’ (Table 2). The correlation between tree height and lignin content was r = −0.62 (P < 0.05). Based on the reference lignin biosynthesis model, we can infer that the expression levels of most lignin-related genes are negatively correlated with height (Fig. 5). By contrast, most HCT genes showed higher expression in the taller genotypes ‘9901’ and ‘FH’.

      Figure 4.  Expression profiles of 11 lignin biosynthesis gene family members in the two parents and two progenies. Red indicates a higher expression level, and blue indicates a lower expression level. Expression data are presented as FPKM values calculated in R.

      Table 2.  Tree height and lignin content of ‘Yanjiang’, ‘FS’, ‘9901’, and ‘FH’.

      Height (cm)Lignin content (mg/g)
      Yanjiang120.19 ± 7.671011.11 ± 205.52
      FS71.93 ± 5.13633.33 ± 50.41
      9901153.69 ± 2.47341.11 ± 15.48
      FH187.84 ± 3.24255.56 ± 38.90

      Figure 5.  Gene expression network for monolignol biosynthesis in the stem terminals of S. matsudana. Transcript abundance (log2[FPKM+1]) in ‘Yanjiang’, ‘FS’, ‘9901’, and ‘FH’ is indicated by a color gradient.

    • We identified cis-acting regulatory DNA elements within the promoter regions of all lignin-related genes using the PlantPAN database and ExactSearch. The gene promoters contained numerous DNA elements that are predicted to be bound by AP2, ARF, MADS-box, bZIP, NAC and MYB transcription factors (Fig. 6a). A total of 23 elements (motifs) were detected in the 11 families. As shown in Fig. 6a, the motifs showed little preferential distribution among different gene families. Genes from the same families often contained different motifs, and MYB, MADS-box, bZIP, AP2, and TCR-related motifs were detected on nearly all genes at a high frequency. The frequency of MYB- and AP2-related motifs showed substantial variation, and some motifs were not detected on a portion of the genes, such as FAR-and GRAS-related motifs. Interestingly, bZIP, MADS-box, and MYB related motifs were found in the upstream regions of nearly all 42 possible differentiating xylem–specific lignin genes, indicating that these motifs may participate in the regulation of lignin biosynthesis (Fig. 6b). This result is consistent with previous studies in which ethylene, cytokinin, and auxin-related transcription factors were reported to regulate lignin biosynthesis[3032].

      Figure 6.  Motif analysis of promoter regions of the lignin biosynthesis genes. (a) Motif analysis of promoter regions from all S. matsudana lignin biosynthesis genes. (b) Motif analysis of promoter regions from differentiating xylem–specific lignin biosynthesis genes.

    • Fast growth is a complex quantitative trait that is controlled by environmental factors such as light, temperature, water, and fertilizer, although the effect of genetic variation is much greater. Among the traits related to fast growth in forest trees, height and diameter growth at breast height (DBH) are the most important. Tree height is determined primarily by the vessels, which are mainly composed of lignin, cellulose and hemicellulose in the secondary cell walls. Although the lignin biosynthesis pathway has been previously described[27,28], differences in lignin biosynthesis between herbaceous and woody plants remains unclear. To further understand lignin biosynthesis in S. matsudana, we identified a total of 192 lignin biosynthesis genes in its genome and performed phylogenetic analyses using genes from nine angiosperm species, including herbaceous monocots, herbaceous dicots, shrubs, and arbors.

      In this study, 11 gene families putatively related to monolignol biosynthesis were identified. Some of these 11 families may also function in flavonoid metabolism. For example, PAL, C4H, CCoAOMT, and 4CL also regulate the biosynthesis of flavonoids and terpenoid quinones[33].

      We could divide the genes of the monolignol biosynthesis pathway into two categories (Fig. 5). Genes in the first category catalyzed reactions that produced monolignol from phenylalanine, and most genes in Classes Ia and Ib could be placed in this category. Genes in the second category were involved in the production of specific monolignol varieties. Their encoded enzymes do not directly catalyze the production of monolignol from phenylalanine. Our phylogenetic analysis indicated that gene duplication had occurred primarily in Class II, which also contained more genes in woody plants than in herbaceous plants. These results are consistent with our hypothesis that expansions have occurred primarily in the families that participate in the regulation of lignin composition. However, CCoAOMT in Class Ia and 4CL and CAD in Class II do not seem to conform to this hypothesis. This may reflect the fact that the evolution of these gene families is also influenced by other pressures.

      Gene duplication is a major mechanism for the generation of new genes. After gene duplication, some duplicated genes undergo neofunctionalization, whereas others maintain largely redundant functions[34]. Duplicated genes exhibit various degrees of functional diversification in plants[35]. With the appearance of monocotyledons and dicotyledons, genes in Class I underwent differentiation. With the appearance of woody plants, Class Ib was further differentiated. For Class II, almost all gene families duplicated to generate new genes. However, more experiments or mutants are needed to infer the evolutionary fate of these diversified duplicates.

      The remaining phylogenetic analysis showed that shrubs and arbors could not be clearly separated. The height and diameter of arbors are greater than those of shrubs, but their lignin biosynthesis genes are similar, although differences in the regulation of these genes may cause differences between the two growth forms.

    • Based on the sequenced genome of S. matsudana, 166 of the 192 identified lignin-related genes were located on 36 chromosomes. Gene numbers in the At and Bt sub-genomes were not equal. At and Bt contained 39.58% (76/192) and 46.88% (90/192) of all lignin genes, respectively. The At sub-genome was identified as homologous to the P. trichocarpa genome and contained fewer Class II genes than Bt. This result indicates that lignin components of S. matsudana and P. trichocarpa may differ. The At and Bt sub-genomes contained 8 and 11 DEGs, respectively. Several gene families were detected as duplicated on some chromosomes, such as HCT on chromosomes A08, B08, and B18, CAD on chromosomes A11 and B11, and COMT on chromosome B16. In summary, the Bt sub-genome contained more duplicated genes than the At sub-genome.

      In this study, we also identified seven hotspots for lignin biosynthesis genes on chromosomes A01, A08, A11, B08, B11, B16, and B18 (Fig. 3). Most of the hotspots were located on the ends of chromosomes. Hotspots on chromosomes A01, A08, and A16 contained no less than three families. A comparison of the two sub-genomes would provide evidence on the evolution of Salix from diploid to tetraploid.

    • Tree height is mainly determined by primary growth. In a region extending down about 60 mm from the top of the tree, the stem begins to thicken gradually, and the secondary growth of vascular tissues begins[36,37]. We therefore sampled stem terminals (0–5 cm) from two parents and two F1 progeny of contrasting heights and performed RNA sequencing to detect the expression of lignin biosynthesis genes during primary growth. Nearly 60% (117/192) of the lignin-related genes were expressed at levels > 1.0 FPKM in the stem terminals, and 23 of these genes were differentially expressed between genotypes of different heights, suggesting that they may affect tree height by regulating lignin biosynthesis. qRT-PCR was performed to confirm the differential expression of the DEGs. Most of these genes were downregulated to suppress the biosynthesis of lignin in plants with tall genotypes (Fig. 5). However, in contrast to the other gene families, most differentially expressed HCT genes showed higher expression levels in tall plants. HCT catalyzes the conversion of coumaroyl-CoA to coumaroyl shikimic acid/coumaroyl quinic acid and the conversion of caffeoyl shikimic acid/caffeoyl quinic acid to coumaroyl-CoA. G subunits and S subunits are major components of lignin in dicots. Based on the reference lignin biosynthesis model, we can infer that the high expression of COMT promotes a high S/G ratio, which is currently used as a measurement standard in the paper industry. By comparing the transcriptomes of tall and short S. matsudana, we identified 23 DEGs involved in lignin biosynthesis. These genes may suppress plant height by promoting high lignin levels and regulating the S/G ratio. The high expression levels of the DEGs may also alter lignin content. In the paper industry, lower lignin contents and higher S/G ratios facilitate paper production[38]. However, further experiments are needed to determine whether the S/G ratio is associated with tree height.

      Lignin is synthesized in the secondary cell walls of vessel cells and fiber cells and provides transporting function and mechanical support, respectively[5]. Previous studies have shown that the suppression of lignin biosynthesis in vessels can reduce plant growth, whereas the suppression of lignin biosynthesis in xylem fibers can improve biomass production without affecting plant growth[3942]. Previous studies have reported that several transcription factors and hormones can regulate lignin biosynthesis[30,43], and we therefore analyzed cis-elements in the 23 DEG promoters to better understand their regulation.

      Previous studies found that the NAC-MYB module could regulate the expression of lignin biosynthesis genes[44]. In this study, we found that cis motifs were not distributed equally in genes from the same family, indicating that the regulation of these genes may also differ. Several hormone-related (auxin, ethylene, and cytokinin) transcription factor binding sites were identified in the promoter regions of the 23 DEGs. Previous studies have shown that trees have a radial gradient of auxin concentration within their secondary vascular tissues, with a maximum concentration in the cambium[32]. Unlike auxin, the distribution of cytokinins within secondary vascular tissues shows a peak in the developing phloem, although the two patterns partially overlap[31]. Tissue-specific transcriptome analysis in poplar showed that several genes related to auxin and cytokinin signaling and transport were differentially expressed during the regeneration of secondary vascular tissues, indicating that hormone distributions change during this process[45].

      A series of studies have shown that ethylene also participates in the growth and differentiation of cambial cells and affects the secondary growth of stems[4648]. For example, in poplar, many ethylene biosynthesis enzymes and regulatory genes are expressed in woody tissues[30]. It has also been demonstrated that gravity can regulate ethylene content and cell division in cambium meristems, thereby affecting the formation of poplar tension wood[49,50].

    • We identified monolignol biosynthesis genes in the genome of S. matsudana. By comparing lignin-related gene families in nine plant species, we identified two patterns of gene family evolution in herbaceous monocots, herbaceous dicots, shrubs, and arbors. RNA-seq analysis from two parents and two F1 progeny enabled us to identify 23 DEGs that may participate in the regulation of accelerated growth. We also analyzed the distribution of lignin biosynthesis genes within the At and Bt genomes of S. matsudana. These results provide an important foundation for further studies on the molecular mechanisms and genetic regulation of lignin biosynthesis and the accelerated growth of forest trees.

    • This research was supported by grants from the National Natural Science Foundation of China (31971681), the Natural Science Foundation of Jiangsu Province (BK20200963), the Nantong University Scientific Research Start-up Project for Introducing Talents (135419609070), and the Jiangsu Provincial Key Projects of Students Innovation and Entrepreneurship Training Program (2020010304020Z).

      • The authors declare that they have no conflict of interest.
      • Copyright: © 2021 by the author(s). Exclusive Licensee Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (6)  Table (2) References (50)
  • About this article
    Cite this article
    Liu G, Li Y, Liu Y, Guo H, Guo J, et al. 2021. Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth. Forestry Research 1: 8 doi: 10.48130/FR-2021-0008
    Liu G, Li Y, Liu Y, Guo H, Guo J, et al. 2021. Genome-wide identification and analysis of monolignol biosynthesis genes in Salix matsudana Koidz and their relationship to accelerated growth. Forestry Research 1: 8 doi: 10.48130/FR-2021-0008

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return