Integration analysis of full-length transcriptomics and metabolomics provides new insights into the mechanism of sex differentiation in buffalograss (<i>Buchloe dactyloides</i>)

Jin Guan; Yuesen Yue; Shuxia Yin; Wenjun Teng; Hui Zhang; Haifeng Wen; Juying Wu; Ke Teng; Xifeng Fan; Jin Guan; Yuesen Yue; Shuxia Yin; Wenjun Teng; Hui Zhang; Haifeng Wen; Juying Wu; Ke Teng; Xifeng Fan

doi:10.48130/GR-2023-0024

The study of sexual and evolutionary differences has long been imperative in the field of biology. Unlike animals, dioecious angiosperms account for only about 6% of the total. Buffalograss (Buchloe dactyloides) plays a vital role in environmental restoration, creating economic benefits and promoting the high-quality development of the grassland and turf industries. Its natural populations contain differing ratios of male, female, and monoecious plants. The value of buffalograss for studying the sex differentiation mechanism in plants cannot be ignored. However, few studies have investigated transcript annotation and complete mRNA structure in B. dactyloides, and the pathways of species-specific factors in sex differentiation remain unknown. We integrated the full-length transcriptome, second-generation transcriptome, and metabolome to specify candidate factors influencing sex differentiation. We identified 110,870 full-length transcripts and obtained 100,362 (90.52%) transcript and annotation information. Then we identified 49,448 differentially expressed genes (DEGs) and 3,070 differentially accumulated metabolites (DAMs) in female, male, and monoecious leaf samples. The co-enrichment analysis indicated that sexual differentiation was regulated by glutathione metabolism, photosynthesis, plant hormone biosynthesis, catabolism, and signaling. The identification of DEGs and DAMs that participate in glutathione metabolism, photosynthesis, abscisic acid (ABA), cytokinin (CTK), and gibberellin (GA) biosynthesis, catabolism, and signaling has helped illuminate the roles of plant hormones in the sex differentiation of B. dactyloides. The full-length transcriptomic data will facilitate additional studies on functional genes. Integration of transcriptomic and metabolomic data advances knowledge of the molecular mechanism of sex differentiation and provides information for B. dactyloides breeding programs.

HTML

Introduction

Unlike animals, dioecious angiosperms account for only about 6% of the total.^[1]. In many cases, dioecious plants provide better tools for understanding the genetics of sex determination than dioecious animals^[2]. Buffalograss (Buchloe dactyloides (Nutt.) Engelm.) is a perennial, stoloniferous, low herbaceous, and low-input ecological grass^[3]. Buffalograss is a dioecious plant, and a few are hermaphroditic. Buffalograss comprises polyploid polymorphisms (x = n = 10), including diploid, tetraploid, pentaploid, and hexaploid individuals^[4]. The turf quality of the female plant in B. dactyloides is better than that of the male as when the male plant enters the flowering stage, there is a distinct yellow color due to the flower shaft being higher than the bush, and the female plant does not have such a problem^[5]. It was reported that there are no sex-determining heteromorphic chromosomes, and sex determination is linked to gene expression and gibberellin-induced morphological alterations in buffalograss^[4]. Studies on B. dactyloides have focused on variety breeding and improvement^[6,7], physiological differentiation in response to biotic and abiotic stress, and phenotypic identification. However, the transcriptional information of B. dactyloides, especially in complete mRNA sequences, alternative splicing (AS) events, and long non-encoding RNAs (lncRNAs), is still largely unknown. Moreover, the large number of unannotated transcripts limits the research on the mechanisms of sex differentiation of B. dactyloides.

Plants have complex transcriptional processes; transcriptome research is critical to understanding plant life processes^[8]. The RNA-seq technology, based on Illumina sequencing platforms, has significantly higher sequencing depth and a reduced error rate, resulting in substantially higher genome coverage^[9], allowing us to better comprehend critical gene expression levels and regulatory mechanisms^[10]. However, RNA-seq data cannot systematically and accurately collect or assemble complete transcripts or identify transcripts of isoform, homologous, superfamily, or allele expression, making it insufficient to understand the transcriptome and underlying biological processes. Instead of interrupting RNA fragments, full-length transcriptomes employing PacBio single-molecule real-time (SMRT) sequencing platforms can directly reverse full-length cDNA. Ultra-long reads (median 10 kb) from the platform contain a single complete transcript sequence^[11,12]. Assembly is not required for post-analysis, and what is measured is^[13]. Moreover, the response to metabolites is more specific than the general response seen at the transcript level^[14]. At present, comprehensive investigations of alterations in metabolite levels remain rare. It is particularly true for the joint and parallel analysis of sex differentiation at two levels of genome information processing: the transcriptomic and the metabolomic. However, the integrative analysis of metabolites and transcripts helps identify genes and small molecules involved in critical biological processes^[15,16]. We completed the full-length transcriptome sequencing analysis using the transcriptome analysis methods. We explored the complete mRNA structure features of B. dactyloides and completed the identified transcript function annotation. We further investigated the underlying transcriptional mechanism of B. dactyloides sex differentiation by integrating Illumina RNA-seq data and metabolomic data. This work will effectively provide helpful information about the transcriptome of B. dactyloides. We expect it will offer essential resources for further research on sex differentiation, functional gene identification, and variety breeding studies in B. dactyloides.

Materials and methods

Plant materials and RNA sample preparation

Buchloe dactyloides (cv. Texoka) were vegetatively propagated in an intelligent greenhouse at the Beijing Academy of Agricultural and Forestry Sciences (Beijing, China). In April 2022, we harvested female, male, and monoecious plants' roots, stolon, flowers, and leaves as materials. All samples were frozen in liquid nitrogen and stored at −80 °C for further investigations. To identify as many transcripts as possible, we extracted the total RNA from all organ materials of buffalograss for SMRT sequencing. However, in practical application, we completed sex identification before blooms. We selected leaves of female, male, and monoecious plants for Illumina and metabolite sequencing. First, we extracted total RNA using the plant RNA kit (Omega, GA, USA). Second, the purity, concentration, and integrity of total RNA were assessed using ND-1000 spectrophotometer (NanoDrop Technologies, DE, USA) and 2100 bioanalyzer (Agilent Technologies, CA, USA).

Full-length library construction and PacBio SMRT sequencing
We mixed RNA from female, male, and monoecious plants' roots, stolons, flowers, and leaves into one sample. Following the qualification of the sample, a full-length library was constructed with no size selection. After the full-length library passed quality control, we performed full-length transcriptome sequencing based on the PB PacBio Sequel II platform. Then full-length transcriptome sequencing data were analyzed from raw reads in three steps^[17]. First, we extracted and polished circular consensus (CCS) sequences using full passes ≥ three and sequence accuracy > 0.9. Next, we classified the full-length transcripts by detecting whether 5' and 3' cDNA primers and poly(A) tails in CCS sequences. Finally, we classified full-length non-chimeric (FLNC) transcripts to recognize consensus isoforms, high-quality isoforms (accuracy > 0.99), and low-quality isoforms using the SMRTLink software. We used CD-HIT v4.6.1 software^[18] to merge highly similar sequences and remove redundancy. The low-quality consensus isoforms were polished based on the corresponding Illumina sequencing data using proovread software^[19].

Full-length transcriptome structure analysis
We conducted the structure analysis with non-redundant FLNC transcripts identified by the full-length transcriptome, including CDS prediction, AS events, and SSR identification and analysis. We used TransDecoder (v5.0.0) software^[20] to predict the reliable CDS and their corresponding protein sequences of the transcript. The AS events prediction method referred to the report following the three alignments^[21]. We screened transcripts with a length > 1,000 bp and completed SSR analysis utilizing MISA software^[22].

Identification of regulatory proteins, lncRNAs and lncRNA target
We used iTAK software^[23] with default parameters to predict regulatory proteins, including transcription factors (TFs), transcriptional regulators (TRs), and protein kinases (PKs) in B. dactyloides. We filtered out the putative protein-coding RNAs before lncRNAs prediction. We used CPC^[24], CNCI^[25], CPAT^[26], and Pfam^[27], to screen > 200 bp transcripts in length that contain two or more exons. The intersection of the four sets was applied in the following lncRNA analysis. In addition, RNA targets of lncRNAs were predicted using LncTar software^[28].

Function annotation of transcripts
To acquire function annotation information, we compared the non-redundant transcripts to non-redundant protein sequences (NR), Swissprot^[29], Gene Ontology (GO)^[30], Clusters of Orthologous proteins Groups (COG)^[31], Eukaryotic Ortholog Groups (KOG)^[32], Pfam, Kyoto Encyclopedia of Genes and Genomes (KEGG)^[33], and eggNOG^[34] databases utilizing DIAMOND software. Moreover, the protein sequences of transcripts were blasted (p-value < 1E-5) against the TrEMBL database based on the Swissprot database to obtain the annotation information.

Illumina cDNA library construction and sequencing
Nine cDNA libraries (three biological replicates for female, male, and monoecious plants) were constructed for Illumina sequencing. After verifying the quality of nine libraries, we used the Illumina NovaSeq6000 platform (San Diego, CA, USA) to perform Illumina sequencing. The specific sequencing data quality controls were: cut the sequencing adapter and the primer sequence in reads; filtrate the low-quality reads to ensure the read quality. Finally, we obtained clean data that is sufficient for subsequent accurate analysis. We further calculated GC content, Q30, and sequence duplication levels of clean data.

Quantification of transcript expression and analysis of differential expression transcripts
We used non-redundant transcript sequences identified from PacBio SMRT as a reference and applied NGS reads to them to acquire transcript position information using STAR^[35] software. The method of quantification of transcript expression levels was as follows: first, we used Kallisto^[36] software to compare NGS reads with transcripts generated from PacBio SMRT and count directly; then, we calculated transcript expression levels using fragments per kilobase of transcript per million fragments mapped (FPKM). To evaluate the reproducibility of biological replicates, we set the Pearson Correlation Coefficient (R²) as the correlation index of samples.

Then we used DESeq2^[37] software to identify differential expressions with replicates. We detected DEGs using the threshold: fold change (FC) of not less than two and a false discovery rate (FDR) of less than 0.01. We adjusted the p-value using the Benjamini-Hochberg method. Volcano Plot assessed the differences in transcript expression between two samples and their corresponding significance. Moreover, GO and KEGG functional annotation and enrichment analysis on DEGs we performed.

qRT-PCR analysis
DEGs involved in abscisic acid (ABA) metabolic and signaling pathways (BdABA1, BdAOG, and BdSAPK8), gibberellin (GA) metabolic and signaling pathways (BdKAO, BdGID1, and BdDWARF8), cytokinin (CTK) related pathways gene (BdORR9), photosynthesis-related pathways (BdPsbP which encodes PSII reaction center subunit and PetC which encodes cytochrome b6/f complex subunits), and one bHLH transcription factor bHLH137 related to sex differentiation were subjected to qRT-PCR analysis. We reverse-transcribe RNA to cDNA using PrimeScriptTM Reverse Transcriptase (TaKaRa, Dalian, China). The primers used in the study were designed using Primer Premier 5 and listed in Supplemental Table S1. qRT-PCR analysis was performed on a Bio-Rad CFX Connect™ RealTime System using SYBR® Premix Ex Taq™ II (TaKaRa, Dalian, China). Three biological replicates were used for all gene expression analyses. We calculated the relative expression level of genes using the 2^−ΔΔCᴛ method.

Metabolites extraction and LC-MS/MS analysis
Three experimental groups (male, female, and monoecious plants) contained six samples per group harvested for sequencing and separate analyses. We performed metabolomics experiments using the Waters UPLC Acquity Ⅰ-Class PLUS and UPLC Xevo G2-XS QTOF. Then we analyzed samples using the Waters Acquity UPLC HSS T3 column (1.8 um, 2.1 mm × 100 mm) in positive and negative codes^[38]. The mobile phases were 0.1% formic acid aqueous solution (A) and 0.1% formic acid acetonitrile (B), and the injection volume was 1.0 μL. We acquired primary and secondary mass spectrometry data in MSe mode using MassLynx V4.2 software (Waters). Dual-channel data acquisition, including low and high collision energies, can be simultaneously accomplished during each data acquisition cycle. The low and high collision energies were 2 V and 10–40 V, respectively, and the scanning frequency was 0.2 s for a mass spectrum. The parameters of the ESI ion source were according to a previously described report^[38]. We processed raw data for peak extraction and alignment and completed identification based on the online METLIN and Biomark's self-built databases using Progenesis QI software. In addition, we identified the theoretical fragment; the mass deviation of the parent ion and fragment ion is less than 100 ppm and 50 ppm, respectively^[38].

Quantification of metabolites and analysis of differential metabolites
We used the R package (ropls) to perform orthogonal projections to latent structures-discriminant analysis (OPLS-DA) modeling between groups, and we completed 200 permutation tests to ensure the reliability of the model^[39]. We determined the model's variable importance in projection (VIP) value using multiple cross-validations. We compared the difference multiples based on the grouping and used the T-test to calculate the difference significance p-value of metabolites. We further identified DAMs using the threshold: p-value less than 0.05 and VIP value greater than one. The hypergeometric distribution test was used to calculate the DAMs of KEGG pathway enrichment significance^[40].

Data preprocessing and annotation, and correlation analysis between DEGs and DAMs
After normalizing the original peak area information with the total peak area, we evaluate the overall quality of the data. We evaluated the overall data quality using principal component analysis (PCA) and Spearman correlation analysis. We searched the classification and pathway information of identified compounds utilizing the KEGG^[33] database.

To calculate the correlation between all DEGs and DAMs, we preprocessed the data using the z-value transformation method and screened according to the correlation coefficient (CC) and p-value of correlation. The screening threshold was |CC| greater than 0.80 and CCP less than 0.05. We mapped DEGs and DAMs simultaneously to related pathways using MapMan software^[41].

Discussion

Buffalograss is a perennial warm-season grass species of Gramineae used commonly as pasture, ecological, or lawn grass^[42]. Buffalograss plays a vital role in environmental restoration, creating economic benefits and promoting the high-quality development of the grassland and turf industries^[3]. Buffalograss is a dioecious plant, and a few are monoecious. The visual and functional quality of the female plant in B. dactyloides is better than the male plant because when the male plant enters the flowering stage, there is a distinct yellow color due to the flower shaft being higher than the bush, and the female plant does not have such a problem^[5]. There are both dioecious and monoecious plants in buffalograss, providing great convenience for studying sex differentiation. The identification of sex differentiation candidate genes (e.g., male sterility) is an essential step to proclaim the practical value of B. dactyloides and may provide necessary information for sex differentiation-related pathway elucidation.

The missing genome sequence and full-length cDNAs of B. dactyloides limit our research. This study first provided the full-length transcriptome of B. dactyloides and completed 90.52% of transcript function annotation using PacBio SMRT technology. Recent studies have shown that long non-coding RNAs (lncRNAs) play critical functions in various biological processes^[43,44]. lncRNAs could interact with proteins, RNAs, and DNA to perform complex and diverse functions^[28]. Therefore, identifying lncRNAs and their targets helps understand the biological processes and mechanisms of plants. We predicted 7,682 lncRNAs and target genes for 6474 lncRNAs. AS allows the plant to enhance gene coding potential, regulate gene expression via different mechanisms, and play essential functions in plant development^[45,46]. We identified 4,091 AS events involved in 6,188 transcripts. Interestingly, 32 transcripts in AS events participate in the regulation of photoperiodism, flowering with a p-value of 1.08 × 10⁻⁸.

The correlation analysis of transcriptomic and metabolomic data better explains transcription regulation mechanisms in metabolic pathways^[47,48]. To characterize sex differentiation in the buffalograss transcriptome and its regulation, we measured and analyzed the transcriptional and metabolomic responses of B. dactyloides to sex differentiation. The most important pathways responding to sex differentiation were glutathione metabolism, photosynthesis, and plant hormone metabolism in B. dactyloides.

The study demonstrated that glutathione metabolism was the most significantly enriched pathway in response to sex differentiation. One hundred and fifty four genes and two metabolites (glutamate and GSSG) related to glutathione metabolism had altered contents. GCL, GSS, GSR, DHAR, and GST participate in glutathione metabolism to generate glutamate, cysteine, and glycine, which are involved in several critical cellular functions, such as scavenging H₂O₂ and protecting cells from oxidative stress^[47,49]. Based on our data, we can infer that glutathione metabolism was in response to sexual differentiation.

Notably, most DEGs were upregulated in male and female plants relative to monoecious plants, including those encoding PSⅡ proteins (such as PsbB and PsbO), cytochrome b6/F complex (such as PetA and PetC), electron transport proteins (such as PetE and PetJ), PSⅠ proteins (such as PsbB and PsbO), and ATP production proteins (gamma and delta). These genes govern light harvesting, electron transport, reduction-oxidation reactions, and ATP production^[41,50]. The increased ATP and Pi levels aligned with this result. The above results supported the positive functions of maintaining the photosynthetic activity of leaves in male and female plants.

Three hormones (ABA, GA, and CTKs) varied in metabolite content among the sexes. NCEDs are a crucial enzyme in ABA biosynthesis^[51]. We found the disruption of multiple NCEDs and the lowest abscisic alcohol and ABA levels in monoecious plants relative to female and male plants. Interestingly, we found the expression profile between NCEDs and ABA2/AAO3 in female, male, and monoecious plants was opposite, which suggests NCEDs, ABA2, and AAO3 were candidate genes involved in ABA homeostasis. ABA-GE, a stored form of ABA^[51], consistently performs with abscisic alcohol and ABA in response to sexual differentiation. The lowest ABA content and disruption of multiple AOGs and CYP707A were found in monoecious plants compared to female and male plants, which suggests that reduced ABA leads to reduced ABA catabolism. In addition, we found that ABA signal transduction was more active in female and male plants than in monoecious plants. The above results showed that ABA biosynthesis, catabolism, and signaling synergize in response to sexual differentiation.

The active GAs (mainly GA4) are essential for stamen development and control of pistil growth in Arabidopsis^[52]. The GA biosynthesis essential genes, KAO and GA20ox, are more active in male plants than female plants (adjusted p-value < 0.01). Furthermore, GA4 signal transduction was most active in female plants, followed by male plants, and inhibited in male plants. The above results showed that GA biosynthesis is inconsistent with GA signaling in response to sexual differentiation.

Compared with male and female plants, we found the overexpression of all three IPTs, the disruption of multiple CKXs, and the highest tZ and iP levels in monoecious plants. The results were in line with the previous report that tZ and iP levels are regulated and fine-tuned by IPTs and CKXs^[53]. In addition, tZ deactivation was associated with the up-regulation of CTK modification enzymes, UGT73C and UGT85A1, and the results corresponded with the previous report^[51]. The cytokinin receptor CRE1 and critical regulators (AHPs and B-ARRs) function in CTK perception and signaling^[53]. Surprisingly, there was little difference in the number of DEGs encoding the proteins CRE1, AHPs, and B-ARRs in female, male, and monoecious plants, which is likely due to compensatory changes in the expression of CTK signaling genes to maintain the appropriate level of CTK function^[53]. A-ARRs are considered CTK signaling negative regulators^[51,53]. Compared with female and monoecious plants, we found disruption of multiple A-ARRs and the lowest tZ levels in male plants. The results suggest that reduced tZ leads to reduced CTK signaling^[51]. The results above suggest that active CTKs and genes encoding IPTs, CKXs, UGT73C, UGT85A1, and A-ARRs involved in CTK metabolism and signaling play critical roles in response to sex differentiation.

Conclusions

Few studies have investigated transcript annotation and complete mRNA structure in B. dactyloides, and the pathways of species-specific factors in sex differentiation remain unclear. We performed full-length transcriptome, second-generation transcriptome, and metabolome analysis to specify candidate factors influencing sex differentiation. We first provided the full-length transcriptome of B. dactyloides, identified 110,870 full-length transcripts, and obtained 90.52% of transcript annotation information. Then we identified 49,448 DEGs and 3,070 DAMs in female, male, and monoecious leaf samples. The co-enrichment analysis indicated that sexual differentiation was regulated by glutathione metabolism, photosynthesis, plant hormone biosynthesis, catabolism, and signaling. The identification of DEGs and DAMs that participate in glutathione metabolism, photosynthesis, ABA, CTK, and GA biosynthesis, catabolism, and signaling has helped illuminate the roles of plant hormones in the sex differentiation of B. dactyloides. The full-length transcriptomic data will allow further functional gene studies. Integrating transcriptomic and metabolomic data advances knowledge of the molecular mechanism of sex differentiation and provides information on breeding programs for B. dactyloides.

Author contributions

The authors confirm contribution to the paper as follows: methodology, investigation, writing-original draft preparation, writing-reviewing and editing: Guan J; investigation, supervision, validation: Muye Liu; investigation: Yue Y; investigation, formal analysis, visualization: Yin S; investigation, validation: Teng W; investigation, validation: Zhang H, Wen H; resources, investigation, validation: Wu J; conceptualization, resources, writing-reviewing and editing, funding acquisition: Teng K; conceptualization, supervision, writing - review & editing: Fan X. All authors reviewed the results and approved the final version of the manuscript.

Supplemental Table S1 Primers used for qRT-PCR validation.
Supplemental Table S2 Prediction of Simple sequence repeats (SSRs) out of transcript datasets.
Supplemental Table S3 List of targets of lncRNA in B. dactyloides.
Supplemental Table S4 Summary of AS events in B. dactyloides.
Supplemental Table S5 Illumaina Sequencing Data Assessment.
Supplemental Table S6 Differentially expressed genes identified between different sample groups.
Supplemental Table S7 Differentially accumulated metabolites identified between different sample groups.
Supplemental Fig. S1 Summary of SMRT sequencing. (A) CCS read length distribution. (B) FLNC sequences read length distribution. (C) Consensus isoforms read length distribution.
Supplemental Fig. S2 The quality evaluation of transcriptomic and metabolomic data. PCA analysis of all transcripts (A) and metabolites (B), respectively. Heatmap analysis of correlation between samples in all transcripts (C) and metabolites (D), respectively.
Supplemental Fig. S3 KEGG enrichment of the up-regulated and down-regulated DEGs identified in Male vs Female (A), Monoecious vs Female (B), and Monoecious vs Male (C), respectively.
Supplemental Fig. S4 Prediction of regulatory proteins. A Number and family of top ten regulatory proteins predicted by SMRT sequencing data. B Venn diagram of differentially expressed regulatory proteins. C Number and family of top ten differentially expressed regulatory proteins. D-F The differentially expressed regulatory proteins predicted in Male vs Female, Monoecious vs Female, and Monoecious vs Male, respectively.
Supplemental Fig. S5 GO enrichment of sex-specific genes. (A) Genes present only in female plants. (B) Genes absent in female plants. (C) Genes present only in male plants. (D) Genes absent in male plants. (E) Genes present only in monoecious plants. (F) Genes absent in monoecious plants.
Supplemental Fig. S6 KEGG enrichment of sex-specific genes. (A) Genes present only in female plants. (B) Genes absent in female plants. (C) Genes present only in male plants. (D) Genes absent in male plants. (E) Genes present only in monoecious plants. (F) Genes absent in monoecious plants.
Supplemental Fig. S7 KEGG enrichment analysis of DAMs in the positive (left) and negative (right) ion mode in the comparison of Male vs Female (A), Monoecious vs Female (B), and Monoecious vs Male (C) sample groups.

[1]	Liao Q, Du R, Gou J, Guo L, Shen H, et al. 2020. The genomic architecture of the sex-determining region and sex-related metabolic variation in Ginkgo biloba. The Plant Journal 104:1399−409 doi: 10.1111/tpj.15009 CrossRef Google Scholar
[2]	Massonnet M, Cochetel N, Minio A, Vondras AM, Lin J, et al. 2020. The genetic basis of sex determination in grapes. Nature communications 11:2902 doi: 10.1038/s41467-020-16700-z CrossRef Google Scholar
[3]	Marshall C, Warnke S, Amundsen K. 2022. Simple sequence repeat marker development and diversity analysis in buffalograss. Crop Science 62:1373−82 doi: 10.1002/csc2.20725 CrossRef Google Scholar
[4]	Johnson PG, Kenworthy KE, Auld DL, Riordan TP. 2001. Distribution of buffalograss polyploid variation in the southern Great Plains. Crop Science 41:909−13 doi: 10.2135/cropsci2001.413909x CrossRef Google Scholar
[5]	Johnson PG, Riordan TP, Johnson-Cicalese JJ. 2000. Low-mowing tolerance in buffalograss. Crop Science 40:1339−43 doi: 10.2135/cropsci2000.4051339x CrossRef Google Scholar
[6]	Budak H, Shearman RC, Parmaksiz I, Gaussoin RE, Riordan TP, et al. 2004. Molecular characterization of buffalograss germplasm using sequence-related amplified polymorphism markers. Theoretical and Applied Genetics 108:328−34 doi: 10.1007/s00122-003-1428-4 CrossRef Google Scholar
[7]	Johnson PG, Riordan TP, Arumuganathan K. 1998. Ploidy level determinations in buffalograss clones and populations. Crop Science 38:478−82 doi: 10.2135/cropsci1998.0011183X003800020034x CrossRef Google Scholar
[8]	Stark R, Grzelak M, Hadfield J. 2019. RNA sequencing: the teenage years. Nature Reviews Genetics 20:631−56 doi: 10.1038/s41576-019-0150-2 CrossRef Google Scholar
[9]	Yan H, Sun M, Zhang Z, Jin Y, Zhang A, et al. 2023. Pangenomic analysis identifies structural variation associated with heat tolerance in pearl millet. Nature Genetics 55:507−18 doi: 10.1038/s41588-023-01302-4 CrossRef Google Scholar
[10]	Guan J, Yin S, Yue Y, Liu L, Guo Y, et al. 2022. Single-molecule long-read sequencing analysis improves genome annotation and sheds new light on the transcripts and splice isoforms of Zoysia japonica. BMC Plant Biology 22:263 doi: 10.1186/s12870-022-03640-7 CrossRef Google Scholar
[11]	Teng K, Teng W, Wen H, Yue Y, Guo W, et al. 2019. PacBio single-molecule long-read sequencing shed new light on the complexity of the Carex breviculmis transcriptome. BMC Genomics 20:789 doi: 10.1186/s12864-019-6163-6 CrossRef Google Scholar
[12]	Liu L, Teng K, Fan X, Han C, Zhang H, et al. 2022. Combination analysis of single-molecule long-read and Illumina sequencing provides insights into the anthocyanin accumulation mechanism in an ornamental grass, Pennisetum setaceum cv. Rubrum. Plant Molecular Biology 109:159−75 doi: 10.1007/s11103-022-01264-x CrossRef Google Scholar
[13]	Wenger AM, Peluso P, Rowell WJ, Chang PC, Hall RJ, et al. 2019. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nature Biotechnology 37:1155−62 doi: 10.1038/s41587-019-0217-9 CrossRef Google Scholar
[14]	Jozefczuk S, Klie S, Catchpole G, Szymanski J, Cuadros-Inostroza A, et al. 2010. Metabolomic and transcriptomic stress response of Escherichia coli. Molecular Systems Biology 6:364 doi: 10.1038/msb.2010.18 CrossRef Google Scholar
[15]	Bradley PH, Brauer MJ, Rabinowitz JD, Troyanskaya OG. 2009. Coordinated concentration changes of transcripts and metabolites in Saccharomyces cerevisiae. PLoS Computational Biology 5:e1000270 doi: 10.1371/journal.pcbi.1000270 CrossRef Google Scholar
[16]	Sun M, Yan H, Zhang A, Jin Y, Lin C, et al. 2023. Milletdb: a multi-omics database to accelerate the research of functional genomics and molecular breeding of millets. Plant Biotechnology Journal 21:2348−57 doi: 10.1111/pbi.14136 CrossRef Google Scholar
[17]	Sharon D, Tilgner H, Grubert F, Snyder M. 2013. A single-molecule long-read survey of the human transcriptome. Nature Biotechnology 31:1009−14 doi: 10.1038/nbt.2705 CrossRef Google Scholar
[18]	Li W, Godzik A. 2006. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658−59 doi: 10.1093/bioinformatics/btl158 CrossRef Google Scholar
[19]	Hackl T, Hedrich R, Schultz J, Förster F. 2014. proovread: large-scale high-accuracy PacBio correction through iterative short read consensus. Bioinformatics 30:3004−11 doi: 10.1093/bioinformatics/btu392 CrossRef Google Scholar
[20]	Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, et al. 2013. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nature Protocols 8:1494−512 doi: 10.1038/nprot.2013.084 CrossRef Google Scholar
[21]	Liu X, Mei W, Soltis PS, Soltis DE, Barbazuk WB. 2017. Detecting alternatively spliced transcript isoforms from single-molecule long-read sequences without a reference genome. Molecular Ecology Resources 17:1243−56 doi: 10.1111/1755-0998.12670 CrossRef Google Scholar
[22]	Beier S, Thiel T, Münch T, Scholz U, Mascher M. 2017. MISA-web: a web server for microsatellite prediction. Bioinformatics 33:2583−85 doi: 10.1093/bioinformatics/btx198 CrossRef Google Scholar
[23]	Zheng Y, Jiao C, Sun H, Rosli HG, Pombo MA, et al. 2016. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Molecular Plant 9:1667−70 doi: 10.1016/j.molp.2016.09.014 CrossRef Google Scholar
[24]	Kong L, Zhang Y, Ye Z, Liu X, Zhao S, et al. 2007. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Research 35:W345−W349 doi: 10.1093/nar/gkm391 CrossRef Google Scholar
[25]	Sun L, Luo H, Bu D, Zhao G, Yu K, et al. 2013. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic Acids Research 41:e166 doi: 10.1093/nar/gkt646 CrossRef Google Scholar
[26]	Wang L, Park HJ, Dasari S, Wang S, Kocher JP, et al. 2013. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Research 41:e74 doi: 10.1093/nar/gkt006 CrossRef Google Scholar
[27]	Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, et al. 2016. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Research 44:D279−D285 doi: 10.1093/nar/gkv1344 CrossRef Google Scholar
[28]	Li J, Ma W, Zeng P, Wang J, Geng B, et al. 2015. LncTar: a tool for predicting the RNA targets of long noncoding RNAs. Briefings in Bioinformatics 16:806−12 doi: 10.1093/bib/bbu048 CrossRef Google Scholar
[29]	Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, et al. 2003. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Research 31:365−70 doi: 10.1093/nar/gkg095 CrossRef Google Scholar
[30]	Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. 2000. Gene ontology: tool for the unification of biology. Nature Genetics 25:25−29 doi: 10.1038/75556 CrossRef Google Scholar
[31]	Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Research 28:33−36 doi: 10.1093/nar/28.1.33 CrossRef Google Scholar
[32]	Koonin EV, Fedorova ND, Jackson JD, Jacobs AR, Krylov DM, et al. 2004. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biology 5:R7 doi: 10.1186/gb-2004-5-2-r7 CrossRef Google Scholar
[33]	Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M. 2012. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Research 40:D109−D114 doi: 10.1093/nar/gkr988 CrossRef Google Scholar
[34]	Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, et al. 2016. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Research 44:D286−D93 doi: 10.1093/nar/gkv1248 CrossRef Google Scholar
[35]	Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, et al. 2013. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29:15−21 doi: 10.1093/bioinformatics/bts635 CrossRef Google Scholar
[36]	Bray NL, Pimentel H, Melsted P, Pachter L. 2016. Near-optimal probabilistic RNA-seq quantification. Nature Biotechnology 34:525−27 doi: 10.1038/nbt.3519 CrossRef Google Scholar
[37]	Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15:550 doi: 10.1186/s13059-014-0550-8 CrossRef Google Scholar
[38]	Wang J, Zhang T, Shen X, Liu J, Zhao D, et al. 2016. Serum metabolomics for early diagnosis of esophageal squamous cell carcinoma by UHPLC-QTOF/MS. Metabolomics 12:116 doi: 10.1007/s11306-016-1050-5 CrossRef Google Scholar
[39]	Thévenot EA, Roux A, Xu Y, Ezan E, Junot C. 2015. Analysis of the human adult urinary metabolome variations with age, body mass index, and gender by implementing a comprehensive workflow for univariate and OPLS statistical analyses. Journal of Proteome Research 14:3322−35 doi: 10.1021/acs.jproteome.5b00354 CrossRef Google Scholar
[40]	Yu G, Wang L, Han Y, He Q. 2012. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology 16:284−87 doi: 10.1089/omi.2011.0118 CrossRef Google Scholar
[41]	Yu G, Xie Z, Lei S, Li H, Xu B, et al. 2022. The NAC factor LpNAL delays leaf senescence by repressing two chlorophyll catabolic genes in perennial ryegrass. Plant Physiology 189:595−610 doi: 10.1093/plphys/kiac070 CrossRef Google Scholar
[42]	Shearman RC, Riordan TP, Johnson PG. 2004. Buffalograss. Warm‐Season (C4) Grasses 45:1003−26 doi: 10.2134/agronmonogr45.c31 CrossRef Google Scholar
[43]	Lee JT. 2012. Epigenetic regulation by long noncoding RNAs. Science 338:1435−39 doi: 10.1126/science.1231776 CrossRef Google Scholar
[44]	Di C, Yuan J, Wu Y, Li J, Lin H, et al. 2014. Characterization of stress-responsive lncRNAs in Arabidopsis thaliana by integrating expression, epigenetic and structural features. The Plant Journal 80:848−61 doi: 10.1111/tpj.12679 CrossRef Google Scholar
[45]	Reddy ASN, Marquez Y, Kalyna M, Barta A. 2013. Complexity of the alternative splicing landscape in plants. The Plant Cell 25:3657−83 doi: 10.1105/tpc.113.117523 CrossRef Google Scholar
[46]	Wang M, Wang P, Liang F, Ye Z, Li J, et al. 2018. A global survey of alternative splicing in allopolyploid cotton: landscape, complexity and regulation. New Phytologist 217:163−78 doi: 10.1111/nph.14762 CrossRef Google Scholar
[47]	Zhou F, Xu D, Xiong S, Chen C, Liu C, et al. 2023. Transcriptomic and metabolomic profiling reveal the mechanism underlying the inhibition of wound healing by ascorbic acid in fresh-cut potato. Food Chemistry 410:135444 doi: 10.1016/j.foodchem.2023.135444 CrossRef Google Scholar
[48]	Xu D, Lin H, Tang Y, Huang L, Xu J, et al. 2021. Integration of full-length transcriptomics and targeted metabolomics to identify benzylisoquinoline alkaloid biosynthetic genes in Corydalis yanhusuo. Horticulture Research 8:16 doi: 10.1038/s41438-020-00450-6 CrossRef Google Scholar
[49]	Schmitt B, Vicenzi M, Garrel C, Denis FM. 2015. Effects of N-acetylcysteine, oral glutathione (GSH) and a novel sublingual form of GSH on oxidative stress markers: a comparative crossover study. Redox Biology 6:198−205 doi: 10.1016/j.redox.2015.07.012 CrossRef Google Scholar
[50]	Knoop H, Gründel M, Zilliges Y, Lehmann R, Hoffmann S, et al. 2013. Flux balance analysis of cyanobacterial metabolism: the metabolic network of Synechocystis sp. PCC 6803. PLoS Computational Biology 9:e1003081 doi: 10.1371/journal.pcbi.1003081 CrossRef Google Scholar
[51]	Urano K, Maruyama K, Jikumaru Y, Kamiya Y, Yamaguchi-Shinozaki K, et al. 2017. Analysis of plant hormone profiles in response to moderate dehydration stress. The Plant Journal 90:17−36 doi: 10.1111/tpj.13460 CrossRef Google Scholar
[52]	Sun TP. 2008. Gibberellin metabolism, perception and signaling pathways in Arabidopsis. The Arabidopsis Book 6:e0103 doi: 10.1199/tab.0103 CrossRef Google Scholar
[53]	Kieber JJ, Schaller GE. 2014. Cytokinins. The Arabidopsis Book 12:e0168 doi: 10.1199/tab.0168 CrossRef Google Scholar

{{lists.name}}

Integration analysis of full-length transcriptomics and metabolomics provides new insights into the mechanism of sex differentiation in buffalograss (Buchloe dactyloides)

Abstract

Supplementary information

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors

Integration analysis of full-length transcriptomics and metabolomics provides new insights into the mechanism of sex differentiation in buffalograss (Buchloe dactyloides)

HTML

Plant materials and RNA sample preparation

Full-length library construction and PacBio SMRT sequencing

Full-length transcriptome structure analysis

Identification of regulatory proteins, lncRNAs and lncRNA target

Function annotation of transcripts

Illumina cDNA library construction and sequencing

Quantification of transcript expression and analysis of differential expression transcripts

qRT-PCR analysis

Metabolites extraction and LC-MS/MS analysis

Quantification of metabolites and analysis of differential metabolites

Data preprocessing and annotation, and correlation analysis between DEGs and DAMs

SMRT sequencing analysis and function annotation of transcripts

Full-length transcriptome structure analysis

Acquisition and mining of transcriptome data

Regulatory proteins dynamics during B. dactyloides sex differentiation

Sex-specific genes analysis

Acquisition and mining of metabolomic data

Correlation analysis of transcriptomic and metabolomic data

Glutathione metabolism and photosynthetic capacity changes in response to sex differentiation

Plant hormones in response to sex differentiation

Catalog

{{lists.name}}

Integration analysis of full-length transcriptomics and metabolomics provides new insights into the mechanism of sex differentiation in buffalograss (Buchloe dactyloides)

Abstract

Supplementary information

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors

Integration analysis of full-length transcriptomics and metabolomics provides new insights into the mechanism of sex differentiation in buffalograss (Buchloe dactyloides)

HTML

Plant materials and RNA sample preparation

Full-length library construction and PacBio SMRT sequencing

Full-length transcriptome structure analysis

Identification of regulatory proteins, lncRNAs and lncRNA target

Function annotation of transcripts

Illumina cDNA library construction and sequencing

Quantification of transcript expression and analysis of differential expression transcripts

qRT-PCR analysis

Metabolites extraction and LC-MS/MS analysis

Quantification of metabolites and analysis of differential metabolites

Data preprocessing and annotation, and correlation analysis between DEGs and DAMs

SMRT sequencing analysis and function annotation of transcripts

Full-length transcriptome structure analysis

Acquisition and mining of transcriptome data

Regulatory proteins dynamics during B. dactyloides sex differentiation

Sex-specific genes analysis

Acquisition and mining of metabolomic data

Correlation analysis of transcriptomic and metabolomic data

Glutathione metabolism and photosynthetic capacity changes in response to sex differentiation

Plant hormones in response to sex differentiation

Catalog

Export File

Citation

Format

Content