RNA sequencing of cleanly isolated early endosperms reveals coenocyte-to-cellularization transition features in maize

Yuxin Fu; Shuai Li; Lina Xu; Chen Ji; Qiao Xiao; Dongsheng Shi; Guifeng Wang; Wenqin Wang; Jirui Wang; Jiechen Wang; Yongrui Wu; Yuxin Fu; Shuai Li; Lina Xu; Chen Ji; Qiao Xiao; Dongsheng Shi; Guifeng Wang; Wenqin Wang; Jirui Wang; Jiechen Wang; Yongrui Wu

doi:10.48130/SeedBio-2023-0008

2023 Volume 2

Article Contents

Next Previous

ARTICLE Open Access

RNA sequencing of cleanly isolated early endosperms reveals coenocyte-to-cellularization transition features in maize

1.
National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Shanghai 200032, China
2.
Triticeae Research Institute, Sichuan Agricultural University, Chengdu 611130, China
3.
College of Life Science, Shanghai Normal University, 100 Guilin Road, Shanghai 200233, China
4.
University of the Chinese Academy of Sciences, Beijing 100049, China Shanghai, 200233, China
5.
National Key Laboratory of Wheat and Maize Crops Science, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
6.
Ministry of Education Key Laboratory for Crop Genetic Resources and Improvement in Southwest China, Sichuan Agricultural University, Chengdu, China
^# These authors contributed equally: Yuxin Fu, Shuai Li

More Information

Corresponding author: jcwang@cemps.ac.cn

Received: 17 February 2023
Accepted: 11 May 2023
Published online: 05 July 2023
Seed Biology 2, Article number: 8 (2023) | Cite this article

Abstract

Early endosperm development in maize (Zea mays) is essential for creating a functional endosperm for filling, but its rapid and dynamic process remains largely unknown. The coenocytic stage is a particular stage with rapid nuclear division without cytokinesis. From 48-144 h after pollination (HAP), endosperm mainly undergoes four cellular processes: coenocyte, cellularization, cell proliferation, and differentiation. Although the high temporal-resolution transcriptome data within 144 HAP of maize kernel development have been investigated, due to technical limitations, the samples contained the maternal nucellus and the embryo sac; as a consequence, many endosperm-specifically-expressed genes might be over-looked. In this study, we isolated early endosperms by free hand and laser-capture microdissection (LCM) and generated high-resolution transcriptome data from 48 to 144 HAP with an interval of 24 h. Through weighted gene co-expression network analysis (WGCNA), we identified nine distinct modules of co-expressed gene sets, of which Module 7 was composed of 5,555 genes that showed the highest expression levels at the coenocytic stage. In Module 7, there were 391 genes not expressed in nucellus, and thus were named as the Coenocyte-Expressed (CE) Gene Set. These genes were involved in transcriptional regulation and auxin-activated signaling pathway. Consistent with the stage transition of early endosperm development, the co-expressed gene sets and enriched gene function modules were changed accordingly. We verified the reliability of the transcriptome data by in situ hybridization. Our work provides a valuable gene resource for early endosperm development studies in the future.
- Maize,
- Coenocyte,
- Transcriptome,
- Early endosperm development

Supplementary information

Supplemental Fig. S1 Representative quality assessments of RNAs used in sequencing.
Supplemental Fig. S2 SCC (Spearman's rank Correlation Coefficient) analysis of the mRNA data using log2-transformed FPKM values.
Supplemental Fig. S3 Expression patterns of CE Gene Set in maize embryo sac and ovule by the public RNA-seq data^[27].
Supplemental Data Set 1 Data quality and mapping to B73 genome.
Supplemental Data Set 2 Expression level of genes in different samples.
Supplemental Data Set 3 The expression of WGCNA_modules genes.
Supplemental Data Set 4 En144 VS En48 downregulated genes set.
Supplemental Data Set 5 GO enrichment of the genes in Module 7.
Supplemental Data Set 6 Top 50_weight.
Supplemental Data Set 7 GO terms enriched in the Module 1.
Supplemental Data Set 8 The primers for in-situ hybridization of 2D specific genes.

Rights and permissions
Copyright: © 2023 by the author(s). Published by Maximum Academic Press on behalf of Hainan Yazhou Bay Seed Laboratory. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Lopes MA, Larkins BA. 1993. Endosperm origin, development, and function. The Plant Cell 5:1383−99 doi: 10.1105/tpc.5.10.1383 CrossRef Google Scholar
[2]	Olsen OA. 2004. Nuclear endosperm development in cereals and Arabidopsis thaliana. The Plant cell 16:S214−S227 doi: 10.1105/tpc.017111 CrossRef Google Scholar
[3]	Evans MMS. 2007. The indeterminate gametophyte1 gene of maize encodes a LOB domain protein required for embryo sac and leaf development. The Plant Cell 19:46−62 doi: 10.1105/tpc.106.047506 CrossRef Google Scholar
[4]	He YH, Wang JG, Qi WW, Song RT. 2019. Maize Dek15 encodes the cohesin-loading complex subunit SCC4 and is essential for chromosome segregation and kernel development. The Plant Cell 31:465−85 doi: 10.1105/tpc.18.00921 CrossRef Google Scholar
[5]	Huang Y, Wang H, Huang X, Wang Q, Wang J, et al. 2019. Maize VKS1 Regulates Mitosis and Cytokinesis During Early Endosperm Development. The Plant Cell 31:1238−56 doi: 10.1105/tpc.18.00966 CrossRef Google Scholar
[6]	Leroux BM, Goodyke AJ, Schumacher KI, Abbott CP, Clore AM, et al. 2014. Maize early endosperm growth and development: From fertilization through cell type differentiation. American Journal of Botany 101:1259−74 doi: 10.3732/ajb.1400083 CrossRef Google Scholar
[7]	Wang A, Garcia D, Zhang HY, Feng K, Chaudhury A, et al. 2010. The VQ motif protein IKU1 regulates endosperm growth and seed size in Arabidopsis. The Plant Journal 63:670−79 doi: 10.1111/j.1365-313X.2010.04271.x CrossRef Google Scholar
[8]	Olsen OA. 2001. Endosperm Development: Cellularization and cell fate specification. Annual Review of Plant Physiology and Plant Molecular Biology 52:233−67 doi: 10.1146/annurev.arplant.52.1.233 CrossRef Google Scholar
[9]	Sabelli PA, Larkins BA. 2009. The development of endosperm in grasses. Plant Physiology 149:14−26 doi: 10.1104/pp.108.129437 CrossRef Google Scholar
[10]	Yi F, Gu W, Chen J, Song N, Gao X, et al. 2019. High temporal-resolution transcriptome landscape of early maize seed development. The Plant Cell 31:974−92 doi: 10.1105/tpc.18.00961 CrossRef Google Scholar
[11]	Lai J, Dey N, Kim CS, Bharti AK, Rudd S, et al. 2004. Characterization of the maize endosperm transcriptome and its comparison to the rice genome. Genome Research 14:1932−37 doi: 10.1101/gr.2780504 CrossRef Google Scholar
[12]	Liu X, Fu J, Gu D, Liu W, Li T, et al. 2008. Genome-wide analysis of gene expression profiles during the kernel development of maize (Zea mays L.). Genomics 91:378−87 doi: 10.1016/j.ygeno.2007.12.002 CrossRef Google Scholar
[13]	Chen J, Lausser A, Dresselhaus T. 2014. Hormonal responses during early embryogenesis in maize. Biochemical Society Transactions 42:325−31 doi: 10.1042/BST20130260 CrossRef Google Scholar
[14]	Zhan J, Thakare D, Ma C, Lloyd A, Nixon NM, et al. 2015. RNA Sequencing of Laser-Capture Microdissected Compartments of the Maize Kernel Identifies Regulatory Modules Associated with Endosperm Cell Differentiation. The Plant Cell 27:513−31 doi: 10.1105/tpc.114.135657 CrossRef Google Scholar
[15]	Zhang S, Thakare D, Yadegari R. 2018. Laser-capture microdissection of maize kernel compartments for RNA-seq-based expression analysis. Methods in Molecular Biology 1676:153−63 doi: 10.1007/978-1-4939-7315-6_9 CrossRef Google Scholar
[16]	Kerk NM, Ceserani T, Tausta SL, Sussex IM, Nelson TM. 2003. Laser capture microdissection of cells from plant tissues. Plant Physiology 132:27−35 doi: 10.1104/pp.102.018127 CrossRef Google Scholar
[17]	Takahashi H, Kamakura H, Sato Y, Shiono K, Abiko T, et al. 2010. A method for obtaining high quality RNA from paraffin sections of plant tissues by laser microdissection. Journal of Plant Research 123:807−13 doi: 10.1007/s10265-010-0319-4 CrossRef Google Scholar
[18]	Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114−20 doi: 10.1093/bioinformatics/btu170 CrossRef Google Scholar
[19]	Kim D, Langmead B, Salzberg SL. 2015. HISAT: a fast spliced aligner with low memory requirements. Nature Methods 12:357−60 doi: 10.1038/nmeth.3317 CrossRef Google Scholar
[20]	Anders S, Pyl PT, Huber W. 2015. HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics 31:166−69 doi: 10.1093/bioinformatics/btu638 CrossRef Google Scholar
[21]	Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. 2011. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biology 12:R22 doi: 10.1186/gb-2011-12-3-r22 CrossRef Google Scholar
[22]	Benjamini Y, Hochberg Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B: Statistical Methodology 57:289−300 doi: 10.1111/j.2517-6161.1995.tb02031.x CrossRef Google Scholar
[23]	Zhang B, Horvath S. 2005. A general framework for weighted gene co-expression network analysis. Statistical applications in genetics and molecular biology 4:17 doi: 10.2202/1544-6115.1128 CrossRef Google Scholar
[24]	Langfelder P, Horvath S. 2008. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9:559 doi: 10.1186/1471-2105-9-559 CrossRef Google Scholar
[25]	Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, et al. 2003. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research 13:2498−504 doi: 10.1101/gr.1239303 CrossRef Google Scholar
[26]	Langfelder P, Horvath S. 2012. Fast R Functions for Robust Correlations and Hierarchical Clustering. Journal of Statistical Software 46:i11 Google Scholar
[27]	Li X, Wu J, Yi F, Lai J, Chen J. 2023. High temporal-resolution transcriptome landscapes of maize embryo sac and ovule during early seed development. Plant Molecular Biology 111:233−48 doi: 10.1007/s11103-022-01318-0 CrossRef Google Scholar
[28]	Yue R, Tie S, Sun T, Zhang L, Yang Y, et al. 2015. Genome-wide identification and expression profiling analysis of ZmPIN, ZmPILS, ZmLAX and ZmABCB auxin transporter gene families in maize (Zea mays L. ) under various abiotic stresses. PLoS One 10:e0118751 doi: 10.1371/journal.pone.0118751 CrossRef Google Scholar
[29]	Hagen G, Guilfoyle T. 2002. Auxin-responsive gene expression: genes, promoters and regulatory factors. Plant Molecular Biology 49:373−85 doi: 10.1023/A:1015207114117 CrossRef Google Scholar
[30]	Nardmann J, Werr W. 2006. The shoot stem cell niche in angiosperms: expression patterns of WUS orthologues in rice and maize imply major modifications in the course of mono- and dicot evolution. Molecular Biology and Evolution 23:2492−504 doi: 10.1093/molbev/msl125 CrossRef Google Scholar
[31]	Lowe K, Wu E, Wang N, Hoerster G, Hastings C, et al. 2016. Morphogenic regulators Baby boom and Wuschel improve monocot transformation. The Plant Cell 28:1998−2015 doi: 10.1105/tpc.16.00124 CrossRef Google Scholar
[32]	Myers PN, Setter TL, Madison JT, Thompson JF. 1990. Abscisic Acid inhibition of endosperm cell division in cultured maize kernels. Plant Physiology 94:1330−6 doi: 10.1104/pp.94.3.1330 CrossRef Google Scholar
[33]	Myers PN, Setter TL, Madison JT, Thompson JF. 1992. Endosperm cell division in maize kernels cultured at three levels of water potential. Plant Physiology 99:1051−56 doi: 10.1104/pp.99.3.1051 CrossRef Google Scholar
[34]	Guo L, Luo X, Li M, Joldersma D, Plunkert M, et al. 2022. Mechanism of fertilization-induced auxin synthesis in the endosperm for seed and fruit development. Nature Communications 13:3985 doi: 10.1038/s41467-022-31656-y CrossRef Google Scholar
[35]	Figueiredo DD, Köhler C. 2018. Auxin: a molecular trigger of seed development. Genes Development 32:479−90 doi: 10.1101/gad.312546.118 CrossRef Google Scholar
[36]	Batista RA, Figueiredo DD, Santos-González J, Köhler C. 2019. Auxin regulates endosperm cellularization in Arabidopsis. Genes Development 33:466−76 doi: 10.1101/gad.316554.118 CrossRef Google Scholar
[37]	Bernardi J, Lanubile A, Li QB, Kumar D, Kladnik A, et al. 2012. Impaired auxin biosynthesis in the defective endosperm18 mutant is due to mutational loss of expression in the ZmYuc1 gene encoding endosperm-specific YUCCA1 protein in maize. Plant Physiology 160:1318−28 doi: 10.1104/pp.112.204743 CrossRef Google Scholar
[38]	Bernardi J, Battaglia R, Bagnaresi P, Lucini L, Marocco A. 2019. Transcriptomic and metabolomic analysis of ZmYUC1 mutant reveals the role of auxin during early endosperm formation in maize. Plant Science 281:133−45 doi: 10.1016/j.plantsci.2019.01.027 CrossRef Google Scholar
[39]	Zhang M, Zheng H, Jin L, Xing L, Zou J, et al. 2022. miR169o and ZmNF-YA13 act in concert to coordinate the expression of ZmYUC1 that determines seed size and weight in maize kernels. New Phytologist 235:2270−84 doi: 10.1111/nph.18317 CrossRef Google Scholar
[40]	Forestan C, Meda S, Varotto S. 2010. ZmPIN1-mediated auxin transport is related to cellular differentiation during maize embryogenesis and endosperm development. Plant Physiology 152:1373−90 doi: 10.1104/pp.109.150193 CrossRef Google Scholar
[41]	Tang X, Zhang ZY, Zhang WJ, Zhao XM, Li X, et al. 2010. Global gene profiling of laser-captured pollen mother cells indicates molecular pathways and gene subfamilies involved in rice meiosis. Plant Physiology 154:1855−70 doi: 10.1104/pp.110.161661 CrossRef Google Scholar
[42]	Xu X, Crow M, Rice BR, Li F, Harris B, et al. 2021. Single-cell RNA sequencing of developing maize ears facilitates functional analysis and trait candidate gene discovery. Developmental Cell 56:557−568.E6 doi: 10.1016/j.devcel.2020.12.015 CrossRef Google Scholar
[43]	Ortiz-Ramírez C, Guillotin B, Xu X, Rahni R, Zhang S, et al. 2021. Ground tissue circuitry regulates organ complexity in maize and Setaria. Science 374:1247−52 doi: 10.1126/science.abj2327 CrossRef Google Scholar
[44]	Giacomello S, Lundeberg J. 2018. Preparation of plant tissue to enable Spatial Transcriptomics profiling using barcoded microarrays. Nature Protocols 13:2425−46 doi: 10.1038/s41596-018-0046-1 CrossRef Google Scholar
[45]	Doll NM, Just J, Brunaud V, Caïus J, Grimault A, et al. 2020. Transcriptomics at Maize Embryo/Endosperm Interfaces Identifies a Transcriptionally Distinct Endosperm Subdomain Adjacent to the Embryo Scutellum. The Plant Cell 32:833−52 doi: 10.1105/tpc.19.00756 CrossRef Google Scholar
[46]	Zhao P, Zhou X, Shen K, Liu Z, Cheng T, et al. 2019. Two-step maternal-to-zygotic transition with two-phase parental genome contributions. Developmental Cell 49:882−893.e5 doi: 10.1016/j.devcel.2019.04.016 CrossRef Google Scholar

About this article

Cite this article

Fu Y, Li S, Xu L, Ji C, Xiao Q, et al. 2023. RNA sequencing of cleanly isolated early endosperms reveals coenocyte-to-cellularization transition features in maize. Seed Biology 2:8 doi: 10.48130/SeedBio-2023-0008

Fu Y, Li S, Xu L, Ji C, Xiao Q, et al. 2023. RNA sequencing of cleanly isolated early endosperms reveals coenocyte-to-cellularization transition features in maize. Seed Biology 2:8 doi: 10.48130/SeedBio-2023-0008

Figures(6)

Download PDF

Article Metrics

Article views(11473) PDF downloads(1604)

Other Articles By Authors

on this site
- Yuxin Fu
- Shuai Li
- Lina Xu
- Chen Ji
- Qiao Xiao
- Dongsheng Shi
- Guifeng Wang
- Wenqin Wang
- Jirui Wang
- Jiechen Wang
- Yongrui Wu
on Google Scholar
- Yuxin Fu
- Shuai Li
- Lina Xu
- Chen Ji
- Qiao Xiao
- Dongsheng Shi
- Guifeng Wang
- Wenqin Wang
- Jirui Wang
- Jiechen Wang
- Yongrui Wu

HTML

Introduction

The endosperm of cereal crops is a main source of human food and animal feed, and is also used as raw materials for industry and biofuel. Endosperm functions as an absorptive structure that supports embryo development and later germinated seedling in angiosperms^[1,2]. In the process from the double fertilization to seed maturity, early endosperm development only represents a relatively short duration, but it plays a decisive role in establishment of the endosperm function. Many cases showed that mutation of key factors in this process led to kernel abortion^[3−7].

Maize kernel development originates from double fertilization in the embryo sac, where one sperm fertilizes the egg to form a diploid zygote that develops into the embryo; the other genetically identical sperm fuses with the two polar nuclei in the central cell and grows into the triploid endosperm. After double fertilization, early endosperm development undergoes four distinct cytological stages: coenocyte, cellularization, cell proliferation, and differentiation^[6]. Early endosperm development usually spans from 0 to 144 h after pollination (HAP). Generally, from 0 to 48 HAP, the fertilized central cell undergoes several rounds of nuclear division without cytokinesis, thereby resulting in formation of a coenocyte. Afterward, cellularization occurs till 96 HAP, creating a fully cellularized structure. From 96 to 144 HAP, the endosperm rapidly proliferates and begins to differentiate from 120 HAP. Although the four representative cell types (the embryo surrounding region (ESR), starchy endosperm (SE), basal endosperm transfer layer (BETL), and aleurone (AL)) are not morphologically distinguishable at 120 HAP, the marker genes for a specific cell type begin to express in the corresponding regions. The time point at 144 HAP is generally considered as the end of early endosperm development, after which the endosperm activity is characterized by rapid cell proliferation and cell differentiation. From 192 HAP, the endosperm begins to synthesize storage reserves, which is called endosperm filling^[1,6,8−10].

Over the years, several transcriptome profiling studies have advanced our understanding of the gene regulatory networks and cellular processes that control maize kernel development. Several endosperm cDNA libraries were constructed to identify gene expression at early-to-middle growth stages (4−6 days after pollination (DAP) and 7−23 DAP by using expressed-sequence-tag (EST) sequencing)^[11]. A microarray with approximately 58,000 probes was used to study dynamic gene expression from 1 to 35 DAP (five days as an interval)^[12]. By using high-throughput RNA sequencing (RNA-seq), a transcriptome atlas was generated using samples including embryo, endosperm and intact kernel, covering a long-time span from fertilization to maturity (0-38 DAP, two days as an interval), which revealed the extensive genetic control over the seed development^[13]. A coupled laser-capture microdissection (LCM) and RNA-seq approach was used to capture different cell types at 8 DAP, illustrating the gene spatial expression pattern and correlation of gene regulation between each cell types during cell differentiation^[14].

Recent studies reported a high-resolution transcriptome data for 31 consecutive time points by manual dissection of the nucellus within the first 144 HAP of kernel development (four or six hours as an interval)^[10]. This provides a useful data source for functional research of maize early endosperm development. However, the early embryo sac (including the endosperm and the embryo) is small in size and is surrounded by the maternal tissue nucellus, which contributes a large portion to the whole kernel for transcriptome sequencing in that study, especially at the first 48-HAP stage. Therefore, it is possible that some genes expressing in a specific pattern but at a low level failed to be identified, even they might be important for early endosperm development.

Dissection of intact endosperms for RNA-seq is challenging, especially for 0−48 HAP endosperms. Here, we isolated early endosperm samples at five time points (24 h as an interval) by free hand and used them for RNA-seq. To compare the transcriptome data created from the free-hand samples, we generated another set of 96-HAP endosperm transcriptome data, where the endosperm samples were made by paraffin embedding and LCM^[15]. In total, 22,853 unique genes were detected to express in at least one of the 22 samples ([FPKM] ≥ 1). Through WGCNA analysis, we identified nine distinct modules of co-expressed gene sets, of which Module 7 was composed of 5,555 genes that showed the highest expression level than genes in other modules at the coenocytic stage. In Module 7, a total of 391 genes were not detected for expression in nucellus, and therefore were regarded as the newly identified coenocyte-expressed (CE) gene set. We verified the reliability of the transcriptome data by in situ hybridization. Our work provides a valuable resource for studying early endosperm development in maize.

Materials and methods

Plant material collection

The maize (Zea mays) inbred line B73 was grown in the Songjiang experimental field in Shanghai (China) in 2021. All individual plants were self-pollinated at the same time. The embryo sac and endosperm were collected by manual dissection under a stereo microscope (Leica, Cat: DM2500) or a magnifying glass, and then were frozen immediately in liquid nitrogen in a 2 ml sterile tube. The materials were stored at −80°C before further processing. Each time point contains 3−4 replicates, and each replicate has at least 100 endosperms except for the one at 144 HAP, as 30−50 endosperms at 144 HAP are enough to extract 50 μg RNA.

Laser capture microdissection (LCM)
The self-pollinated B73 ears at 96 HAP were harvested for LCM. The uniform kernels in the middle region of the cob were dug out with the pedicle using sharp tweezers. To promote fixative penetration, 1-mm-thin sections in the middle of a kernel were longitudinally cut using a double-edge blade, and put immediately into a glass vial with prechilled Farmer's fixative (ethanol and glacial acetic acid in 3:1 ratio)^[16], and kept at 4 ^oC overnight. The fixed kernel sections were then dehydrated in a graded ethanol series, cleared in graded n-butanol, embedded in paraffin wax (McCormick Scientific, Leica Biosystems, Cat: 39503002) using microwave^[17]. Sections with 8−10 μm thickness were made as ribbons using a manual rotary microtome (Leica, Cat: RM2235), and mounted on PSA 1× White Slides (Leica, Cat: 39475275). Shortly before capture, sections were deparaffinized in Xylenes and air-dried^[17]. Individual cell types were selected and captured using a laser microdissection system (Leica, Cat: LMD6500), with an optimized set of conditions (cutting speed, width, and laser energy).

RNA extraction, cDNA amplification, RNA-seq library construction and sequencing
High-quality total RNA of LCM samples and manually dissected samples was extracted with Arcturus® PicoPure® Frozen RNA Isolation Kit (ThermoFisher, Cat: KIT0214). High fidelity linear amplification of total RNA extracted from LCM samples was performed using Arcturus^™ RiboAmp^™ HS PLUS Kit (ThermoFisher, Cat: KIT0525) following the manufacturer’s protocol. The quality and quantity of the selected RNA samples were checked on an Agilent 2100 Bioanalyzer (Agilent Technologies). A total of 22 RNA samples used for preparing RNA-seq libraries and sequencing on a NovaSeq 6000 platform (Illumina, Inc., San Diego, CA, USA) by OE Biotech (Shanghai, China).

Quality control and mapping
Raw data (raw reads) were processed using Trimmomatic^[18]. Clean reads were obtained by removing low-quality reads with ambiguous nucleotides, and adapter sequences were filtered from raw reads. An index of the reference genome was built using Bowtie v.2.0.6 and clean reads were aligned to the reference genome (ftp://ftp.ensemblgenomes.org/pub/plants/release-45/fasta/zea_mays/dna/) using Hisat2^[19]. Read counts of genes were acquired by HTSeq-count^[20], and expression levels were calculated using the fragments per kilobase per million reads (FPKM) method^[21].

The relative expression level of each transcript was calculated by the statistical package DEGseq2 (version 3.12), and the resulting p-values were adjusted by controlling for the false discovery rate (FDR). Genes with |log2-fold change| > 1 and FDR < 0.05 were considered as differentially expressed genes (DEGs)^[22].

WGCNA
The highly co-expressed gene modules were identified from the DEGs using WGCNA^[23,24]. Genes with low FPKM ([FPKM] < 1) or low coefficient of variation of FPKM (SD ≤ 0.5) were filtered out, and 9,322 genes were obtained as the input gene set for WGCNA. WGCNA network construction and module detection was conducted using an unsigned type of topological overlap matrix (TOM), a minimal module size of 30, and a branch merge cut height of 0.25. WGCNA was performed using online tools (https://cloud.oebiotech.com/task/). The intramodular connectivity value was calculated and used to evaluate the association of genes with modules. The top 50 hub genes in the CE Gene Set were filtered by their higher intramodular connectivity value. The connection weight to each gene pair was calculated, and 3,279 gene pairs were screened with a threshold of 0.5 in the CE Gene Set. The degree of the top 48 hub genes was greater than 70, the other is less than 40. So, these 48 genes are the real hub genes. The regulatory network of six hub TFs in the CE Gene Set was represented using Cytoscape 3.1^[25]. The FPKM values of each gene were convert to Z-score and then used to draw clustered heatmaps by pheatmap in R.

Functional enrichment analysis
To analyze the biological functions or pathways that are overrepresented in WGCNA modules. GO (Gene Ontology) analysis was performed using the OmicShare tools, a free online platform for data analysis (www.omicshare.com/tools). The significant threshold of p-value less than 0.05 would be identified for each GO category.

Histocytochemical analysis
To confirm the developmental stages of endosperms that were sampled for RNA-seq samples, the same batch of each sample was used for the histocytochemical analysis. Kernels were fixed in Farmer’s fixative (ethanol and glacial acetic acid in 3:1 ratio) as processed for LCM. After dehydration using a gradient concentration of ethanol, the sections were embedded in epoxide resin for semi-thin sectioning. The sections were stained with 0.5% toluidine blue solution for 10 min and photographed under bright field using a Leica DM2500 for photos.

Kernel fixation and in situ hybridization
The spatial expression pattern of selected specific genes was validated by RNA in situ hybridization. kernels at 48 HAP were fixed in 4% paraformaldehyde solution (Sigma) with 0.1% TritonX-100 (Sigma) and 0.1% Tween-20 in PBS (Solarbio, Cat: 9005-64-5) for 16 h. After dehydration using graded ethanol and vitrification by xylene, the samples were embedded in paraffin. Kernels were longitudinally cut into 7-μm sections using a Leica manual microtome (Leica, Cat: FM2235). The paraffin sections with visible coenocyte were intercepted and affixed to glass slides. In situ hybridization was carried out according to the protocol in previous studies^[5]. The fragment of each cDNA sequence was cloned and inserted into the pEasy-blunt-zero cloning vector (TransGen, Cat: CB501-01). The vectors used for the synthesis of antisense and sense RNA probes were transcribed in vitro by T7 and SP6 RNA polymerase, labeled by digoxigenin (DIG) according to the company instructions for the DIG RNA labeling kit (Roche, Cat: 11175025910).

{{lists.name}}

RNA sequencing of cleanly isolated early endosperms reveals coenocyte-to-cellularization transition features in maize