Chromosome-level genome assembly and annotation of the native Chinese wild blueberry <i>Vaccinium bracteatum</i>

Lu Yang; Minghui Li; Min Shen; Sijia Bu; Bo Zhu; Feng He; Xiaoping Zhang; Xuan Gao; Jiaxin Xiao; Lu Yang; Minghui Li; Min Shen; Sijia Bu; Bo Zhu; Feng He; Xiaoping Zhang; Xuan Gao; Jiaxin Xiao

doi:10.48130/FruRes-2022-0008

2022 Volume 2

Article Contents

Next Previous

ARTICLE Open Access

Chromosome-level genome assembly and annotation of the native Chinese wild blueberry Vaccinium bracteatum

1.
Key Laboratory for the Conservation and Utilization of Important Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, Anhui, China
2.
School of Marine and Biological Engineering, Yancheng Teachers University, Yancheng 224007, Jiangsu, China
^# These authors contributed equally: Lu Yang, Minghui Li

More Information

Corresponding authors: gaoxuan@ahnu.edu.cn; xjx0930@163.com

Received: 26 December 2021
Accepted: 04 May 2022
Published online: 29 June 2022
Fruit Research 2, Article number: 8 (2022) | Cite this article

Abstract

Vaccinium bracteatum Thunb., an important native Chinese wild blueberry species, is widely used as a rootstock and in blueberry cultivar breeding, as well as in traditional medicine and local food. We report here the genome sequence of V. bracteatum using a combination of Oxford Nanopore Technologies long-read and Illumina HiSeq short-read sequencing technologies to obtain 65.30 Gb of clean data, achieving 114.60-fold genome coverage. The assembled genome has a total sequence length of 569.81 Mb and consists of 36,756 predicted genes. Repetitive DNA sequences represent 57.78% of the genome sequence. Comparative genomic analysis revealed that a total of 336 gene families had expanded and that 298 candidate genes had undergone positive selection during evolution in V. bracteatum. The divergence of V. bracteatum from the related Rhododendron williamsianum and Rhododendron delavayi occurred ~13−85 million years ago. The genome sequence of V. bracteatum allowed us to identify some important genes associated with traits involved in fruit development, such as flavonoid biosynthesis, sugar and acid metabolism, MYB transcription factor gene expression, and hormone regulation. The differential expression patterns of genes encoding flavonoid biosynthesis enzymes and MYB transcription factors might explain the high flavonoid content of V. bracteatum. This chromosome-level genome assembly provides reference sequences for the identification and characterization of genes important in the improvement of blueberry and related research.
- Blueberry,
- Comparative genomics,
- Genomics,
- Flavonoid biosynthesis

Supplementary information

Supplementary Table S1 Sequencing data used for genome assembly and annotation of V. bracteatum.
Supplementary Table S2 Raw data and clean data statistics.
Supplementary Table S3 Genome completeness evaluation based on Reads alignment, CEGs and BUSCOs.
Supplementary Table S4 Statistics of Hi-C assembly data.
Supplementary Table S5 Statistics of repeat sequences.
Supplementary Table S6 Statistical results of coding gene quantity prediction based on Ab initio, Homology-based and RNA-Seq.
Supplementary Table S7 Prediction results of coding genes after EVM integration.
Supplementary Table S8 Noncoding RNA statistics and pseudogene prediction.
Supplementary Table S9 Gene function annotation statistics.
Supplementary Table S10 Gene family clustering statistical information of V. bracteatum and other six species.
Supplementary Table S11 Enrichment analysis of the gene family of Vaccinium bracteatum.
Supplementary Table S12 Annotation of expansion and contraction gene families among seven species.
Supplementary Table S13 A total of 298 candidate genes with positive selection in Vaccinium bracteatum.
Supplementary Table S14 Annotation of positively selected genes in Vaccinium bracteatum.
Supplementary Table S15 Number statistics of positively selected genes in terms by GO analysis.
Supplementary Table S16 Annotation of positively selected genes with KEGG analysis.
Supplementary Fig. S1 K-mer distribution map The k-mer analysis (k=19) of Vaccinium bracteatum genome characteristics.
Supplementary Fig. S2 Interactive thermogram of Hi-C assembly chromosome Note: LG01-LG12 stands for Lachesis group 01-12; abscissa and ordinate represent the order of each bin on the corresponding chromosome group.
Supplementary Fig. S3 Distribution of integrated genes derived from three prediction methods.
Supplementary Figs S4 Statistical chart of functional annotation classification of KOG Note: abscissa is the content of each KOG classification, and the ordinate is the number of genes.
Supplementary Figs S5 KEGG path annotation diagram.
Supplementary Figs S6 GO secondary node annotation classification statistical chart Note: The left side of the ordinate is the percentage of gene number, and the right side is the number of genes.
Supplementary Fig. S7 GO and KEGG enrichment analysis of Vaccinium bracteatum specific gene family.
Supplementary Fig. S8 The Ks distribution map within and between species.
Supplementary Fig. S9 Principle component analysis of 12 samples to assess data quality.
Supplementary Fig. S10 Number of significantly downregulated (down) and upregulated (up) genes in the two comparisons.
Supplementary Fig. S11 Gene ontology (GO) analysis of Differentially expressed genes (DEGs) in the comparisons of pink fruit vs. green fruit (a) , and blue fruit vs. pink fruit (b) .
Supplementary Fig. S12 Heat map diagram of the expression of differently expressed genes (DEGs) related to carbohydrate metabolism in fruit at three stages.

Rights and permissions
Copyright: 2022 by the author(s). Exclusive Licensee Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Xu Y, Fan M, Zhou S, Wang L, Qian H, et al. 2017. Effect of Vaccinium bracteatum Thunb. leaf pigment on the thermal, pasting, and textural properties and microstructure characterization of rice starch. Food chemistry 228:435−40 doi: 10.1016/j.foodchem.2017.02.041 CrossRef Google Scholar
[2]	Wang L, Zhang Y, Xu M, Wang Y, Cheng S, et al. 2013. Anti-diabetic activity of Vaccinium bracteatum Thunb. leaves’ polysaccharide in STZ-induced diabetic mice. International journal of biological macromolecules 61:317−21 doi: 10.1016/j.ijbiomac.2013.07.028 CrossRef Google Scholar
[3]	Wang L, Jiang T, Zhang H, Yao H. 2008. Study on the extraction of black pigment from Vaccinium bracteatum Thunb. leaves by enzyme and its stability. Science and Technology of Food Industry 29:224−226+258 doi: 10.13386/j.issn1002-0306.2008.10.055 CrossRef Google Scholar
[4]	Fan M, Lian W, Li T, Fan Y, Rao Z, et al. 2020. Metabolomics approach reveals discriminatory metabolites associating with the blue pigments from Vaccinium bracteatum thunb. leaves at different growth stages. Industrial Crops and Products 147:112252 doi: 10.1016/j.indcrop.2020.112252 CrossRef Google Scholar
[5]	Zhang J, Chu C, Li X, Yao S, Yan B, et al. 2014. Isolation and identification of antioxidant compounds in Vaccinium bracteatum Thunb. by UHPLC-Q-TOF LC/MS and their kidney damage protection. Journal of Functional Foods 11:62−70 doi: 10.1016/j.jff.2014.09.005 CrossRef Google Scholar
[6]	Ren Y, Ke C, Tang C, Yao S, Ye Y. 2017. Divaccinosides A–D, four rare iridoid glucosidic truxillate esters from the leaves of Vaccinium bracteatum. Tetrahedron Letters 58(24):2385−8 doi: 10.1016/j.tetlet.2017.05.013 CrossRef Google Scholar
[7]	Zhao J, Wu Y, Niu X, Zhang Y, Xu X, et al. 2017. Content determination of vaccinoside in leaves of Vaccinium bracteatum Thunb. by HPLC. Shanghai Journal of Traditional Chinese Medicine 51:100−2 doi: 10.16305/j.1007-1334.2017.10.027 CrossRef Google Scholar
[8]	Fan M, Li T, Li Y, Qian H, Zhang H, et al. 2021. Vaccinium bracteatum Thunb. as a promising resource of bioactive compounds with health benefits: An updated review. Food Chemistry 356:129738 doi: 10.1016/j.foodchem.2021.129738 CrossRef Google Scholar
[9]	Polashock J, Zelzion E, Fajardo D, Zalapa J, Georgi L, et al. 2014. The American cranberry: first insights into the whole genome of a species adapted to bog habitat. BMC Plant Biology 14:165 doi: 10.1186/1471-2229-14-165 CrossRef Google Scholar
[10]	Diaz-Garcia L, Garcia-Ortega LF, González-Rodríguez M, Delaye L, Iorizzo M, et al. 2021. Chromosome-Level Genome Assembly of the American Cranberry (Vaccinium macrocarpon Ait. ) and Its Wild Relative Vaccinium microcarpum. Frontiers in Plant Science 12:633310 doi: 10.3389/fpls.2021.633310 CrossRef Google Scholar
[11]	Wu C, Deng C, Hilario E, Albert NW, Lafferty D, et al. 2022. A chromosome-scale assembly of the bilberry genome identifies a complex locus controlling berry anthocyanin composition. Molecular Ecology Resources 22:345−60 doi: 10.1111/1755-0998.13467 CrossRef Google Scholar
[12]	Tsuda H, Kunitake H, Yamasaki M, Komatsu H, Yoshioka K. 2013. Production of intersectional hybrids between colchicine-induced tetraploid shashanbo (Vaccinium bracteatum) and highbush blueberry ‘Spartan’. Journal of the American Society for Horticultural Science 138:317−24 doi: 10.21273/JASHS.138.4.317 CrossRef Google Scholar
[13]	Costich DE, Ortiz R, Meagher TR, Bruederle LP, Vorsa N. 1993. Determination of ploidy level and nuclear DNA content in blueberry by flow cytometry. Theoretical and Applied Genetics 86:1001−6 doi: 10.1007/BF00211053 CrossRef Google Scholar
[14]	Li X, Sun H, Pei J, Dong Y, Wang F, et al. 2012. De novo sequencing and comparative analysis of the blueberry transcriptome to discover putative genes related to antioxidants. Gene 511:54−61 doi: 10.1016/j.gene.2012.09.021 CrossRef Google Scholar
[15]	Gupta V, Estrada AD, Blakley I, Reid R, Patel K, et al. 2015. RNA-Seq analysis and annotation of a draft blueberry genome assembly identifies candidate genes involved in fruit ripening, biosynthesis of bioactive compounds, and stage-specific alternative splicing. Gigascience 4:5 doi: 10.1186/s13742-015-0046-9 CrossRef Google Scholar
[16]	Colle M, Leisner CP, Wai CM, Ou S, Bird KA, et al. 2019. Haplotype-phased genome and evolution of phytonutrient pathways of tetraploid blueberry. GigaScience 8:giz012 doi: 10.1093/gigascience/giz012 CrossRef Google Scholar
[17]	Yu J, Hulse-Kemp AM, Babiker E, Staton M. 2021. High-quality reference genome and annotation aids understanding of berry development for evergreen blueberry (Vaccinium darrowii). Horticulture Research 8:228 doi: 10.1038/s41438-021-00641-9 CrossRef Google Scholar
[18]	Wang P, Luo Y, Huang J, Gao S, Zhu G, et al. 2020. The genome evolution and domestication of tropical fruit mango. Genome Biology 21:60 doi: 10.1186/s13059-020-01959-8 CrossRef Google Scholar
[19]	Rose JP, Kleist TJ, Löfstrand SD, Drew BT, Schönenberger J, et al. 2018. Phylogeny, historical biogeography, and diversification of angiosperm order Ericales suggest ancient Neotropical and East Asian connections. Molecular Phylogenetics and Evolution 122:59−79 doi: 10.1016/j.ympev.2018.01.014 CrossRef Google Scholar
[20]	Soza VL, Lindsley D, Waalkes A, Ramage E, Patwardhan RP, et al. 2019. The Rhododendron genome and chromosomal organization provide insight into shared whole-genome duplications across the heath family (Ericaceae). Genome Biology and Evolution 11:3353−71 doi: 10.1093/gbe/evz245 CrossRef Google Scholar
[21]	Peng Y, Lin-Wang K, Cooney JM, Wang T, Espley RV, et al. 2019. Differential regulation of the anthocyanin profile in purple kiwifruit (Actinidia species). Horticulture Research 6:3 doi: 10.1038/s41438-018-0076-4 CrossRef Google Scholar
[22]	Dong J, Cao L, Zhang X, Zhang W, Yang T, et al. 2021. An R2R3-MYB transcription Factor RmMYB108 responds to chilling stress of Rosa multiflora and conferred cold tolerance of Arabidopsis. Frontiers in Plant Science 12:696919 doi: 10.3389/fpls.2021.696919 CrossRef Google Scholar
[23]	Chen Y, Yang X, Li W, Zhao S. 2020. Knockdown of the DUF647 family memberRUS4 impairs stamen development and pollen maturation in Arabidopsis. Plant Science 301:110645 doi: 10.1016/j.plantsci.2020.110645 CrossRef Google Scholar
[24]	Cheng H, Han L, Yang C, Wu X, Zhong N, et al. 2016. The cotton MYB108 forms a positive feedback regulation loop with CML11 and participates in the defense response against Verticillium dahliae infection. Journal of Experimental Botany 67:1935−50 doi: 10.1093/jxb/erw016 CrossRef Google Scholar
[25]	Wei Z, Hu K, Zhao D, Tang J, Huang Z, et al. 2020. MYB44 competitively inhibits the formation of the MYB340-bHLH2-NAC56 complex to regulate anthocyanin biosynthesis in purple-fleshed sweet potato. BMC Plant Biology 20:258 doi: 10.1186/s12870-020-02451-y CrossRef Google Scholar
[26]	El-Sharkawy I, Liang D, Xu K. 2015. Transcriptome analysis of an apple (Malus × domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation. Journal of Experimental Botany 66:7359−76 doi: 10.1093/jxb/erv433 CrossRef Google Scholar
[27]	Lin Q, Wang C, Dong W, Jiang Q, Wang D, et al. 2015. Transcriptome and metabolome analyses of sugar and organic acid metabolism in Ponkan (Citrus reticulata) fruit during fruit maturation. Gene 554:64−74 doi: 10.1016/j.gene.2014.10.025 CrossRef Google Scholar
[28]	Guo S, Sun H, Zhang H, Liu J, Ren Y, et al. 2015. Comparative Transcriptome Analysis of Cultivated and Wild Watermelon during Fruit Development. PloS One 10:e0130267 doi: 10.1371/journal.pone.0130267 CrossRef Google Scholar
[29]	Rahim MA, Robin AHK, Natarajan S, Jung HJ, Lee J, et al. 2018. Identification and Characterization of Anthocyanin Biosynthesis-Related Genes in Kohlrabi. Applied Biochemistry and Biotechnology 184:1120−41 doi: 10.1007/s12010-017-2613-2 CrossRef Google Scholar
[30]	Li Y, Nie P, Zhang H, Wang L, Wang H, et al. 2017. Dynamic changes of anthocyanin accumulation and endogenous hormone contents in blueberry. Journal of Beijing Forestry University 39:64−71 doi: 10.13332/j.1000-1522.20160283 CrossRef Google Scholar
[31]	Primetta AK, Karppinen K, Riihinen KR, Jaakola L. 2015. Metabolic and molecular analyses of white mutant Vaccinium berries show down-regulation of MYBPA1-type R2R3 MYB regulatory factor. Planta 242:631−43 doi: 10.1007/s00425-015-2363-8 CrossRef Google Scholar
[32]	Lin Y, Wang Y, Li B, Tan H, Li D, et al. 2018. Comparative transcriptome analysis of genes involved in anthocyanin synthesis in blueberry. Plant Physiology and Biochemistry 127:561−72 doi: 10.1016/j.plaphy.2018.04.034 CrossRef Google Scholar
[33]	Rogers SO, Bendich AJ. 1985. Extraction of DNA from milligram amounts of fresh, herbarium and mummified plant tissues. Plant Molecular Biology 5:69−76 doi: 10.1007/BF00020088 CrossRef Google Scholar
[34]	Jiang S, An H, Xu F, Zhang X. 2020. Chromosome-level genome assembly and annotation of the loquat (Eriobotrya japonica) genome. GigaScience 9:giaa015 doi: 10.1093/gigascience/giaa015 CrossRef Google Scholar
[35]	Li R, Fan W, Tian G, Zhu H, He L, et al. 2010. The sequence and de novo assembly of the giant panda genome. Nature 463:311−17 doi: 10.1038/nature08696 CrossRef Google Scholar
[36]	Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, et al. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research 27:722−36 doi: 10.1101/gr.215087.116 CrossRef Google Scholar
[37]	Vaser R, Sović I, Nagarajan N, Šikić M. 2017. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27:737−46 doi: 10.1101/gr.214270.116 CrossRef Google Scholar
[38]	Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, et al. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963 doi: 10.1371/journal.pone.0112963 CrossRef Google Scholar
[39]	Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, et al. 2009. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326:289−93 doi: 10.1126/science.1181369 CrossRef Google Scholar
[40]	Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, et al. 2014. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159:1665−80 doi: 10.1016/j.cell.2014.11.021 CrossRef Google Scholar
[41]	Burton JN, Adey A, Patwardhan RP, Qiu R, Kitzman JO, et al. 2013. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nature Biotechnology 31:1119−25 doi: 10.1038/nbt.2727 CrossRef Google Scholar
[42]	Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754−60 doi: 10.1093/bioinformatics/btp324 CrossRef Google Scholar
[43]	Parra G, Bradnam K, Korf I. 2007. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23:1061−67 doi: 10.1093/bioinformatics/btm071 CrossRef Google Scholar
[44]	Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210−12 doi: 10.1093/bioinformatics/btv351 CrossRef Google Scholar
[45]	Price AL, Jones NC, Pevzner PA. 2005. De novo identification of repeat families in large genomes. Bioinformatics 21:i351−i358 doi: 10.1093/bioinformatics/bti1018 CrossRef Google Scholar
[46]	Xu Z, Wang H. 2007. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Research 35:W265−W268 doi: 10.1093/nar/gkm286 CrossRef Google Scholar
[47]	Hoede C, Arnoux S, Moisset M, Chaumier T, Inizan O, et al. 2014. PASTEC: an automatic transposable element classification tool. PloS One 9:e91929 doi: 10.1371/journal.pone.0091929 CrossRef Google Scholar
[48]	Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, et al. 2005. Repbase Update, a database of eukaryotic repetitive elements. Cytogenetic and Genome Research 110:462−7 doi: 10.1159/000084979 CrossRef Google Scholar
[49]	Tarailo-Graovac M, Chen N. 2009. Using RepeatMasker to identify repetitive elements in genomic sequences. Current Protocols in Bioinformatics 25:4.10.1−4.10.14 doi: 10.1002/0471250953.bi0410s25 CrossRef Google Scholar
[50]	Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research 25:955−64 doi: 10.1093/nar/25.5.955 CrossRef Google Scholar
[51]	Kent WJ. 2002. BLAT — the BLAST-like alignment tool. Genome Research 12:656−64 doi: 10.1101/gr.229202 CrossRef Google Scholar
[52]	She R, Chu JS, Wang K, Pei J, Chen N. 2009. GenBlastA: enabling BLAST to identify homologous gene sequences. Genome Research 19:143−49 doi: 10.1101/gr.082081.108 CrossRef Google Scholar
[53]	Birney E, Clamp M, Durbin R. 2004. GeneWise and Genomewise. Genome Research 14:988−95 doi: 10.1101/gr.1865504 CrossRef Google Scholar
[54]	Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. Journal of Molecular Biology 215:403−10 doi: 10.1016/S0022-2836(05)80360-2 CrossRef Google Scholar
[55]	Dimmer EC, Huntley RP, Alam-Faruque Y, Sawford T, O'Donovan C, et al. 2012. The UniProt-GO Annotation database in 2011. Nucleic Acids Research 40:D565−D570 doi: 10.1093/nar/gkr1048 CrossRef Google Scholar
[56]	Emms DM, Kelly S. 2019. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biology 20:238 doi: 10.1186/s13059-019-1832-y CrossRef Google Scholar
[57]	Mi H, Muruganujan A, Ebert D, Huang X, Thomas PD. 2019. PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Research 47:D419−D426 doi: 10.1093/nar/gky1038 CrossRef Google Scholar
[58]	Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution 32:268−74 doi: 10.1093/molbev/msu300 CrossRef Google Scholar
[59]	Katoh K, Asimenos G, Toh H. 2009. Multiple alignment of DNA sequences with MAFFT. In Bioinformatics for DNA Sequence Analysis. Methods in Molecular Biology, eds. Posada D. (eds) vol. 537: XIV, 354. New York: Humana Press. pp. 39−64 https://doi.org/10.1007/978-1-59745-251-9_3
[60]	Talavera G, Castresana J. 2007. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic Biology 56:564−77 doi: 10.1080/10635150701472164 CrossRef Google Scholar
[61]	Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature Methods 14:587−89 doi: 10.1038/nmeth.4285 CrossRef Google Scholar
[62]	Yang Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Bioinformatics 13:555−56 doi: 10.1093/bioinformatics/13.5.555 CrossRef Google Scholar
[63]	Han MV, Thomas GW, Lugo-Martinez J, Hahn MW. 2013. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Molecular Biology and Evolution 30:1987−97 doi: 10.1093/molbev/mst100 CrossRef Google Scholar
[64]	Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Molecular Biology and Evolution 24:1586−91 doi: 10.1093/molbev/msm088 CrossRef Google Scholar
[65]	Zwaenepoel A, Van de Peer Y. 2019. wgd—simple command line tools for the analysis of ancient whole-genome duplications. Bioinformatics (Oxford, England) 35:2153−55 doi: 10.1093/bioinformatics/bty915 CrossRef Google Scholar
[66]	Buchfink B, Xie C, Huson DH. 2015. Fast and sensitive protein alignment using DIAMOND. Nature Methods 12:59−60 doi: 10.1038/nmeth.3176 CrossRef Google Scholar
[67]	Wang Y, Tang H, Debarry JD, Tan X, Li J, et al. 2012. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Research 40:e49 doi: 10.1093/nar/gkr1293 CrossRef Google Scholar
[68]	Tang H, Krishnakuar V, Li J. 2015. jcvi: JCVI utility libraries. Zenodo. http://doi.org/10.5281/zenodo.31631
[69]	Xu Y, Bi C, Wu G, Wei S, Dai X, et al. 2016. VGSC: A web-based vector graph toolkit of genome synteny and collinearity. BioMed Research International 2016:7823429 doi: 10.1155/2016/7823429 CrossRef Google Scholar

About this article

Cite this article

Yang L, Li M, Shen M, Bu S, Zhu B, et al. 2022. Chromosome-level genome assembly and annotation of the native Chinese wild blueberry Vaccinium bracteatum. Fruit Research 2:8 doi: 10.48130/FruRes-2022-0008

Yang L, Li M, Shen M, Bu S, Zhu B, et al. 2022. Chromosome-level genome assembly and annotation of the native Chinese wild blueberry Vaccinium bracteatum. Fruit Research 2:8 doi: 10.48130/FruRes-2022-0008

Figures(8) / Tables(1)

Download PDF

Article Metrics

Article views(12654) PDF downloads(1356)

Other Articles By Authors

on this site
- Lu Yang
- Minghui Li
- Min Shen
- Sijia Bu
- Bo Zhu
- Feng He
- Xiaoping Zhang
- Xuan Gao
- Jiaxin Xiao
on Google Scholar
- Lu Yang
- Minghui Li
- Min Shen
- Sijia Bu
- Bo Zhu
- Feng He
- Xiaoping Zhang
- Xuan Gao
- Jiaxin Xiao

HTML

INTRODUCTION

Vaccinium bracteatum Thunb. (known as 'sea bilberry', 'oriental blueberry', or 'Nan zhu' in China) is a wild blueberry, widely distributed in East Asia, especially in China, Japan, and Korea^[1]. V. bracteatum is a traditional medicinal plant, recorded in the Compendium of Materia Medica. Many studies have reported the health benefits of extracts from V. bracteatum leaves or fruit^[2−4]. In eastern coastal regions of China, the pigment from V. bracteatum leaves is used to dye rice to produce 'Wu Mi Fan', a well-known local traditional food, dating back 1,000 years^[4]. Studies have shown that V. bracteatum leaves contain a number of phytochemical compounds, such as flavonoids^[4,5], polysaccharides^[2], iridoid glycosides^[6], vaccinoside^[7], free amino acids, and organic acids^[1,6,8].

Chromosome-level genome assembly of some vacciniums have been reported such as cranberry^[9,10] and bilberry^[11]. V. bracteatum can also be used as a rootstock to enhance the adaptation of cultivated blueberry. However, little research has been reported on V. bracteatum due to the lack of genomic information. Here, we report on the sequences of the whole-genome assembly and of the transcriptome of V. bracteatum. Our results provide key insights into the transcriptional regulation of flavonoid biosynthesis genes in V. bracteatum.

DISCUSSION

V. bracteatum is a member of the Ericaceae and is a typical diploid plant, with 12 different chromosomes and a very small genome^[12]. In the current study, we report the first de novo assembly of the V. bracteatum genome, through a combination of ONT long-read and Illumina HiSeq short-read sequencing technologies. Due to the complexity of the blueberry genetic diversity, there are many difficulties in the study of blueberry biological characteristics, especially its genome^[13,14]. For example, a draft genome for a wild diploid species V. corymbosum (2n = 2x =24) of blueberry was previously assembled consisting of a large number of scaffolds (total of 13,757; N50 of ~145 kb), a high percentage of gaps (~27.35%) in a ~393.16 Mb assembly^[15]. The first chromosome-scale genome assembly of the tetraploid highbush blueberry (V. corymbosum cv. Draper) (2n = 4x = 48) consisted of 48 pseudomolecules with ~1.68 Gb of assembled sequences^[16]. V. darrowii Camp (2n = 2x = 24) of blueberry is scaffolded into 24 chromosomes with ~1.06 Gb^[17]. The genome of V. bracteatum is very small and the diploid may be easier to be homozygous when comparing to multiploidy, so that obtaining the whole-genome information for V. bracteatum for molecular biology research could provide guidance and reference for the larger and more complex genome of the cultivated blueberry.

Based on the whole-genome sequencing data, comparative genomic analysis was performed between V. bracteatum and six other related plant species. Similar comparative genomics analysis studies have been reported for other plant species. For example, compared with the ratio of gene expansion to contraction in sweet orange (0.6) and longan (0.4), the ratio in the mango genome was 4.5^[18]. The highest ratio (expansion of 336 vs. contraction of 9 gene families) among the seven species studied was in the V. bracteatum genome, reflecting a relatively recent occurrence of the WGD event in the V. bracteatum genome.

PANTHER annotation results showed that the MYB gene family (OG0000036), F-box gene family (OG0000011) and the LRR receptor-like serine/threonine protein kinase gene family (OG0000010) belonged to a large extended gene family (Supplementary Table S12). GO and KEGG analysis of the expanded gene family showed that the gene family associated with oxidoreductase (GO) and flavonoid synthesis-related enzyme (KEGG) activities showed strong gene expansion in V. bracteatum (Supplementary Table S8). These results were in agreement with an earlier investigation in blueberry, where several genes encoding key biosynthetic steps in many antioxidant pathways were enriched with tandem gene duplications, and expanded gene families were involved in the biosynthesis of anthocyanins. Compared with the six other plant species, 298 positively selected genes were identified in the current study in V. bracteatum (Supplementary Table S13). These positively selected, expanded genes offered valuable insights into the formation of phenotypic characteristics and evolution of V. bracteatum.

A previous study had demonstrated that Rhododendron and Vaccinium represent species-rich genera within the Ericaceae, which had diverged from one another ~77 Mya^[19]. Compared with R. delavayi and R. williamsianum, collinearity analysis in the present study showed few scattered points in the V. bracteatum/R. williamsianum comparison in a scatter diagram, suggesting a close relationship between these latter two species. The evolutionary analysis suggested that V. bracteatum and the other two species, R. delavayi and R. williamsianum, may have diverged ~13-85 Mya (Figs 3a & 4). Previous evidence had shown that the two shared WGDs represented by similar Ks values in the Rhododendron and Vaccinium genomes represent two ancient shared WGDs, originating from a common ancestor of the Ericaceae, which can be traced back to a common ancestor of the Ericales^[20]. In our current study, the low Ks value of 0.67 found in V. bracteatum, compared with F. vesca and R. occidentalis, suggested that the divergence of V. bracteatum occurred later than for the other two species (Fig. 3c). The genomic data from the current study will provide valuable reference material for understanding the expression and regulation of important agronomic traits in V. bracteatum and related species.

Flavonoid-related genes were located on the genome of V. bracteatum. These genes did not cluster together, indicating that V. bracteatum may have undergone several WGD events. Chalcone synthase (CHS) is a key enzyme in the flavonoid biosynthesis pathway, and eight genes encoding CHS were detected in the V. bracteatum genome, more than was reported from any of the other six species. Nine genes encoding F3'5'H were detected in the V. bracteatum genome. FLS is an important enzyme necessary for flavonol biosynthesis, and seven FLS genes were detected in the V. bracteatum genome. The up-regulated expression of many structural genes involved in flavonoid biosynthesis during fruit ripening suggested that flavonoids play an important role during fruit development.

Compared with unripe, green fruit, some MYB transcription factor genes exhibited greater transcript abundance in developing pink and blue fruits, indicating that these genes may be involved in controlling pathways of flavonoid biosynthesis during V. bracteatum fruit maturation. MYB110, MYB108, and MYB44 were particularly highly expressed during fruit development, which indicated that these genes may play an important role in regulating flavonoid biosynthesis during fruit maturation. In kiwifruit^[21], the lack of MYB110 expression is responsible for the total absence of anthocyanins in the fruits, as MYB110 promotes the transcription of the F3'H and F3'5'H genes. MYB108 is involved in regulating various biosynthetic pathways in different plant species. In Rosa multiflora, MYB108 expression was induced by chilling stress^[22], and MYB108 expression was required for jasmonic acid-mediated stamen and pollen maturation in Arabidopsis^[23], while overexpression of MYB108 in Arabidopsis thaliana conferred improved tolerance to the Verticillium dahlia infection^[24]. On the other hand, MYB44 acts as a repressor of anthocyanin biosynthesis in sweet potato^[25], suggesting that MYB44 may have multiple functions in plants. MYB10 and MYB_3 were downregulated in V. bracteatum during fruit development, which indicated that these genes may inhibit the biosynthesis of anthocyanins. On the other hand, MYB10 expression increased during the biosynthesis of anthocyanidins in apple^[26].

Differential expression patterns of sucrose synthase genes during V. bracteatum fruit development indicated that they may play various roles in fruit development. Sugar transporters have been proved to regulate intercellular sugar transport in the phloem to ripening fruit. In the current study, expression of three of the AST (encoding aspartate aminotransferase) homologs increased during the early stage of V. bracteatum fruit development, with no significant change being observed during the late stage. The expression of PEPC (encoding phosphoenolpyruvate carboxylase 4) was downregulated during the early stage of fruit development, a finding which was consistent with the results from a previous study^[27]. We found that expression of IDH (encoding isocitrate dehydrogenase) was upregulated early in fruit development, whereas that of GDH (encoding glutamate dehydrogenase) was upregulated at the late stage. It seemed that most of the genes related to high fruit acidity were highly expressed early in fruit development in V. bracteatum, whereas the genes related to fruit sugar levels were highly expressed late in fruit development (Supplementary Fig. S12).

The genes associated with ethylene biosynthesis or signaling were highly expressed in the early stage rather than the late stage of fruit development. Two ACC oxidase and one ethylene receptor (ETR) genes were upregulated at the early stage. Ethylene has been reported to negatively regulate anthocyanin content^[28]. The expression of NCED5 (encoding 9-cis-epoxycarotenoid dioxygenase 5) was upregulated in both the early and late stages of fruit development. NCEDs catalyze the first step of abscisic acid (ABA) biosynthesis from carotenoids. High expression of NCEDs was found in purple-skinned apple compared with non-purple fruit^[29]. The expression of ethylene- or ABA-related genes indicated that the increasing content of anthocyanin during fruit maturation may be related to the regulation of phytohormone concentrations.

Hormones affect fruit development not only by the interaction between different hormones but also by the interaction between the hormone and sugars. For example, ABA, the auxin indole-3-acetic acid (IAA), and ethylene work together to regulate the development of blueberry fruit, while gibberellins (GA) and IAA can promote the absorption of sugar at the early stage of fruit development^[30], whereas ABA and ethylene enhance it at the late stage. In our current study, ABA and ethylene may also increase the anthocyanin content. The blueberry color change during ripening is caused by changes in the anthocyanin content. The expression of some flavonoid biosynthesis structural genes such as phenylalanine ammonia-lyase (PAL), CHS, CHI, F3H, and F3'H, as well that of some transcription factor genes^[31], increased during fruit development^[32]. These results were in agreement with the findings of our own research.

In conclusion, we present here a chromosome-level genome sequence of the wild blueberry species V. bracteatum. This first genome assembly from wild blueberry is expected to advance our understanding of the evolutionary history of blueberry and of the gene expression changes which occur during fruit development. The genome sequence will provide fundamental genomic resources for blueberry improvement. Our phylogenetic analysis of MYB transcriptional factors in wild blueberry has already led to the discovery of several novel MYBs, as well as providing evidence to suggest that MYB110, MYB108, and MYB44 may play an important role in fruit maturation. From the transcriptome, levels of CHS, C4H, F3'5'H, sucrose synthases, sugar transporters, ACC oxidase, and ETR were shown to increase at the late stage of fruit development. In this way, the V. bracteatum genome will serve as an important resource for the development of genomics-assisted selection to achieve blueberry improvement, particularly for traits related to the efficiency of flavonoid production and with stress tolerance.

Assembly feature	Statistic
Assembly feature	Contig-level assembly	Chromosome- scale assembly
Estimated genome size (by k-mer analysis) (Mb)	579.42
Repetitive sequence content	42.72%	57.78%
GC content (estimation)	38.63%
Estimated heterozygosity	1.10%
Assembled genome size (Mb)	569.81
Contig number	1,384	1,430
Contig N50 (Mb)	1.98	1.87
Contig N90 (Mb)	0.30	0.26
Contig max (Mb)	9.42	9.42
GC content (Nanopore)	38.32%	38.32%
Assembly % of genome	98.33%
Scaffold number		973
Scaffold length (Mb)		569.86
Contig length (Mb)		569.81
Scaffold N50 (Mb)		43.77
Scaffold N90 (Mb)		39.17
Scaffold max (Mb)		50.70
Gap total length (Mb)		0.05

{{lists.name}}

Chromosome-level genome assembly and annotation of the native Chinese wild blueberry Vaccinium bracteatum