-
With a large amount of previously known resequencing data, 20,597 core SNPs (cSNPs) with high detection rates, homogeneity, and repeatability were screened out. By designing probes covering cSNPs and high-throughput detection of their captured fragments, these cSNPs and 76,856 other SNPs in these regions were identified. All these 97,453 SNPs are called multiple SNPs (mSNPs).
These markers were evenly distributed on 19 chromosomes (chr), covered 457,925,245 bp of the genome (Fig. 1a, Table 1). Among them, Chr18, the longest chromosome, was covered by 1,408 cSNPs/6,587 mSNPs, and the shortest chr17 was covered by 1,013 cSNPs/4,808 mSNPs. The average distance between cSNPs was 22,233 bp. By comparison with the reference genome annotation (VCost. v3), the coverage length of markers was more than 99% in each chromosome of the grapevine genome (Fig. 1b). By analyzing the location of cSNPs in the genome, the three largest classes were: 7,199 cSNPs in introns, 4,659 cSNPs in intergenic regions, and 4,165 cSNPs in exons, respectively (Fig. 1c). That implied that a large number of markers were located in gene regions.
Figure 1.
Characteristics of GBTS array. (a) Distribution of cSNPs on each chromosome. Color indicates the number of cSNPs within 1 Mbp window size. (b) Coverage length of markers compared with reference genome (VCost. v3). (c) Annotation of the location of the cSNPs. (d) Number and proportion of MAF for all cSNPs.
Table 1. Characteristics of SNPs distributed on 19 grape chromosomes.
Chr. SNP no. mSNP no. Distance
(bp)Average SNP
interval (bp)mSNP/
SNP1 1,198 5,357 24,200,107 20,200 4.47 2 1,015 4,619 18,860,487 18,582 4.55 3 944 4,408 20,668,111 21,894 4.67 4 1,175 5,347 24,682,346 21,006 4.55 5 1,134 5,229 25,554,074 22,534 4.61 6 1,038 4,838 22,638,265 21,810 4.66 7 1,328 5,447 27,330,303 20,580 4.10 8 754 3,067 22,542,685 29,897 4.07 9 735 3,464 22,840,396 31,075 4.71 10 1,230 5,487 23,441,644 19,058 4.46 11 958 4,390 20,025,463 20,903 4.58 12 1,156 6,032 24,240,562 20,969 5.22 13 1,313 6,880 29,056,180 22,130 5.24 14 1,262 6,200 30,244,820 23,966 4.91 15 878 4,276 20,254,536 23,069 4.87 16 1,048 5,447 23,491,226 22,415 5.20 17 1,013 4,808 18,650,226 18,411 4.75 18 1,408 6,587 34,516,940 24,515 4.68 19 1,010 5,570 24,686,874 24,442 5.51 Total 20,597 97,453 457,925,245 22,233 4.73 Minor allele frequency (MAF) is an important indicator to evaluate the diversity of markers. In this array, the cSNP with MAF > 0.1 was 8,727, accounting for 42% of all cSNPs (Fig. 1d). A sufficient number of cSNPs with high MAF means that this array has the ability to detect different grapevine cultivars or lines.
Sex segregation in the hybrid population
-
Flower sex is closely related to cultivar selection, cultivation management, and the yield of grapes. Identification of sex related locus can quickly determine the sex of progenies in juvenile when wild grapes, usually male or female unisexual flowers, are used for making crosses. The hybrid population of V. vinifera 'Cabernet Sauvignon' (hermaphrodite flower – female parent) and V. pseudoreticulata 'Huadong1058' (male flower) was separated into flower types (Fig. 2a). The phenotypes of flower sex among the 131 F1 hybrids were collected in 2015 and 2019, respectively. In 2015, among the 83 seedlings bloomed, there were 58 males, 16 females, and 9 hermaphrodites. In 2019, all 131 progenies bloomed with 70 males, 47 females, and 14 hermaphrodites (Fig. 2b).
Figure 2.
The characteristics of flower sex. (a) Flower types among the individuals in the mapping population. (b) Distributions of flower types among the F1 hybrids in 2015 and 2019.
Mapping of sex determination locus
-
From the sequencing data, 27,647 polymorphic mSNPs were identified and divided into eight types: 'aa × bb', 'ab × cc', 'cc × ab', 'ab × cd', 'ef × eg', 'hk × hk', 'lm × ll' and 'nn × np' (Table 2). 'aa × bb' type markers were filtered out because they were not separated in the progeny. 'ab × cc', 'cc × ab', 'ab × cd' and 'ef × eg' types were filtered out because they were too less to have significant influence. Finally, only 'hk × hk', 'lm × ll' and 'nn × np' types of mSNPs were used to construct the genetic linkage map.
Table 2. Marker types distribution.
Marker types Cabernet Sauvignon Huadong
1058Marker number Percentage (%) aa × bb aa bb 10,022 36.25 ab × cc ab cc 84 0.30 cc × ab cc ab 65 0.24 ab × cd ab cd 0 0.00 ef × eg ef eg 86 0.31 hk × hk hk hk 883 3.19 lm × ll lm ll 12,225 44.22 nn × np nn np 4,282 15.49 Total − − 27,647 100.00 The remaining markers were used for further analysis. In segregation distortion analysis, the markers with p > 0.05 were retained. From a similarity analysis, the markers with similarity equal to 1 were filtered out. Then the LOD score of 6 was taken as the threshold for deciding whether loci were linked and the markers were discarded which significantly affected the linkage group marker order. These measures conduced to enhance the accuracy of genetic map and reduce the computational complexity. After that, the genetic linkage map with 422 mSNPs was constructed (Fig. 3). The map contained 19 linkage groups (LGs) and spanned 2,351.71 cM, with an average inter-SNP distance of 5.57 cM. The number of mSNPs on each LG ranged from 16 to 30. The LG8 had the longest length with 177.65 cM, and the LG19 had the shortest length with 80.38 cM (Supplemental Table S2).
Figure 3.
Genetic map of hybrid population crosses from Vitis vinifera 'Cabernet Sauvignon' × Vitis pseudoreticulata 'Huadong1058'. LG1 to LG19 represents 19 linkage groups respectively, and each bar represents a SNP marker. The ruler on the left is the genetic distance (cM).
There was one sex determination locus was identified on the linkage group (Fig. 4). This locus was located on LG2, between 54.74 and 58.80 cM. The physical position was from 3.29 to 5.78 Mbp and the LOD peak was located at 4.83 Mbp. The locus could be detected in 2015 and 2019 by interval mapping and MQM mapping which showed good repeatability, and the PVE (phenotypic variance explained) up to 98.6%.
Figure 4.
Mapping of sex determination locus. (a) Interval mapping of sex determination locus on chr2. (b) Multiple-QTL mapping of sex determination locus on chr2. (c) The overlap region of IM and MQM on chromosome 2, and the markers on the peak. The boundaries of locus were determined by the markers closest to the threshold (LOD = 30) on the flanks.
GWAS analysis of floral sex and linkage locus
-
GWAS approach was conducted by GAPIT (Version 3) using Blink, FarmCPU, SUPER, MLM, and GLM models (Fig. 5a). A locus on chr2 was obtained from different models based on the Manhattan plots constructed using data from both years. This locus had different boundaries among the five models, but its peak was consistently at 4.85 Mbp.
Figure 5.
GWAS Analysis of floral sex and linkage locus. (a) Manhattan plots and QQ plots of GWAS analysis based on five model (Blink, FarmCPU, SUPER, MLM and GLM). The vertical axis of the Manhattan map is the −log10(p) of each marker based on the analysis of different models. (b) Interval of sex determination locus on Chr2.
The boundary position of this locus was further determined using the 2019 segregation data, and the locus was confined between 3.02 Mbp and 6.81 Mbp in chromosome 2 (Fig. 5b). QQ plot of each Manhattan plot indicated that there were significant markers of deviation from random effects that were highly correlated with the phenotype in each group.
The identification of sex identification marker combination for MAS
-
In order to find SNP markers closely associated with flower sex types that could be used for MAS, two SNPs were identified within this locus based on the QTL mapping and GWAS analysis. The first marker with T/C substitution localized at 4,825,970 bp of chr2 was named as SLS1. The second marker localized at 4,758,220 bp near the peak position with C/T substitution, was named as SLS2. The SLS1 are 'CC' in 'Cabernet Sauvignon' and 'TC' in 'Huadong1058', and in SLS2, they are 'AG' in 'Cabernet Sauvignon' and 'GG' in 'Huadong1058', respectively. Individual genotypes and flower phenotypes in this locus were analyzed, and results showed that progenies with 'TC' in SLS1 were always male, regardless the genotypes in SLS2. When progenies carried 'CC' in SLS1, their flower types were determined by the alleles in SLS2, in which progenies with 'GG' are female while those carrying 'AG' were hermaphrodite. In summary, 'T' in SLS1 was tag SNP of male and 'A' in SLS2 was tag SNP of hermaphrodite. The genotypes of SLS1 and SLS2 can always accurately predict the flower types of grapevines, and those progenies carrying 'TC-XX' was male, while the progenies carrying 'CC-GG' was female and the progenies carrying 'CC-AX' was hermaphrodite ('X'represents any nucleotide type). We proved that the accuracy of flower type estimation was 100% in these hybrid populations (Fig. 6, Supplemental Table S3).
Figure 6.
Model of sex identification marker combination on chromosome 2. 'A', 'T', 'C' and 'G' represent four nucleotide type respectively, 'X' represents any nucleotide type. Two dashed lines of each sex type represent the two sister chromatids of chromosome 2.
To determine whether the SLS1-SLS2 combination can be used for prediction of flower types in different grapevines varieties/accessions, a total of 34 wine, table, juice, and rootstock grape varieties of which six are male, 27 are hermaphrodite, and one is female, were used for the validation study. Based on the results obtained from the segregating populations. All sex types were predicted accurately, which also confirms the accuracy of sex identification markers combination in this study (Supplemental Table S4).
-
In this study, the sex determination locus was mapped and sex identification marker combination was developed, using the sequencing data of 131 progenies from crosses of V. vinifera 'Cabernet Sauvignon' × V. pseudoreticulata 'Huadong1058' by GBTS. A total of 20,597 cSNP (97,453 mSNP) coving more than 99% of the genome were developed to construct SNP array, in which most of markers were located in gene regions and had sufficient diversity. In order to mapping the sex determination locus, sex types were surveyed in 2015 and 2019, genetic map construction and GWAS were performed using GBTS data. The sex determination locus was finally located at 54.74−58.80 cM by mapping and 3.02−6.81 Mbp by GWAS, with the common peak at 4.83 Mbp on chr2. In this locus, a marker combination of 'SLS1-SLS2' was identified, and 34 species/cultivars were used to evaluate the accuracy of the combination for identifying the sex type of grapevine.
-
About this article
Cite this article
Yang B, Wu W, Lv J, Li J, Xu Y, et al. 2023. Identification of sex determination locus and development of marker combination in Vitis based on genotyping by target sequencing. Fruit Research 3:31 doi: 10.48130/FruRes-2023-0031
Identification of sex determination locus and development of marker combination in Vitis based on genotyping by target sequencing
- Received: 21 April 2023
- Accepted: 11 August 2023
- Published online: 03 November 2023
Abstract: The grapevine is an important and economically valuable fruit crop, with flower sex being a key genetic trait that directly affects grapevine yield and quality. Despite its significance, there is a lack of studies on sex-linked molecular markers that can assist in grapevine breeding. In this study, we developed a grapevine single nucleotide polymorphism (SNP) marker array using a combination of genotyping by target sequencing (GBTS) and capture-in-solution technology and applied it to marker-assisted selection (MAS) of grapevine gender. The SNP array could detect a total of 20,597 core SNPs and 97,453 multiple SNPs (mSNPs), covering over 99% of the grapevine genome on each chromosome. A total of 131 progenies from a cross between Vitis vinifera 'Cabernet Sauvignon' and Vitis pseudoreticulata 'Huadong1058' that exhibited segregated sex phenotypes were sequenced using this array. Through locus mapping and a genome-wide association study (GWAS), a locus on chromosome 2 (54.74−58.80 cM) that explained 98.6% of the phenotypic variation was identified. To further utilize this locus, a sex prediction marker combination consisting of two SNPs was developed, which accurately predicted the sex of 34 natural grapevine varieties/accessions. This study demonstrates the application of GBTS in grapevine breeding and provides a reliable MAS marker set for early-stage sex selection.
-
Key words:
- Grapevine /
- GBTS technology /
- Sex determination locus /
- Genetic breeding /
- Marker-assisted selection