Gene-based Breeding (GBB), a novel discipline of biological science and technology for plant and animal breeding

Hong-Bin Zhang; Hong-Bin Zhang

doi:10.48130/tp-0024-0005

Gene-based breeding (GBB) is an innovative technology and science for plant and animal breeding. Studies have shown that GBB is extremely powerful, predictable, accelerated, and cost-efficient for both pure-line and hybrid variety breeding. Moreover, the concepts, principles, techniques, and methodologies developed and used for GBB are also applicable to molecular precision agriculture, such as gene-based agriculture, and molecular precision medicine in humans as well as in animals, such as gene-based health, gene-based clinics, and gene-based medicine. Therefore, research, development, and applications of GBB for plant and animal breeding are promising to promote substantial crop and livestock genetic improvement, enhanced agriculture production, and improvement and transition of current phenotypic medicine to genotypic medicine in humans and animals.

HTML

Introduction

Food production and security has been one of the most important issues worldwide, especially as the world's population has rapidly increased^[1] and the global climate has changed^[2] in the past decades. Global climate changes, including, but not limited to, elevated temperature, increased erratic rainfall, declined water table, and raised drought incidence, are threatening the world's food production and security. Therefore, it has been a major research subject in agriculture on how to feed the world. It has been the consensus that development and application of genetically improved crop varieties and livestock strains is the most efficient, economical, and environmentally friendly approach to continuously increase or sustain the world's food production and security. To improve the ability, efficiency, and productivity and/or accelerate the process of developing improved crop varieties and livestock strains, several molecular methods have been developed^[3], including marker-assisted selection^[4], genetic engineering^[5], RNA interference or RNAi^[6], genomic selection^[7,8], gene or genome editing^[9], and gene-based breeding^[10−16]. As most agronomic traits important to food production, such as yield, quality, and biotic and abiotic resistances or tolerances, are complex traits each controlled by numerous genes while marker-assisted selection, genetic engineering, RNAi, and gene editing can often manipulate at a time only a single or few genes controlling an agronomical trait, they are more suitable to improve elite varieties with only a single or few undesirable traits influenced by the unfavorable alleles of one to a few genes^[3]. Therefore, only genomic selection and gene-based breeding have been the method of choice that can assist at development of or develop brand new varieties with complete intellectual properties^[3]. Given that genomic selection has been demonstrated to be efficient only for genome-wide assisted progeny selection and it is impossible for genomic selection to help develop a superior new variety from a progeny pool without superior individuals, gene-based breeding has become the method of choice to develop brand new varieties with complete intellectual properties through the entire breeding process. In this review article, gene-based breeding, therefore, is introduced, including what is gene-based breeding; concept, research, development, and potential; advanced research and development; and perspectives. Research, development, and application of gene-based breeding for improved crop varieties and livestock strains is promising to improve current best varieties substantially and continuously, thus helping feed the world.

Concept, research, development, and potential

Advanced research and development

As GBB is a recently developed novel technology for plant and animal breeding, additional research and development are needed to obtain a robust GBB system for improved pure-line or hybrid variety development in a crop or livestock species. Although the AI technology has been already incorporated into GBB, improvement of the AI is necessary to further empower GBB for variety development. Research and development of GBB in cotton and maize have shown that development of a robust GBB system for a crop or livestock species could be accomplished within 5–10 years, from beginning to a robust GBB system. For instance, it took only five years for Liu et al.^[10,12] & Zhang et al.^[11,13,14] to genome-wide clone and functionally characterize most, if not all of, the genes controlling fiber and oilseed yields and fiber yield and quality component traits in cotton and the genes controlling grain yield and grain yield and quality component traits in maize, respectively, using a novel technology, designated artificial intelligence (AI)-gExpress, and approximately ${\$} $1.0 million for each of the crop species. AI-gExpress technology has been demonstrated to be applicable to genome-wide high-throughput cloning of genes controlling complex traits in any species, including plants, animals, humans, and microbes, regardless of genome size, genome complexity, ploidy level, and availability of genomic and molecular resources and knowledge. To develop a robust GBB system for a crop or livestock species that contains most, if not all, of genes controlling an agronomical trait and the genes controlling most, if not all, of important agronomic traits, a population consisting of elite varieties, advanced breeding lines, and germplasm lines that represents the genetic diversity and variation of agronomical traits in the species, such as the genome-wide association study (GWAS) panels used for genome-wide association mapping, should be used for genome-wide high-throughput gene cloning and validation of their utility and efficiency for GBB. The genes controlling an agronomical trait include both those applicable to GBB for different breeding programs for a variety of environments or eco-agricultural production systems and those applicable to GBB for a single or few breeding programs for limited environments or eco-agricultural production systems. Therefore, GBB that is universal across environments and across diverse populations and specific to a single or few environments and limited populations can be developed for pure-line or hybrid variety breeding. To identify the key genes controlling an agronomical trait to simplify GBB progeny selection and reduce its cost, Liu et al.^[12] & Zhang et al.^[13] examined several methods using fiber length in cotton and grain yield in maize, respectively, that were based on (1) the biological impacts of gene expression variation on objective traits; (2) the biological impacts of gene SNP/InDel mutations on objective traits; (3) the biological impacts of gene co-expression network gene node variation on objective traits; and (4) the biological impacts of gene co-expression network gene interaction edge variation on objective traits. The first method was shown to be a method of choice for both traits in the two crop species while the second to fourth methods were specific to cotton fiber length or maize grain yield. However, it remains to test the hub genes in the regulatory network of the genes controlling an agronomical trait for identification of key genes for GBB. Finally, the expressions of transcripts of the genes responsible for an agronomical trait have been demonstrated to be efficient for GBB^[12−15]. Zhang et al.^[26] showed that gene transcript expression in a tissue at a developmental stage was highly reproducible (r = 0.90–0.98, p < 0.0001) across biological replicates and across years, indicating that gene transcript expression is well suitable for GBB. However, a procedure that allows high- or moderate-throughput, simple, rapid, and economical RNA isolation is needed for GBB with gene transcript expressions. Utility and efficiency of transcript expressions of the genes controlling an agronomical trait in different tissues at different stages are to be examined for GBB.

Perspectives

GBB can be a revolutionary technology for breeding in all field crops, vegetable crops, fruit trees, and livestock for either pure-line varieties (or strains) or hybrid varieties, but only a preliminary GBB system has been established to date in maize and cotton. Additional research is necessary to develop the GBB in maize and cotton into robust GBB systems that are suited for enhanced breeding across environments and across populations in different breeding programs. Importantly, research and development of GBB in crops and livestock can also promote research, development, and application of molecular precision agriculture such as gene-based agriculture for enhanced food production, and transition of human medicine from current phenotypic medicine to genotypic medicine for improved medicine, such as gene-based health, gene-based clinics, and gene-based medicine. The invention of the AI-gExpress technology for genome-wide high-throughput cloning of genes controlling biological traits or processes, genome-wide high-throughput coning of the genes controlling cotton fiber length^[10,12] and the genes controlling maize grain yield^{[11, 13,14]}, and the research in utility and efficiency of the cloned genes for GBB^[12−16] have, for the first time, made it feasible and possible to genome-wide high-throughput clone most, if not all, of the genes controlling biological traits or processes in a species and have, for the first time, revealed that the actual number of genes controlling a biological trait or process much larger than that expected and that identified by QTL mapping and genome-wide association studies. This finding indicated that a quantitative or complex trait is much more complicated than we expected. This knowledge is crucial to the development of new methods and/or improvement of current methods for plant and animal breeding and human medicine for which the number of genes controlling the objective trait and their interactions should be considered. The three genic datasets of the genes controlling breeding objective traits used for GBB, including their NFAs, their SNPs/InDels significantly influencing the performance of objective traits, and the expressions and networks of their transcripts responsible for the phenotype and performance of the objective traits, are essential to research, development, and application of molecular precision agriculture. For instance, agricultural production practices, such as fertilization, irrigation, disease, pest and weed control, and drought and heat/cold stress mitigation, could be carried out based on and/or by regulating the activities and/or networks of genes controlling agronomical traits important to agricultural production. For human medicine, the SNPs/InDels that significantly influence human health or lead to human genetic diseases and/or the expressions and networks of gene transcripts responsible for human diseases are useful for human health, clinics, and medicine. The methodologies developed for GBB, such as the method for high- or moderate-throughput genotyping of individuals with genes controlling biological traits or processes, and the high- or moderate-throughput, high quality, rapid, and economical method for RNA extraction, can be used for both molecular precision agriculture and human precision medicine, including health, clinics, and medicine.

Author contributions

The author confirms sole responsibility for the following: study conception and design, data collection, analysis and interpretation of results, and manuscript preparation.

[1]	Godfray HCJ, Beddington JR, Crute IR, Haddad L, Lawrence D, et al. 2010. Food security: the challenge of feeding 9 billion people. Science 327:812−18 doi: 10.1126/science.1185383 CrossRef Google Scholar
[2]	Adger N, Aggarwal P, Agrawala S, Alcamo J, Allali A, et al. 2007. Technical Summary. In Climate Change 2007 Impacts, Adaptation and Vulnerability. Contribution of Working Group II to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, New York. pp. 25−77.
[3]	Zhang MP, Liu YH, Zhang HB. 2021. Molecular breeding for improving yield in maize: recent advances and future perspectives. In Molecular Breeding in Wheat, Maize and Sorghum: Strategies for Improving Abiotic Stress Tolerance and Yield, eds. Hossain MA, Alam M, Seneweera S, Rakshit S, Henry R. Wallingford: CAB International. pp. 380−404. https://doi.org/10.1079/9781789245431.0022
[4]	Collard BCY, Mackill DJ. 2008. Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philosophical Transactions of the Royal Society B: Biological Sciences 363:557−72 doi: 10.1098/rstb.2007.2170 CrossRef Google Scholar
[5]	Datta A. 2013. Genetic engineering for improving quality and productivity of crops. Agriculture & Food Security 2:15 doi: 10.1186/2048-7010-2-15 CrossRef Google Scholar
[6]	Yogindran S, Rajam MV. 2015. RNAi for crop improvement. In Plant Biology and Biotechnology, eds. Bahadur B, Venkat M, Rajam M, Sahijram L, Krishnamurth K. New Delhi: Springer. pp. 623–37. https://doi.org/10.1007/978-81-322-2283-5_31
[7]	Meuwissen THE, Hayes BJ, Goddard ME. 2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819−29 doi: 10.1093/genetics/157.4.1819 CrossRef Google Scholar
[8]	Desta ZA, Ortiz R. 2014. Genomic selection: genome-wide prediction in plant improvement. Trends in Plant Science 19:592−601 doi: 10.1016/j.tplants.2014.05.006 CrossRef Google Scholar
[9]	Gao C. 2021. Genome engineering for crop improvement and future agriculture. Cell 184:1621−35 doi: 10.1016/j.cell.2021.01.005 CrossRef Google Scholar
[10]	Liu YH, Zhang MP, Zhang Y, Smith CW, Hague S, et al. 2014. Large-scale cloning and characterization of genes controlling fiber length for deciphering of the molecular basis of fiber quality and development of a gene-based breeding system in cotton. International Plant & Animal Genome Conference XXII, San Diego, California, January 11−15, 2014. Presentation no. 474. Scherago International.
[11]	Zhang MP, Zhi H, Chang F, Zhang Y, Liu YH, et al. 2014. Large-scale cloning and characterization of genes controlling grain yield for deciphering of the molecular basis of grain yield and development of a gene-based breeding system in maize. International Plant & Animal Genome Conference XXII, San Diego, California, January 11−15, 2014. Presentation no. 875. Scherago International.
[12]	Liu YH, Xu Y, Zhang MP, Cui Y, Sze SH, et al. 2020. Accurate prediction of a quantitative trait using the genes controlling the trait for gene-based breeding in cotton. Frontiers in Plant Science 11:583277 doi: 10.3389/fpls.2020.583277 CrossRef Google Scholar
[13]	Zhang M, Cui Y, Liu YH, Xu W, Sze SH, et al. 2020. Accurate prediction of maize grain yield using its contributing genes for gene-based breeding. Genomics 112:225−36 doi: 10.1016/j.ygeno.2019.02.001 CrossRef Google Scholar
[14]	Zhang M, Liu YH, Wang Y, Sze SH, Scheuring CF, et al. 2022. Genome-wide identification of genes enabling accurate prediction of hybrid performance from parents across environments and populations for gene-based breeding in maize. Plant Science 324:111424 doi: 10.1016/j.plantsci.2022.111424 CrossRef Google Scholar
[15]	Liu YH, Zhang M, Scheuring CF, Cilkiz M, Sze SH, et al. 2022. Accurate prediction of complex traits for individuals and offspring from parents using a simple, rapid, and efficient method for gene-based breeding in cotton and maize. Plant Science 316:111153 doi: 10.1016/j.plantsci.2021.111153 CrossRef Google Scholar
[16]	Liu YH, Zhang M, Sze SH, Smith CW, Zhang HB. 2022. Analysis of the genes controlling cotton fiber length reveals the molecular basis of plant breeding and the genetic potential of current cultivars for continued improvement. Plant Science 321:111318 doi: 10.1016/j.plantsci.2022.111318 CrossRef Google Scholar
[17]	Saint Pierre C, Burgueño J, Crossa J, Fuentes Dávila G, Figueroa López P, et al. 2016. Genomic prediction models for grain yield of spring bread wheat in diverse agro-ecological zones. Scientific Reports 6:27312 doi: 10.1038/srep27312 CrossRef Google Scholar
[18]	Weissbrod O, Geiger D, Rosset S. 2016. Multikernel linear mixed models for complex phenotype prediction. Genome Research 26:969−79 doi: 10.1101/gr.201996.115 CrossRef Google Scholar
[19]	Zenke-Philippi C, Thiemann A, Seifert F, Schrag T, Melchinger AE, et al. 2016. Prediction of hybrid performance in maize with a ridge regression model employed to DNA markers and mRNA transcription profiles. BMC Genomics 7:262 doi: 10.1186/s12864-016-2580-y CrossRef Google Scholar
[20]	Duhnen A, Gras A, Teyssèdre S, Romestant M, Claustres B, et al. 2017. Genomic selection for yield and seed protein content in soybean: a study of breeding program data and assessment of prediction accuracy. Crop Science 57:1325−37 doi: 10.2135/cropsci2016.06.0496 CrossRef Google Scholar
[21]	Gapare W, Liu S, Conaty W, Zhu QH, Gillespie V. 2018. Historical datasets support genomic selection models for the prediction of cotton fiber quality phenotypes across multiple environments. G3 8:1721−32 doi: 10.1534/g3.118.200140 CrossRef Google Scholar
[22]	Alves FC, Granato ÍSC, Galli G, Lyra DH, Fritsche-Neto R, et al. 2019. Bayesian analysis and prediction of hybrid performance. Plant Methods 15:14 doi: 10.1186/s13007-019-0388-x CrossRef Google Scholar
[23]	Xu S, Xu Y, Gong L, Zhang Q. 2016. Metabolomic prediction of yield in hybrid rice. The Plant Journal 88:219−27 doi: 10.1111/tpj.13242 CrossRef Google Scholar
[24]	Dan Z, Hu J, Zhou W, Yao G, Zhu R, et al. 2016. Metabolic prediction of important agronomic traits in hybrid rice (Oryza sativa L.). Scientific Reports 6:21732 doi: 10.1038/srep21732 CrossRef Google Scholar
[25]	Speed D, Balding DJ. 2014. MultiBLUP: improved SNP-based prediction for complex traits. Genome Research 24:1550−57 doi: 10.1101/gr.169375.113 CrossRef Google Scholar
[26]	Zhang M, Liu YH, Chang CS, Zhi H, Wang S, et al. 2019. Quantification of gene expression while taking into account RNA alternative splicing. Genomics 111:1517−28 doi: 10.1016/j.ygeno.2018.10.009 CrossRef Google Scholar
[27]	Zhang MP, Liu YH, Xu W, Smith CW, Murray SC, et al. 2020. Analysis of the genes controlling three quantitative traits in three diverse plant species reveals the molecular basis of quantitative traits. Scientific Reports 10:10074 doi: 10.1038/s41598-020-66271-8 CrossRef Google Scholar

Consistency of GBB with PS	ZmINGY genic datasets
Consistency of GBB with PS	I	II	III	I + II	I + III	II + III	I + II + III
Field trials, Halfway, Texas, 2010	40.0%	50.0%	66.7%	100.0%	66.7%	100.0%	100.0%
Field Trials, College Station, Texas, 2010	41.2%	33.3%	55.6%	80.0%	100.0%	100.0%	100.0%
I. Number of favorable alleles (NFAs) of 27 SNP/InDel-containing ZmINGY genes; II. SNPs/InDels of the 27 SNP/InDel-containing ZmINGY genes; III. The transcript expressions of the 150 key ZmINGY genes. Note that when the grain yields of the plants predicted with two or all three genic datasets of the ZmINGY genes were jointly used for progeny selection, the top 10% plants selected with the highest grain yields predicted with the genes were consistent up to 100% with those selected with the highest grain yields determined by replicated field trials. Halfway, Texas and College Station, Texas represent two different agricultural ecosystems and climate zones in the USA.

Statistical analysis (ANOVA and LSD)	Effect	Genotype of a gene
Statistical analysis (ANOVA and LSD)	Effect	AA	Aa	aa
AA > Aa > aa	Additive	2	1	0
Aa = AA > aa	complete dominant	2	2	0
Aa > AA > aa	Over-dominant	2	3	0
Allele 'A' is the favorable allele over allele 'a', when 'AA' is larger than 'aa' (p ≤ 0.05). The total NFAs of all genes controlling an agronomical trait is calculated with the following formula: y = ∑x_i, where y is the total NFAs of the genes controlling an agronomical trait, x is the NFAs of individual genes controlling the trait, which is 0, 1, 2, or 3, in individual 'i'.

{{lists.name}}

Gene-based Breeding (GBB), a novel discipline of biological science and technology for plant and animal breeding

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors