Pecan kinome: classification and expression analysis of all protein kinases in <i>Carya illinoinensis</i>

Kaikai Zhu; Pinghua Fan; Hui Liu; Juan Zhao; Pengpeng Tan; Zhenghai Mo; Fangren Peng; Kaikai Zhu; Pinghua Fan; Hui Liu; Juan Zhao; Pengpeng Tan; Zhenghai Mo; Fangren Peng

doi:10.48130/FR-2021-0014

2021 Volume 1

Article Contents

Next Previous

ARTICLE Open Access

Pecan kinome: classification and expression analysis of all protein kinases in Carya illinoinensis

1.
Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, Jiangsu 210037, China
2.
State Key Laboratory of Crop Genetics and Germplasm Enhancement, Ministry of Agriculture and Rural Affairs Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops in East China, College of Horticulture, Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
3.
Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing 210014, China

More Information

Corresponding author: frpeng@njfu.edu.cn

Received: 17 May 2021
Accepted: 03 August 2021
Published online: 18 August 2021
Forestry Research 1, Article number: 14 (2021) | Cite this article

Abstract

Protein kinases (PKs) are involved in plant growth and stress responses, and constitute one of the largest superfamilies due to numerous gene duplications. However, limited PKs have been functionally described in pecan, an economically important nut tree. Here, the comprehensive identification, annotation and classification of the entire pecan kinome are reported. A total of 967 PK genes were identified from the pecan genome, and further classified into 20 different groups and 121 subfamilies using the kinase domain sequences, which were verified by phylogenetic analysis. The receptor-like kinase (RLK) group contained 565 members, which constituted the largest group. Gene duplication contributed to the expansion of pecan kinome, 169 segmental duplication events including 285 PK genes were found, and the Ka/Ks ratio revealed they experienced strong negative selection. The RNA-Seq data of PK genes in pecan were further analyzed at the subfamily level, and different PK subfamilies performed various expression patterns across pecan embryo development or drought treatment, suggesting PK genes in pecan are involved in embryo development and drought stress response. Taken together, this study provides insight into the classification, expansion, evolution, and expression of pecan PKs. Our findings regarding expansion, expression and co-expression analyses lay a good foundation for future research to understand the roles of pecan PKs, and more efficiently determine the key candidate genes.
- Carya illinoinensis,
- Classification,
- Co-expression networks,
- Expression patterns,
- Kinome

Supplementary information

Supplemental Table S1 Kinase domain annotation of 967 pecan PKs.
Supplemental Table S2 Family classification of pecan PKs with related information.
Supplemental Table S3 Domain organization of pecan PKs.
Supplemental Table S4 List of pecan protein kinases containing multiple kinase domains.
Supplemental Table S5 GO IDs of pecan PKs.
Supplemental Table S6 Segmental duplication events and Ka/Ks values of pecan protein kinases.
Supplemental Table S7 FPKM values of pecan PK genes during embryo development.
Supplemental Table S8 Genes in eight groups with different expression patterns during pecan embryo developmen.
Supplemental Table S9 Average FPKM expression values of pecan PK genes under drought stress.
Supplemental Table S10 Differentially expressed PK genes in six clusters with different expression patterns under drought stress.
Supplemental Fig. S1 Phylogenetic classification of pecan PKs. The phylogenetic tree was generated with amino sequences of the kinase domain with maximum-likelihood method. PK subfamilies were highlighted with different colors.
Supplemental Fig. S2 Phylogenetic classification of PKs in four different species. The phylogenetic tree was generated with amino sequences of the kinase domain from four different species (967 from pecan, 1006 from Arabidopsis, 1168 from grape and 758 from pineapple) with maximum-likelihood method.
Supplemental Fig. S3 Expression of PK genes during embryo development in pecan. Log₂ (FPKM+1) values were performed according to the red-white-blue color scale, and the heatmaps were generated using the R language with hierarchical clustering.
Supplemental Fig. S4 Different expression patterns of pecan PK genes during embryo development.
Supplemental Fig. S5 Expression data of PK subfamilies with drought treatment in pecan. Log₂ (FPKM+1) values were performed according to the red-white-blue color scale, and the heatmaps were generated using R language with hierarchical clustering.

Rights and permissions
Copyright: © 2021 by the author(s). Exclusive Licensee Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Bennett J. 1991. Protein phosphorylation in green plant chloroplasts. Annual Review of Plant Physiology and Plant Molecular Biology 42:281−311 doi: 10.1146/annurev.pp.42.060191.001433 CrossRef Google Scholar
[2]	Stone JM, Walker JC. 1995. Plant protein kinase families and signal transduction. Plant Physiology 108:451−57 doi: 10.1104/pp.108.2.451 CrossRef Google Scholar
[3]	Champion A, Kreis M, Mockaitis K, Picaud A, Henry Y. 2004. Arabidopsis kinome: after the casting. Functional & Integrative Genomics 4:163−87 doi: 10.1007/s10142-003-0096-4 CrossRef Google Scholar
[4]	Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S. 2002. The protein kinase complement of the human genome. Science 298:1912−34 doi: 10.1126/science.1075762 CrossRef Google Scholar
[5]	Hanks SK, Quinn AM, Hunter T. 1988. The protein kinase family: conserved features and deduced phylogeny of the catalytic domains. Science 241:42−52 doi: 10.1126/science.3291115 CrossRef Google Scholar
[6]	Lehti-Shiu MD, Shiu SH. 2012. Diversity, classification and function of the plant protein kinase superfamily. Philosophical Transactions of the Royal Society B - Biological Sciences 367:2619−39 doi: 10.1098/rstb.2012.0003 CrossRef Google Scholar
[7]	Liu J, Chen N, Grant JN, Cheng ZM, Stewart CN Jr, et al. 2015. Soybean kinome: functional classification and gene expression patterns. Journal of Experimental Botany 66:1919−34 doi: 10.1093/jxb/eru537 CrossRef Google Scholar
[8]	Zhu K, Wang X, Liu J, Tang J, Cheng Q, et al. 2018. The grapevine kinome: annotation, classification and expression patterns in developmental processes and stress responses. Horticulture Research 5:19 doi: 10.1038/s41438-018-0027-0 CrossRef Google Scholar
[9]	Hanada K, Zou C, Lehti-Shiu MD, Shinozaki K, Shiu SH. 2008. Importance of lineage-specific expansion of plant tandem duplicates in the adaptive response to environmental stimuli. Plant Physiology 148:993−1003 doi: 10.1104/pp.108.122457 CrossRef Google Scholar
[10]	Lehti-Shiu MD, Zou C, Hanada K, Shiu SH. 2009. Evolutionary history and stress regulation of plant receptor-like kinase/pelle genes. Plant Physiology 150:12−26 doi: 10.1104/pp.108.134353 CrossRef Google Scholar
[11]	Dardick C, Chen J, Richter T, Ouyang S, Ronald P. 2007. The rice kinase database. A phylogenomic database for the rice kinome. Plant Physiology 143:579−86 doi: 10.1104/pp.106.087270 CrossRef Google Scholar
[12]	Gish LA, Clark SE. 2011. The RLK/Pelle family of kinases. The Plant Journal 66:117−27 doi: 10.1111/j.1365-313X.2011.04518.x CrossRef Google Scholar
[13]	Zhu K, Fan P, Mo Z, Tan P, Feng G, et al. 2020. Identification, expression and co-expression analysis of R2R3-MYB family genes involved in graft union formation in pecan (Carya illinoinensis). Forests 11:917 doi: 10.3390/f11090917 CrossRef Google Scholar
[14]	Guo W, Chen J, Li J, Huang J, Wang Z, et al. 2020. Portal of Juglandaceae: A comprehensive platform for Juglandaceae study. Horticulture Research 7:35 doi: 10.1038/s41438-020-0256-x CrossRef Google Scholar
[15]	Huang Y, Xiao L, Zhang Z, Zhang R, Wang Z, et al. 2019. The genomes of pecan and Chinese hickory provide insights into Carya evolution and nut nutrition. Gigascience 8:giz036 doi: 10.1093/gigascience/giz036 CrossRef Google Scholar
[16]	Panchy N, Lehti-Shiu M, Shiu SH. 2016. Evolution of gene duplication in plants. Plant Physiology 171:2294−316 doi: 10.1104/pp.16.00523 CrossRef Google Scholar
[17]	Cannon SB, Mitra A, Baumgarten A, Young ND, May G. 2004. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biology 4:10 doi: 10.1186/1471-2229-4-10 CrossRef Google Scholar
[18]	Zhu K, Chen F, Liu J, Chen X, Hewezi T, et al. 2016. Evolution of an intron-poor cluster of the CIPK gene family and expression in response to drought stress in soybean. Scientific Reports 6:28225 doi: 10.1038/srep28225 CrossRef Google Scholar
[19]	Chen X, Ding Y, Yang Y, Song C, Wang B, et al. 2021. Protein kinases in plant responses to drought, salt, and cold stress. Journal of Integrative Plant Biology 63:53−78 doi: 10.1111/jipb.13061 CrossRef Google Scholar
[20]	Ferreira-Neto JRC, Borges AN da C, da Silva MD, Morais DA de L, Bezerra-Neto JP, et al. 2021. The cowpea kinome: genomic and transcriptomic analysis under biotic and abiotic stresses. Frontiers in Plant Science 12:667013 doi: 10.3389/fpls.2021.667013 CrossRef Google Scholar
[21]	Zhu J. 2016. Abiotic stress signaling and responses in plants. Cell 167:313−24 doi: 10.1016/j.cell.2016.08.029 CrossRef Google Scholar
[22]	Bundó M, Coca M. 2017. Calcium-dependent protein kinase OsCPK10 mediates both drought tolerance and blast disease resistance in rice plants. Journal of Experimental Botany 68:2963−75 doi: 10.1093/jxb/erx145 CrossRef Google Scholar
[23]	Andrási N, Rigó G, Zsigmond L, Pérez-Salamó I, Papdi C, et al. 2019. The mitogen-activated protein kinase 4-phosphorylated heat shock factor A4A regulates responses to combined salt and heat stresses. Journal of Experimental Botany 70:4903−18 doi: 10.1093/jxb/erz217 CrossRef Google Scholar
[24]	Wei K, Wang Y, Zhong X, Pan S. 2014. Protein kinase structure, expression and regulation in maize drought signaling. Molecular Breeding 34:583−602 doi: 10.1007/s11032-014-0059-6 CrossRef Google Scholar
[25]	Zhu K, Liu H, Chen X, Cheng Q, Cheng ZM. 2018. The kinome of pineapple: catalog and insights into functions in crassulacean acid metabolism plants. BMC Plant Biology 18:199 doi: 10.1186/s12870-018-1389-z CrossRef Google Scholar
[26]	Hindle MM, Martin SF, Noordally ZB, van Ooijen G, Barrios-Llerena ME, et al. 2014. The reduced kinome of Ostreococcus tauri: core eukaryotic signalling components in a tractable model species. BMC Genomics 15:640 doi: 10.1186/1471-2164-15-640 CrossRef Google Scholar
[27]	Zulawski M, Schulze G, Braginets R, Hartmann S, Schulze WX. 2014. The Arabidopsis Kinome: phylogeny and evolutionary insights into functional diversification. BMC Genomics 15:548 doi: 10.1186/1471-2164-15-548 CrossRef Google Scholar
[28]	Dievart A, Gottin C, Périn C, Ranwez V, Chantret N. 2020. Origin and diversity of plant receptor-like kinases. Annual Review of Plant Biology 71:131−56 doi: 10.1146/annurev-arplant-073019-025927 CrossRef Google Scholar
[29]	Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, et al. 2005. Modeling gene and genome duplications in eukaryotes. PNAS 102:5454−59 doi: 10.1073/pnas.0501102102 CrossRef Google Scholar
[30]	Zhang Z, Li J, Zhao X, Wang J, Wong GKC, et al. 2006. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genomics Proteomics Bioinformatics 4:259−63 doi: 10.1016/S1672-0229(07)60007-2 CrossRef Google Scholar
[31]	Hou J, Wei S, Pan H, Zhuge Q, Yin T. 2019. Uneven selection pressure accelerating divergence of Populus and Salix. Horticulture Research 6:37 doi: 10.1038/s41438-019-0121-y CrossRef Google Scholar
[32]	Antolín-Llovera M, Ried MK, Binder A, Parniske M. 2012. Receptor kinase signaling pathways in plant-microbe interactions. Annual Review of Phytopathology 50:451−73 doi: 10.1146/annurev-phyto-081211-173002 CrossRef Google Scholar
[33]	Liang X, Zhou JM. 2018. Receptor-like cytoplasmic kinases: central players in plant receptor kinase–mediated signaling. Annual Review of Plant Biology 69:267−99 doi: 10.1146/annurev-arplant-042817-040540 CrossRef Google Scholar
[34]	Chandran AKN, Yoo YH, Cao P, Sharma R, Sharma M, et al. 2016. Updated Rice Kinase Database RKD 2.0: enabling transcriptome and functional analysis of rice kinase genes. Rice 9:40 doi: 10.1186/s12284-016-0106-5 CrossRef Google Scholar
[35]	Nodine MD, Yadegari R, Tax FE. 2007. RPK1 and TOAD2 are two receptor-like kinases redundantly required for Arabidopsis embryonic pattern formation. Developmental Cell 12:943−56 doi: 10.1016/j.devcel.2007.04.003 CrossRef Google Scholar
[36]	Li J. 2010. Multi-tasking of somatic embryogenesis receptor-like protein kinases. Current Opinion in Plant Biology 13:509−14 doi: 10.1016/j.pbi.2010.09.004 CrossRef Google Scholar
[37]	Wang R, Li L, Cao Z, Zhao Q, Li M, et al. 2012. Molecular cloning and functional characterization of a novel apple MdCIPK6L gene reveals its involvement in multiple abiotic stress tolerance in transgenic plants. Plant Molecular Biology 79:123−35 doi: 10.1007/s11103-012-9899-9 CrossRef Google Scholar
[38]	Meng D, Dong B, Niu L, Song Z, Wang L, et al. 2021. The pigeon pea CcCIPK14-CcCBL1 pair positively modulates drought tolerance by enhancing flavonoid biosynthesis. Plant Journal 106:1278−97 doi: 10.1111/tpj.15234 CrossRef Google Scholar
[39]	Lu L, Chen X, Wang P, Lu Y, Zhang J, et al. 2021. CIPK11: a calcineurin B-like protein-interacting protein kinase from Nitraria tangutorum, confers tolerance to salt and drought in Arabidopsis. BMC Plant Biology 21:123 doi: 10.1186/s12870-021-02878-x CrossRef Google Scholar
[40]	Fujii H, Verslues PE, Zhu JK. 2011. Arabidopsis decuple mutant reveals the importance of SnRK2 kinases in osmotic stress responses in vivo. PNAS 108:1717−22 doi: 10.1073/pnas.1018367108 CrossRef Google Scholar
[41]	Gao L, Xue H. 2012. Global analysis of expression profiles of rice receptor-like kinase genes. Molecular Plant 5:143−53 doi: 10.1093/mp/ssr062 CrossRef Google Scholar
[42]	Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, et al. 2016. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Research 44:D279−D285 doi: 10.1093/nar/gkv1344 CrossRef Google Scholar
[43]	Eddy SR. 1998. Profile hidden Markov models. Bioinformatics 14:755−63 doi: 10.1093/bioinformatics/14.9.755 CrossRef Google Scholar
[44]	Letunic I, Khedkar S, Bork P. 2021. SMART: recent updates, new developments and status in 2020. Nucleic Acids Research 49:D458−D460 doi: 10.1093/nar/gkaa937 CrossRef Google Scholar
[45]	Katoh K, Rozewicki J, Yamada KD. 2019. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Briefings in Bioinformatics 20:1160−66 doi: 10.1093/bib/bbx108 CrossRef Google Scholar
[46]	Price MN, Dehal PS, Arkin AP. 2009. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Molecular Biology and Evolution 26:1641−50 doi: 10.1093/molbev/msp077 CrossRef Google Scholar
[47]	Yu CS, Chen YC, Lu CH, Hwang JK. 2006. Prediction of protein subcellular localization. Proteins 64:643−51 doi: 10.1002/prot.21018 CrossRef Google Scholar
[48]	Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25:3389−402 doi: 10.1093/nar/25.17.3389 CrossRef Google Scholar
[49]	Wang Y, Li J, Paterson AH. 2013. MCScanX-transposed: detecting transposed gene duplications based on multiple colinearity scans. Bioinformatics 29:1458−60 doi: 10.1093/bioinformatics/btt150 CrossRef Google Scholar
[50]	Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, et al. 2007. Clustal W and Clustal X version 2.0. Bioinformatics 23:2947−48 doi: 10.1093/bioinformatics/btm404 CrossRef Google Scholar
[51]	Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, et al. 2020. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Molecular Plant 13:1194−202 doi: 10.1016/j.molp.2020.06.009 CrossRef Google Scholar
[52]	Kim D, Langmead B, Salzberg SL. 2015. HISAT: a fast spliced aligner with low memory requirements. Nature Methods 12:357−60 doi: 10.1038/nmeth.3317 CrossRef Google Scholar
[53]	Pertea M, Pertea GM, Antonescu CM, Chang TC, Mendell JT, et al. 2015. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature Biotechnology 33:290−95 doi: 10.1038/nbt.3122 CrossRef Google Scholar
[54]	Li B, Dewey CN. 2011. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12:323 doi: 10.1186/1471-2105-12-323 CrossRef Google Scholar
[55]	Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15:550 doi: 10.1186/s13059-014-0550-8 CrossRef Google Scholar
[56]	Ernst J, Bar-Joseph Z. 2006. STEM: a tool for the analysis of short time series gene expression data. BMC Bioinformatics 7:191 doi: 10.1186/1471-2105-7-191 CrossRef Google Scholar
[57]	Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, et al. 2003. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Research 13:2498−504 doi: 10.1101/gr.1239303 CrossRef Google Scholar

About this article

Cite this article

Zhu K, Fan P, Liu H, Zhao J, Tan P, et al. 2021. Pecan kinome: classification and expression analysis of all protein kinases in Carya illinoinensis. Forestry Research 1: 14 doi: 10.48130/FR-2021-0014

Zhu K, Fan P, Liu H, Zhao J, Tan P, et al. 2021. Pecan kinome: classification and expression analysis of all protein kinases in Carya illinoinensis. Forestry Research 1: 14

doi: 10.48130/FR-2021-0014

Figures(7)

Download PDF

Article Metrics

Article views(9342) PDF downloads(1495)

Other Articles By Authors

on this site
on Google Scholar

HTML

INTRODUCTION

Reversible phosphorylation is a common type of post-translational modification, which is catalyzed by protein kinases (PKs), widely existing in living organisms^[1]. PKs regulate the activity of downstream target proteins via transferring the phosphates to phosphorylate specific amino acids including serine, threonine or tyrosine as molecular switches^[2]. PKs constitute a super gene family with a large number of members in plants, and the entire PKs in a genome are defined as the kinome. More than 1000 PK genes were found in Arabidopsis, representing about 4% of the genome^[3]. However, only 518 putative PKs were identified in the human genome, which constitutes 1.7% of entire human genes^[4].

In general, PKs have a catalytic domain ranging from 250 to 300 amino acid residues. This superfamily was first classified into various subfamilies based on the phylogenetic analysis of the catalytic domain sequences^[5]. In recent years, hundreds of plant genome sequences have been released, providing an excellent opportunity in the understanding of the evolutionary history of plant PKs. Kinomes from 25 plant species were identified and further classified into nine major groups with 115 families, and the PKs experienced huge expansion in flowering plants^[6]. In soybean, 2,166 putative PKs were found, and divided into 19 groups and 122 subfamilies^[7]. In the grapevine kinome, 1,168 PK genes were classified into 20 main groups and 121 subfamilies, the RLK-Pelle was the largest group with 872 PKs^[8]. The huge expansion of kinome in flowering plants is due to gene duplication and a good retention rate of duplicates in some groups, especially the RLK-Pelle group^[9]. Only four Interleukin Receptor-Associated Kinase (IRAK) genes have been found in the human genome, which perform a close relationship with plant RLK-Pelle group^[10].

Functional characterization studies of plant PK genes have mainly occurred in model plants such as Arabidopsis and rice, and PKs have been proven to play key roles involved in various biological processes^{[3, 11, 12]}. However, few PK genes have been functionally analyzed in non-model plants, especially in perennial woody plants.

The pecan tree [Carya illinoinensis (Wangenh.) K. Koch] is a well known commercially cultivated nut tree worldwide, which is native to North America and Mexico^[13]. Pecan is a member of the Juglandaceae family in the genus Carya, and the delicious nuts are a good source of unsaturated fatty acids, flavonoids and protein for human benefit seeing an increase in consumption in recent years^[14]. In 2018, the United States of America, produced over 130,000 tons of pecan nuts, with a total production value approaching $600 million (https://www.nass.usda.gov). Recently, the release of the pecan genome and transcriptome data has allowed characterization of the pecan kinome, duplication events, and their expression patterns under different conditions^[15]. In the current study, 967 pecan PKs were identified and further classified into different groups and subfamilies. Conserved domain sequence features and phylogenetic relationships of different subfamily members were also evaluated. Subsequently, the expression patterns and co-expression networks of various subfamilies were analyzed to more efficiently determine the key members. Collectively, the comprehensive annotation of pecan PK genes and expression files helps us to understand the potential roles of pecan protein kinases.

DISCUSSION

Reversible phosphorylation, performed by PKs, is one of the most crucial post-translational modifications, and involved in multiple cellular processes^{[19, 20]}. Although functional analysis of some PKs has been discovered in model plants including Arabidopsis and rice^[21−23], few PKs have been well understood in woody plants due to limited genome information. The recent release of the Carya illinoinensis genome sequence, an economically important nut tree cultivated worldwide, provides the chance to characterize and understand the regulatory networks of the pecan kinome. In the present research, 967 putative pecan PKs were identified using bioinformatic methods (Supplemental Fig. S1), which accounted for 3.11% (967/31,075) of protein-coding genes in the pecan genome^[15]. This proportion of PKs in pecan was lower than that in soybean (4.7%), rice (4.1%), maize (3.8%), and Arabidopsis (3.4%), while higher than that of pineapple (2.8%)^{[6, 7, 24, 25]}. The classification of PKs from 25 plant species showed that gene numbers ranged from 326 to 2535, and the kinome size was significantly larger in the flowering plants, while two algae species including Chlamydomonas reinhardtii and Volvox carteri had 503 and 326 PKs, respectively^[6]. Ostreococcus tauri, a unicellular species of green alga, only possessed 133 PKs in its genome, amounting to 1.7%^[26].

Plant kinomes were commonly categorized into different groups and families based on the sequence difference of the kinase domain. The pecan kinome was divided into 20 different groups, and the RLK group was found to be the largest, containing more than half of the members (565) in the pecan kinome (Fig. 1), a similar phenomenon was also found in other flowering plants including Arabidopsis, grapevine, and rice (Supplemental Fig. S2)^{[6, 8, 27]}. Interestingly, Chlamydomonas reinhardtii and Volvox carteri contained only two and three members in the RLK group, respectively. The large numbers of PK genes in flowering plants can be mainly attributed to the dramatic expansion of a few PK groups, especially the RLK group^[28]. The number of subfamilies in the pecan kinome (121) was larger than that in pineapple (116), and similar to the soybean kinome (122)^{[7, 25]}.

Duplication contributes to the evolution of novel gene functions including stress adaptation, disease resistance, and also makes major contributions to the large size of the RLK group in higher plants^[16]. Over 90% of the increase in regulatory genes was caused by gene duplication in the Arabidopsis lineage^[29]. In the pecan kinome, 29.47% (285/967) of the PK genes with 169 gene pairs were generated from segmental duplication, 145 of them were RLK genes and separated into 34 subfamilies (Supplemental Table S6), 10,530 paralogous pairs were found in the pecan genome^[15]. Different families in the RLK group showed various expansion patterns, large families such as LRR and RLCK make important contributions to the expansion of the large size of the RLK group. Sixty-five and 49 PKs in LRR and RLCK families were generated from gene duplication, respectively, which is consistent with the previous results found in soybean^[7]. The distribution of Ks values can be used to estimate the evolutionary date, more than 70% of duplicated genes in the pecan kinome occurred more recently (Fig. 4a). The ratio of Ka/Ks was commonly used to detect the history of selection pressure on coding sequences of duplicated genes^[30]. In this study, Ka/Ks values of the 169 duplication events in the pecan kinome were less than 0.05, strong negative selection drove the evolution of the PKs in pecan (Fig. 4b). In a previous study, negative selection was also found to be the primary influence on PK genes in pineapple, negative selection indicated the process of removing deleterious mutations^[31].

PKs were generally related to the transmission of extracellular signals to the nucleus by activating or repressing target proteins, and subcellular localization information of PKs might help to explain protein's function^[32]. We predicted the subcellular localization data of PKs in different groups, and about half of the RLK group members were located in the plasma membrane (Fig. 2), however, only 7% of PKs in RLCK families were membrane-located due to the absence of extracellular ligand-binding domains^[33]. PKs in the non-RLK clade showed different subcellular localization features, such as most AGC group members were nucleus-located and more than 70% of CAMK group members were localized in the cytoplasm, similar results were also found in the pineapple kinome^[25].

Plant PKs, especially calcium-dependent protein kinases (CDPKs), mitogen-activated protein kinase (MAPK) cascades, sucrose non-fermenting1-related protein kinases (SnRKs), and RLKs have been well investigated and functionally analyzed in some model plants and crops^{[18, 19, 33]}. To find the key genes more efficiently from the rice kinome with thousands of members, the rice kinase database (RKD) with PK genes in various tissues, under abiotic and biotic stresses was built^[34]. However, limited expression information of PK genes is available for pecan. Expression levels might provide evidence of gene function, then RNA-Seq data of pecan PK genes were analyzed to obtain the central candidates during embryo development or response to drought stress. The expression patterns of pecan PK subfamilies during embryo development revealed many RLK subfamilies were down-regulated, especially some LRR subfamilies (Fig. 5), and this family has been found to play a role in embryo formation^{[35, 36]}.

Drought stress could seriously impact food and energy security, and PKs have key functions in response to abiotic stresses including drought^[21]. The expression data of PK subfamilies in pecan were analyzed under drought stress, while half of the RLK subfamilies performed low expression levels (Supplemental Fig. S5), these subfamilies also showed low expression in soybean and grapevine in response to drought^{[7, 8]}. Furthermore, the differentially expressed genes in the pecan kinome were selected and divided into six clusters based on their different expression patterns, Cluster 5 contained 159 PK genes which were increased under drought stress, including three subfamilies such as CAMK_CAMKL-CHK1, CAMK_CDPK, and CAMK_OST1L in the CAMK group (Fig. 7). The CAMK_CAMKL-CHK1 subfamily, known as CBL-interacting protein kinase (CIPK), was involved in the drought stress response^[21]. MdCIPK6L was up-regulated under drought stress, and the overexpression plants remarkably enhanced the tolerance to drought stress^[37]. CcCBL1-CcCIPK14 module positively regulated drought tolerance via enhancing flavonoid biosynthesis in pigeon pea^[38]. NtCIPK11 was up-regulated significantly in Nitraria tangutorum after mannitol treatment, and overexpression lines in Arabidopsis improved both drought and salt tolerance^[39]. CDPK and CAMK_OST1L (named as SnRK2) genes have been proved to function in plant drought stress response^{[19, 40]}. Among the 159 members in Cluster 5, 95 were RLK group genes and distributed in 28 subfamilies, which accounted for 59.75% (Supplemental Table S10). The receptor-like kinases activate the downstream signaling pathway via perceiving the extracellular signals and phosphorylating the targets, and drought stress caused the most notable effect on rice RLKs^[41]. Intriguingly, nearly one-third of the genes in the largest subfamily, RLK-Pelle_DLSV, were found in Cluster 5, indicating this subfamily may play a key role in response to drought stress.

CONCLUSIONS

Plant protein kinases are important regulators of a variety of cellular processes including plant development and stress responses. In this study, a total of 967 PKs were annotated in the pecan genome, and divided into 121 subfamilies with 20 groups. Gene duplication functioned in the expansion of the pecan kinome, and the segmentally duplicated events suffered strong negative selection based on the Ka/Ks ratios. Moreover, different PK subfamilies in the pecan kinome performed dynamic transcript abundance during embryo development. In addition, pecan PK genes presented various expression patterns in response to drought, and most of them were differentially expressed. This research provides valuable information concerning pecan PKs, and lays a good foundation for further functional investigation of these genes during embryo development and drought stress responses.

{{lists.name}}

Pecan kinome: classification and expression analysis of all protein kinases in Carya illinoinensis