Study on Multi-environment Genome-wide Prediction of Inbred Agronomic Traits in Maize Natural Populations

doi:10.11983/CBB24087

Abstract

Multi-environment field testing is an important way to select maize varieties with high and stable yield. However, its high cost has made it an increasing challenge in plant breeding. Combining sparse field testing with genome-wide prediction makes it possible to predict untested phenotypes, reducing the effort and cost of field testing. In this study, 244 inbred lines from natural populations were planted in Shunyi, Beijing and Mishan, Heilongjiang in 2022 and 2023. Six agronomic traits were studied: days to anthesis, plant height, ear height, ear length, kernel number per row and ear row number. The effects of four models (Single, Across, M×E and R-norm), two cross-validation schemes (CV1 and CV2) and three training set sampling ratios (0.5, 0.7 and 0.9) on prediction accuracy were compared. The average prediction accuracy for the six traits was 0.67, 0.58, 0.50, 0.33, 0.33 and 0.48, respectively. Averaged over traits, the prediction accuracy of the Single, Across, M×E and R-norm models was 0.36, 0.52, 0.53 and 0.53, respectively. Across models and traits, the average prediction accuracy ranged from 0.19 to 0.65 in CV1 and from 0.47 to 0.89 in CV2. Increasing the training set sampling ratio gave only limited gains in prediction accuracy across traits and models, with a maximum improvement of only 0.05. These results indicate that constructing the training set according to the CV2 scheme and including phenotypic data from multiple environments in the prediction model can provide good accuracy for multi-environment prediction.
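The abstract does not spell out how CV1 and CV2 partition the data. The sketch below follows the way these schemes are commonly defined in the genomic prediction literature, which is an assumption about this study's exact procedure: CV1 holds out whole lines in every environment (prediction of untested lines), while CV2 holds out individual line-by-environment records (sparse testing). The function name `cv_masks`, the use of 4 environments (2 locations × 2 years) and the random seed are illustrative choices, not taken from the paper.

```python
import numpy as np

def cv_masks(n_lines, n_envs, train_ratio, scheme, seed=0):
    """Return a boolean (n_lines, n_envs) array; True marks a training record."""
    rng = np.random.default_rng(seed)
    if scheme == "CV1":
        # CV1 (assumed definition): a line is either fully observed or fully
        # unobserved, so test lines have no phenotype in any environment.
        n_train = int(round(train_ratio * n_lines))
        train_lines = rng.permutation(n_lines)[:n_train]
        mask = np.zeros((n_lines, n_envs), dtype=bool)
        mask[train_lines, :] = True
    elif scheme == "CV2":
        # CV2 (assumed definition): single line-by-environment cells are held
        # out, so most test lines are still observed in other environments.
        n_cells = n_lines * n_envs
        n_train = int(round(train_ratio * n_cells))
        flat = np.zeros(n_cells, dtype=bool)
        flat[rng.permutation(n_cells)[:n_train]] = True
        mask = flat.reshape(n_lines, n_envs)
    else:
        raise ValueError(f"unknown scheme: {scheme!r}")
    return mask

# 244 lines, 4 environments, sampling ratio 0.7 (values from the abstract)
cv1 = cv_masks(244, 4, 0.7, "CV1")
cv2 = cv_masks(244, 4, 0.7, "CV2")
```

With purely random cell sampling, a few lines can end up with no training record in CV2 by chance, especially at the 0.5 ratio; a stricter implementation would resample until every line is observed in at least one environment.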

Key words: maize, genome-wide prediction, multi-environment prediction, optimal training sets

Yuan Li, Kaijian Fan, Tai An, Cong Li, Junxia Jiang, Hao Niu, Weiwei Zeng, Yanfang Heng, Hu Li, Junjie Fu, Huihui Li, Liang Li. Study on Multi-environment Genome-wide Prediction of Inbred Agronomic Traits in Maize Natural Populations[J]. Chinese Bulletin of Botany, 2024, 59(6): 1041-1053.

URL: https://www.chinbullbotany.com/EN/10.11983/CBB24087

https://www.chinbullbotany.com/EN/Y2024/V59/I6/1041

Trait	Abbreviation	Unit	Description
Days to anthesis	DTA	Days	Number of days from planting to anthesis, recorded as the date when 50% of the plants in the plot had anthers extruded along half the length of the main tassel spindle
Plant height	PH	cm	Height of the stem from the ground to the top of the tassel, measured on 3-5 plants
Ear height	EH	cm	Height of the stem from the ground to the base of the ear, measured on 3-5 plants
Ear length	EL	cm	Length of the ear, measured on 3-5 ears
Ear row number	ERN	Count	Number of kernel rows per ear, counted on 3-5 ears
Kernel number per row	KNR	Count	Number of kernels per row, counted on 3-5 ears

Trait	Environment	Range	Mean±SD	Skewness	Kurtosis	Coefficient of variation
DTA	22BJ	48.00-72.00	61.41±15.29	-0.56	0.11	0.25
	22MS	54.00-99.00	83.05±31.11	-1.32	2.55	0.37
	23BJ	53.00-82.00	67.88±17.28	-0.13	0.00	0.25
	23MS	69.00-104.00	88.23±12.59	-0.04	0.90	0.14
PH	22BJ	131.50-315.67	230.39±52.01	-0.33	0.17	0.23
	22MS	113.00-302.00	227.27±45.40	-0.28	0.13	0.20
	23BJ	90.33-257.33	183.66±39.61	-0.44	0.68	0.22
	23MS	134.00-323.67	233.58±54.45	-0.29	0.06	0.23
EH	22BJ	34.67-142.33	87.70±23.85	-0.07	-0.46	0.27
	22MS	17.33-168.00	74.66±24.73	0.28	0.36	0.33
	23BJ	20.00-125.33	69.68±19.79	-0.16	-0.28	0.28
	23MS	23.67-145.33	86.81±27.89	-0.07	-0.41	0.32
EL	22BJ	5.00-22.00	13.99±3.99	0.11	0.62	0.29
	22MS	6.70-20.80	14.54±4.03	0.16	0.15	0.28
	23BJ	5.00-22.33	13.23±4.22	0.25	0.16	0.32
	23MS	8.50-20.00	13.60±4.53	0.46	0.17	0.33
KNR	22BJ	2.00-44.67	21.93±8.32	-0.45	0.58	0.38
	22MS	10.00-41.00	24.36±7.90	-0.16	0.05	0.32
	23BJ	3.00-39.00	19.47±7.56	0.13	0.15	0.39
	23MS	10.00-39.33	24.48±8.68	-0.06	-0.22	0.35
ERN	22BJ	7.50-19.67	13.49±3.78	0.04	-0.41	0.28
	22MS	6.00-20.00	13.62±3.89	0.05	-0.10	0.29
	23BJ	8.00-20.67	12.88±4.01	0.43	-0.01	0.31
	23MS	7.33-20.00	13.86±4.48	-0.09	-0.50	0.32
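The last column of the table above appears to be the standard deviation divided by the mean, expressed as a fraction rather than as a percentage. A minimal check on three rows copied from the table (the helper name `coefficient_of_variation` is illustrative):

```python
def coefficient_of_variation(mean, sd):
    """Coefficient of variation as a fraction (SD divided by mean)."""
    return sd / mean

# (mean, SD) pairs copied from the table above
rows = {
    "DTA 22BJ": (61.41, 15.29),
    "PH 22MS": (227.27, 45.40),
    "ERN 23MS": (13.86, 4.48),
}

cv = {key: round(coefficient_of_variation(mean, sd), 2)
      for key, (mean, sd) in rows.items()}
print(cv)
```

The rounded values agree with the tabulated 0.25, 0.20 and 0.32 for these rows.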

Trait	σ²_g	σ²_ge	H²	SE (H²)
DTA	27.65***	8.79***	0.67	0.08
PH	736.31***	104.94***	0.76	0.01
EH	370.33***	67.62***	0.74	0.02
EL	2.94***	1.09***	0.50	0.26
KNR	13.74***	7.69***	0.38	0.04
ERN	3.25***	0.49***	0.60	0.46
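For orientation, broad-sense heritability on a line-mean basis in multi-environment trials is often computed from the variance components as shown below. This is the common textbook form and an assumption here, since this page does not state the formula the authors used. The symbols $e$ (number of environments), $r$ (number of replicates per environment) and $\sigma^2_\varepsilon$ (residual variance) are not reported in the table above, so the tabulated H² values cannot be recomputed from it alone.

```latex
H^2 = \frac{\sigma^2_g}{\sigma^2_g + \dfrac{\sigma^2_{ge}}{e} + \dfrac{\sigma^2_\varepsilon}{e\,r}}
```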