植物学报 ›› 2024, Vol. 59 ›› Issue (6): 1041-1053.DOI: 10.11983/CBB24087  cstr: 32102.14.CBB24087

• 研究论文 • 上一篇    下一篇

玉米自然群体自交系农艺性状的多环境全基因组预测初探

李园1, 范开建1, 安泰2, 李聪3, 蒋俊霞1, 牛皓1, 曾伟伟2, 衡燕芳1, 李虎1, 付俊杰1, 李慧慧1, 黎亮1,*()   

  1. 1中国农业科学院作物科学研究所, 作物分子育种国家工程研究中心, 作物基因资源与育种全国重点实验室, 北京 100081
    2中国农业大学农学与生物技术学院, 北京 100193
    3黑龙江八一农垦大学农学院, 大庆 163000
  • 收稿日期:2024-06-06 接受日期:2024-10-14 出版日期:2024-11-10 发布日期:2024-10-16
  • 通讯作者: *黎亮, 研究员, 博士生导师, 中国农业科学院杰出青年英才, 农业农村部神农英才“青年英才”。在玉米单倍体育种技术方面的工作(参与)获得2017年度教育部技术发明一等奖、2017年度大北农科技奖植物育种奖和2023年度国家技术发明二等奖。目前, 研究团队主要聚焦单倍体技术和全基因组选择技术的有机整合, 以构建工程化的育种流程, 实现不同环境下的田间表型预测, 为创制高产耐密宜机收种质提供技术支撑。E-mail: liliang05@caas.cn
  • 基金资助:
    国家自然科学基金(32272190);中国农业科学院创新工程(2024)

Study on Multi-environment Genome-wide Prediction of Inbred Agronomic Traits in Maize Natural Populations

Yuan Li1, Kaijian Fan1, Tai An2, Cong Li3, Junxia Jiang1, Hao Niu1, Weiwei Zeng2, Yanfang Heng1, Hu Li1, Junjie Fu1, Huihui Li1, Liang Li1,*()   

  1. 1State Key Laboratory of Crop Gene Resources and Breeding, The National Engineering Center for Crop Molecular Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    2College of Agronomy and Biotechnology, China Agricultural University, Beijing 100193, China
    3College of Agronomy, Heilongjiang Bayi Agricultural University, Daqing 163000, China
  • Received:2024-06-06 Accepted:2024-10-14 Online:2024-11-10 Published:2024-10-16
  • Contact: *E-mail: liliang05@caas.cn

摘要: 多环境田间测试是选育高产稳产品种的重要途径, 但因其成本高逐渐成为植物育种的瓶颈问题。将稀疏测试与全基因组预测方法相结合可实现对未测表型的预测, 进而减少田间测试的工作量和成本。利用244份玉米(Zea mays)自然群体自交系在两年(2022年和2023年)两点(北京顺义和黑龙江密山) 4个环境下, 针对散粉期、株高、穗位高、穗长、穗行数和行粒数6个代表性农艺性状开展研究, 比较了4种模型(Single、Across、M×E和R-norm)、2种训练群体组成方案(CV1和CV2)以及3种训练集抽样比例(0.5、0.7和0.9)对预测精度的影响。结果表明,上述6个农艺性状的平均预测精度分别为0.67、0.58、0.50、0.33、0.33和0.48; Single模型、Across模型、M×E模型和R-norm模型的平均预测精度分别为0.36、0.52、0.53和0.53; 其中CV1各模型在不同性状中的预测精度范围在0.19-0.65之间, CV2预测精度范围在0.47-0.89之间; 不同抽样比例比较显示, 不同模型中训练集比例的提升对6个性状的预测精度提升有限, 最大提升幅度仅为0.05。综上表明, 在进行多环境预测时, 利用CV2训练群体组成方案并在预测模型中纳入多个环境下的表型数据可提升预测精度。

关键词: 玉米, 全基因组预测, 多环境预测, 训练集优化

Abstract: Multi-environment field testing is an important way to select optimize maize yield and yield stability varieties. However, because of its high cost, it has gradually become a challenge in plant breeding. The combination of field sparse testing and genome-wide prediction method can be used to predict untested phenotypes, reduced the effort and cost on field testing. In this experiment, 244 inbred lines of natural populations were planted in Shunyi, Beijing and Mishan, Heilongjiang in 2022 and 2023. Six agronomic traits were studied, including days to anthesis, plant height, ear height, ear length, kernel number per row and ear row number. The effects of four different models (Single, Across, M×E and R-norm), two different cross-validation schemes (CV1 and CV2) and three different training sets sampling ratios (0.5, 0.7 and 0.9) on the prediction accuracy were compared. The results showed that the average prediction accuracy of the six agronomic traits was 0.67, 0.58, 0.50, 0.33, 0.33 and 0.48. The average prediction accuracy of the Single model, Across model, M×E model and R-norm model was 0.36, 0.52, 0.53 and 0.53 for each trait. In CV1, the average prediction accuracy of each model in six traits ranged from 0.19 to 0.65, and in CV2, the average prediction accuracy ranged from 0.47 to 0.89. The comparison of different training set sampling ratios shows that the improvement of the proportion of the training sets has limited improvement in the prediction accuracy of different traits in different models, and the maximum is only 0.05. The results show that the CV2 training set can be used to form a scheme and include phenotypic data from multiple environments in the prediction model to provide good prediction accuracy for multi-environment prediction.

Key words: maize, genome-wide prediction, multi-environment prediction, optimal training sets