Your browser doesn't support javascript.
loading
Enhancing genomic prediction with Stacking Ensemble Learning in Arabica Coffee.
Nascimento, Moyses; Nascimento, Ana Carolina Campana; Azevedo, Camila Ferreira; de Oliveira, Antonio Carlos Baiao; Caixeta, Eveline Teixeira; Jarquin, Diego.
Afiliação
  • Nascimento M; Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil.
  • Nascimento ACC; Agronomy Department, University of Florida, Gainesville, FL, United States.
  • Azevedo CF; Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil.
  • de Oliveira ACB; Agronomy Department, University of Florida, Gainesville, FL, United States.
  • Caixeta ET; Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil.
  • Jarquin D; Embrapa Coffee, Brazilian Agricultural Research Corporation (Embrapa), Brasília, Brazil.
Front Plant Sci ; 15: 1373318, 2024.
Article em En | MEDLINE | ID: mdl-39086911
ABSTRACT
Coffee Breeding programs have traditionally relied on observing plant characteristics over years, a slow and costly process. Genomic selection (GS) offers a DNA-based alternative for faster selection of superior cultivars. Stacking Ensemble Learning (SEL) combines multiple models for potentially even more accurate selection. This study explores SEL potential in coffee breeding, aiming to improve prediction accuracy for important traits [yield (YL), total number of the fruits (NF), leaf miner infestation (LM), and cercosporiosis incidence (Cer)] in Coffea Arabica. We analyzed data from 195 individuals genotyped for 21,211 single-nucleotide polymorphism (SNP) markers. To comprehensively assess model performance, we employed a cross-validation (CV) scheme. Genomic Best Linear Unbiased Prediction (GBLUP), multivariate adaptive regression splines (MARS), Quantile Random Forest (QRF), and Random Forest (RF) served as base learners. For the meta-learner within the SEL framework, various options were explored, including Ridge Regression, RF, GBLUP, and Single Average. The SEL method was able to predict the predictive ability (PA) of important traits in Coffea Arabica. SEL presented higher PA compared with those obtained for all base learner methods. The gains in PA in relation to GBLUP were 87.44% (the ratio between the PA obtained from best Stacking model and the GBLUP), 37.83%, 199.82%, and 14.59% for YL, NF, LM and Cer, respectively. Overall, SEL presents a promising approach for GS. By combining predictions from multiple models, SEL can potentially enhance the PA of GS for complex traits.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Front Plant Sci Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Brasil País de publicação: Suíça

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Front Plant Sci Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Brasil País de publicação: Suíça