Predicting complex traits using a diffusion kernel on genetic markers with an application to dairy cattle and wheat data

Gota Morota, Masanori Koyama, Guilherme J. M Rosa, Kent A. Weigel, Daniel Gianola

Research output: Contribution to journalArticlepeer-review

20 Scopus citations


Background: Arguably, genotypes and phenotypes may be linked in functional forms that are not well addressed by the linear additive models that are standard in quantitative genetics. Therefore, developing statistical learning models for predicting phenotypic values from all available molecular information that are capable of capturing complex genetic network architectures is of great importance. Bayesian kernel ridge regression is a non-parametric prediction model proposed for this purpose. Its essence is to create a spatial distance-based relationship matrix called a kernel. Although the set of all single nucleotide polymorphism genotype configurations on which a model is built is finite, past research has mainly used a Gaussian kernel. Results: We sought to investigate the performance of a diffusion kernel, which was specifically developed to model discrete marker inputs, using Holstein cattle and wheat data. This kernel can be viewed as a discretization of the Gaussian kernel. The predictive ability of the diffusion kernel was similar to that of non-spatial distance-based additive genomic relationship kernels in the Holstein data, but outperformed the latter in the wheat data. However, the difference in performance between the diffusion and Gaussian kernels was negligible. Conclusions: It is concluded that the ability of a diffusion kernel to capture the total genetic variance is not better than that of a Gaussian kernel, at least for these data. Although the diffusion kernel as a choice of basis function may have potential for use in whole-genome prediction, our results imply that embedding genetic markers into a non-Euclidean metric space has very small impact on prediction. Our results suggest that use of the black box Gaussian kernel is justified, given its connection to the diffusion kernel and its similar predictive performance.

Original languageEnglish (US)
Article number17
JournalGenetics Selection Evolution
Issue number1
StatePublished - 2013

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Animal Science and Zoology
  • Genetics

Fingerprint Dive into the research topics of 'Predicting complex traits using a diffusion kernel on genetic markers with an application to dairy cattle and wheat data'. Together they form a unique fingerprint.

Cite this