Article ID Journal Published Year Pages File Type
4500172 Mathematical Biosciences 2013 10 Pages PDF
Abstract

Recently, genome wide DNA markers have been used in breeding value estimation of livestock species. The computational technique is known as genomic selection. Typically, a large number of marker effects are estimated from a small number of animals, which presents an under-determined problem. In this paper, we propose sparse marker selection methods using haplotypes for both breeding value estimation and QTL mapping. By applying a two-stage regression strategy, markers are selected in the first stage, then in the second stage the selected markers are fitted in a range of models including linear, kernel and semi-parametric models. The estimation accuracy of breeding values is measured by the correlation coefficient, as well as the regression coefficient, between the true breeding values and the estimated breeding values by the models. We show that the estimation accuracy by using sparse markers, as low as 5000 or 500 dimensions, is comparable to that obtained from genome wide markers of about 230,000 dimensions of DNA haplotypes. The selected sparse markers can also be used for QTL mapping. In this paper we use protein yield to demonstrate the methods, and show that loci of large effects confirm published QTL.

► We propose a two-stage regression algorithm for breeding value estimation. ► Sparse DNA markers are selected in the first stage of the algorithm. ► Both linear and nonlinear models are implemented in the second stage of the algorithm. ► QTL mapping is studied, based on the selected sparse markers. ► A novel mixed-length haplotypes dataset is proposed, which is useful to indicate if the QTL mutations are recent.

Related Topics
Life Sciences Agricultural and Biological Sciences Agricultural and Biological Sciences (General)
Authors
, , , ,