Skip to main content

Accuracy of breeding values for production traits in turkeys (Meleagris gallopavo) using recursive models with or without genomics



Knowledge about potential functional relationships among traits of interest offers a unique opportunity to understand causal mechanisms and to optimize breeding goals, management practices, and prediction accuracy. In this study, we inferred the phenotypic causal networks among five traits in a turkey population and assessed the effect of the use of such causal structures on the accuracy of predictions of breeding values.


Phenotypic data on feed conversion ratio, residual feed intake, body weight, breast meat yield, and walking score in addition to genotype data from a commercial breeding population were used. Causal links between the traits were detected using the inductive causation algorithm based on the joint distribution of genetic effects obtained from a standard Bayesian multiple trait model. Then, a structural equation model was implemented to infer the magnitude of causal structure coefficients among the phenotypes. Accuracies of predictions of breeding values derived using pedigree- and blending-based multiple trait models were compared to those obtained with the pedigree- and blending-based structural equation models.


In contrast to the two unconditioned traits (i.e., feed conversion ratio and breast meat yield) in the causal structures, the three conditioned traits (i.e., residual feed intake, body weight, and walking score) showed noticeable changes in estimates of genetic and residual variances between the structural equation model and the multiple trait model. The analysis revealed interesting functional associations and indirect genetic effects. For example, the structural coefficient for the path from body weight to walking score indicated that a 1-unit genetic improvement in body weight is expected to result in a 0.27-unit decline in walking score. Both structural equation models outperformed their counterpart multiple trait models for the conditioned traits. Applying the causal structures led to an increase in accuracy of estimated breeding values of approximately 7, 6, and 20% for residual feed intake, body weight, and walking score, respectively, and different rankings of selection candidates for the conditioned traits.


Our results suggest that structural equation models can improve genetic selection decisions and increase the prediction accuracy of breeding values of selection candidates. The identified causal relationships between the studied traits should be carefully considered in future turkey breeding programs.


The number of traits included in genetic evaluation programs has increased steadily over time. Hence, multivariate linear mixed models (MTM), e.g. [1, 2], are increasingly important in animal breeding. Although MTM have been successfully used to study the genetic and environmental relationships between phenotypes, they do not infer the phenotype networks that describe the interrelationships that are generally present in biological systems. Models that account for recursiveness and feedback between traits, such as structural equation models (SEM) [3, 4],  can be used to evaluate simultaneous relationships that exist between phenotypes. Knowledge about cause-and-effect mechanisms that underlie interrelationships between environmental factors, management practices, animal physiology, and performance outcomes is crucial to explore the behavior of complex systems and can contribute to improving management practices and multi-trait selection strategies in livestock.

A phenotypic correlation between two traits, y1 and y2, can be due to a direct effect of one on the other or to extraneous (confounding) variables that jointly affect both traits, among other possibilities. Defining the causal structure allows prediction of the effect of interventions (e.g., management practices) applied to each trait. When, for instance, y1 affects y2 while y2 does not affect y1, an intervention on y1 will cause changes on y2, but not conversely. Structural equation models have been used in many fields including genetics, economics, psychometrics, social statistics, and biological sciences [5]. From their studies on bovine milk fatty acid traits and meat quality traits of Wagyu beef, Bouwman et al. [6] and Inoue et al. [7], respectively, compared MTM and SEM and observed differences in estimates of parameters between them.

Similar to other species, MTM has been widely applied to assess the associations between production, reproduction and welfare traits in turkeys. Quinton et al. [8], examined the connections between survival and fitness in turkeys using MTM and concluded that selection for growth may decrease survival and traits related to walking ability. Their suggestion was to perform a multiple trait selection program for growth, survival, walking ability, hip and leg structures. Analyzing walking ability, breast meat yield, and feed efficiency traits, Abdalla et al. [9] reported unfavorable genetic correlations of walking ability with body weight and breast meat yield. Other applications of MTM for turkeys were presented by Emamgholi Begli et al. [10] to estimate phenotypic and genetic correlations between clutch and broodiness traits, along with body weight and other conventional egg production traits. Similar to previous literature, the authors reported an unfavorable genetic correlation between egg production and body weight and suggested that including clutch and pause length traits in the selection index may reduce broodiness while increasing egg number.

The objectives of this study were: (1) to search for causal structures among body weight, walking ability, breast meat yield, and two feed efficiency traits in a turkey breeding line; (2) to fit a SEM based on the uncovered causal structures to quantify the relationships between the traits; and (3) to compare MTM and SEM in terms of prediction accuracy of breeding values based on pedigree-based (BLUP) and single-step blending (ssGBLUP) animal models.


Phenotypic data

Data used in this study were previously described [9] and further details are in Table 1. Briefly, the data were provided by Hybrid Turkeys, Kitchener, Canada, from 10 generations of a turkey breeding line from birds hatched between 2009 and 2017. Feed conversion ratio (FCR; N = 5619), residual feed intake (RFI; N = 5619), and breast meat yield (BMY; N = 9634) were recorded on male birds, whereas body weight (BW; N = 170,844) and walking score (WS; N = 170,844) were recorded on both males (N = 99,832) and females (N = 71,012). A standard feeding system with group housing and shared feeders and drinkers was used until 15 weeks of age. From this point on and until 19 to 20 weeks of age, a real-time automated system was used to monitor individual feed intake in males. With this system, each bird was identified during each visit to the feed station and the weight of the feeder in addition to the body weight of the bird were taken using a scale [11]. FCR was calculated as total feed intake divided by weight gain, while RFI was obtained as the residual of a multiple regression of observed feed intake on metabolic mid-weight and body weight gain [12]. Based on walking ability, a subjective WS ranging from 1 to 6 was given to each male and female at 18 and 20 weeks of age, respectively. Birds with better walking ability received a higher WS. Males were slaughtered at 21 or 22 weeks to obtain BMY, which was calculated as:

$${\text{BMY}} \, \text{=} \, \frac{\text{Breast meat weight}}{\text{Live body weight at slaughter}}\times \, {100}.$$
Table 1 Descriptive statistics of the analyzed dataset including number of records, mean, standard deviation, training and validation subsets for different production and fitness traits in a turkey line

Genomic data

Genotypes from a proprietary 65 K single nucleotide polymorphism (SNP) panel (65,000 SNPs; Illumina, Inc.) were available for a subset of the animals as reported in Table 1. SNPs that were located on the sex chromosome, those that deviated significantly from Hardy–Weinberg proportions \((P<1 \times {10}^{-8}),\) and SNPs with a minor allele frequency lower than 5% or with a call rate lower than 90% were removed. The number of SNPs that remained for analysis was 53,455. All genotyped animals had a call rate higher than 90% and all were included in the study.

Structural equation modeling (SEM)

Three main steps are required to perform SEM [5, 13]:

(1) Draw samples from the posterior distribution of the covariance matrices of residuals from a multiple trait model.

(2) Apply the inductive causation (IC) algorithm to query about the statistical independence between two traits. In each query, the IC performs the following:

  • Compute the posterior distribution of residual partial correlation between two traits for each sample that is drawn from the posterior distribution of covariance matrices of residuals.

  • Obtain the highest posterior density (HPD) for the posterior distribution of the residual partial correlation.

  • Two traits are declared conditionally dependent if the HPD interval contains 0.

(3) Fit the SEM based on the selected causal structure.

Multiple trait model (MTM)

Pedigree-based best linear unbiased prediction (PBLUP)

To estimate genetic and residual (co)variance components using pedigree relationships, the following multiple trait linear mixed model was fitted:

$$\mathbf{y}\ \text{=}\ \mathbf{ Xb }\ \text{+}\ \mathbf{ Zu }\ \text{+}\ \mathbf{ e},$$

where, \(\mathbf {y}\) is a vector of observations of FCR, RFI, BMY, BW and WS, sorted by animals; \(\mathbf {b}\) is a vector with the systematic effects of hatch week-year for all traits and sex for BW and WS; \(\mathbf {u}\) is a vector of additive genetic effects, distributed as \(\mathbf{u}\sim N(\mathbf0, \mathbf{A}\otimes\mathbf{G}),\) where \(\mathbf{A}\) is the numerator relationship matrix, derived by including inbreeding coefficients, and \(\mathbf{G}\) is the additive genetic variance–covariance matrix among traits; \(\mathbf {e}\) is a vector of residual effects, distributed as \(\mathbf{e}\sim N(\mathbf0, {\sum }_{\mathrm{i}}^{+}{\mathbf{E}}_{iy})\) where \({\mathbf{E}}_{iy}\) indicates a \({m}_{i}\times {m}_{i}\) matrix corresponding to the trait phenotypes that were available for animal \(i\), and \({m}_{i}\) is the number of trait phenotypes available for animal \(i\); \(\mathbf {X}\) and \(\mathbf {Z}\) are incidence matrices for the respective fixed and random effects.

Single-step genomic best linear unbiased prediction (ssGBLUP)

The same model outlined above for PBLUP was used for ssGBLUP, except that the \(\mathbf{A}\) matrix was replaced by the \(\mathbf{H}\) matrix, which is a combined pedigree and genomic relationship matrix [14]. For both models (PBLUP and ssGBLUP), a Markov chain Monte Carlo (MCMC) method based on Gibbs sampling was used to estimate the joint posterior distribution to derive posterior means and standard deviations of parameters of interest. Using the software THRGIBBS1F90 [15], a single chain of the Gibbs sampler was run, with the first 100,000 samples discarded as burn-in. Thereafter, samples were saved every 50 iterations, resulting in 20,000 samples. Convergence was assessed by visual inspection of trace plots of posterior distributions and Geweke’s diagnostic [16]. The latter ranged from − 0.20 to 0.24 for all variance–covariance parameters.

Inductive causation (IC) algorithm

The IC algorithm can be used to explore the full space of possible networks among traits [17]. When two traits are genetically correlated, residual (co)variances are generated to correct for the confounding among them. The IC algorithm performs a series of statistical decisions based on partial correlations between traits. First, IC computes the posterior distribution of the residual partial correlation between two traits at each sample drawn from the posterior distribution of covariance matrices of residuals derivative from MTM (PBLUP or ssGBLUP). Then, the HPD for the posterior distribution of residual partial correlation is obtained. The process to declare that two traits are conditionally dependent or not can be divided into three main steps: (1) two traits are considered connected with an undirected link (e.g., y1–y2) if the partial correlations among them conditional on every possible set of the remaining traits are declared different from 0; (2) if the two traits are nonadjacent but share a common adjacent trait (e.g., y1 and y3 in y1–y2–y3) and their partial correlations are conditionally non-null on all possible subsets of the remaining traits that contain the adjacent trait (y2 in this case), arrowheads pointing to the common adjacent trait can be added (i.e., y1 → y2 ← y3), which is known as an unshielded collider; and (3) without creating a new unshielded collider or cycle, as many undirected links as possible are oriented based on the partially oriented graph obtained in step 2. Declaration of partial correlations to be null or not was based on the HPD intervals, i.e. a correlation was declared to be null if the HPD contained the value 0. To evaluate the structure sensitivity [5, 13], HPD intervals of 85, 90, and 95% were applied. The IC analysis was carried out by using a previously described software program [17] written in R [18].

Structural equation models

Based on the causal network inferred by the IC algorithm, two SEM models were fitted: the structural equation PBLUP (SEPBLUP) and the structural equation ssGBLUP (SEssGBLUP). For SEPBLUP, the model was:

$${\mathbf{y}} = (\sum_{\text{i}}^{+} \varvec{\Lambda}_{iy}\mathbf{)y + X}{\mathbf{b}}^{*}\mathbf{ + Z} {\mathbf{u}}^{*}\text{ + }{\mathbf{e}}^{*},$$

where \(\mathbf {y}\) is a vector of observations of FCR, RFI, BMY, BW, and WS sorted by animal; \(\mathbf {X}\), \({\mathbf{b}}^{\boldsymbol{*}}\), \(\mathbf{Z}\), \({\mathbf{u}}^{\boldsymbol{*}}\), \({\mathbf{e}}^{\boldsymbol{*}}\), hold similar meanings to those for the MTM described above, except that the vectors here serve as systematic and random effects that directly affect each trait and that are not mediated by other traits [4, 5, 17]. \(\varvec{\Lambda}_{iy}\) is a \({m}_{i}\times {m}_{i}\) matrix of structural coefficients describing the chosen causal structure corresponding to the phenotypes that are available for animal \(i\), and \({m}_{i}\) is the number of phenotypes available for animal \(i\). It was assumed that \({\mathbf{u}}^{\boldsymbol{*}}\sim N(\mathbf0, \mathbf{A}\otimes{\mathbf{G}}^{*})\), where \(\mathbf{A}\) is as defined for the MTM and \({\mathbf{G}}^{*}\) is the SEM additive genetic (co)variance matrix (i.e., it describes variances and covariances of direct genetic effects). Similarly, \({\mathbf{e}}^{\boldsymbol{*}}\sim N(\mathbf0, {\sum }_{\mathrm{i}}^{+}{\mathbf{E}}_{iy})\) where \({\mathbf{E}}_{iy}\) indicates a \({m}_{i}\times {m}_{i}\) matrix with the SEM residual variances corresponding to the phenotypes that were present for animal \(i\), and \({m}_{i}\) is the number of phenotypes available for animal \(i\). The same model used for SEPBLUP was fitted for SEssGBLUP after replacing the \(\mathbf{A}\) matrix by the \(\mathbf{H}\) matrix.

The causal phenotypes inferred by the IC algorithm were included as covariates in both SEM models. As SEM accounts for all random variables that simultaneously affect two or more traits, the residual covariances between traits were set to 0 for both models, which allows the structural coefficients to be identifiable [19, 20]. An MCMC chain with the same specifications as used for the MTM was used to estimate the posterior distributions of the SEM parameters. Visual inspection trace plots of posterior distributions and Geweke’s diagnostic [16] applied for MTM were also applied for the SEM to ensure convergence.

Assessment of accuracy and bias of estimated breeding values

Estimates of the accuracy of estimated breeding values (EBV) for the PBLUP and ssGBLUP models for the data analyzed here were reported in our previous study [9]. Here, the goal was to compare the accuracy obtained based on the SEM to those reported in [9]. First, phenotypes corrected for fixed effects were estimated for all birds using SEBLUP. Then, approximately 10% of the birds (the youngest) had their phenotypes masked, which constituted the validation subset, while the remaining phenotypes were used to train the model (Table 1). The accuracy estimate was the Pearson correlation coefficient between the adjusted phenotypes of the validation subset and their corresponding EBV obtained from the respective models. Bias was assessed using the regression of adjusted phenotypes of the validation subset on their corresponding EBV. In addition to evaluating accuracy and bias of EBV, we also calculated rank correlations between EBV of selection candidates by the two modeling approaches (i.e., between PBLUP and SEPBLUP and between ssGBLUP and SEssGBLUP).

Results and discussion

Multiple trait model and inductive causation algorithm analyses

Posterior means of heritabilities and of genetic and residual correlations obtained from the MTM, along with their respective posterior standard deviations (PSD), are in Table 2. These parameters were similar to those reported by Abdalla et al. [9] for the same dataset. HPD intervals of 95, 90, and 85% were applied to detect causal relationships between traits, resulting in the graphs shown in Fig. 1a–c, respectively. Use of the three HPD intervals showed similar links between the traits with PBLUP or ssGBLUP. However, with an HPD of 95%, the direction of the causality was uncovered neither for the connections of FCR with either RFI or BW, nor for the connection between RFI and BW. When the 90% HPD interval was applied, the direction of two (FCR—RFI and FCR—BW) out of the three unknown links was detected (FCR → RFI and FCR → BW). The last unoriented link (RFI—BW) became oriented (RFI → BW) when the 85% HPD interval was used, resulting in a fully directed acyclic diagram (DAG; Fig. 1c).

Table 2 Posterior means (Mean) and posterior standard deviations (PSD) of variance components for the multi-trait model and the structural equation model
Fig. 1

Links between traits detected by the inductive causation algorithm based on 95 (a), 90 (b) and 85% (c) of highest posterior density intervals of the posterior distribution of the residual partial correlation. Links without arrowheads represent associations between traits and those with arrowheads represent causal relationships towards arrowheads. FCR feed conversion ratio (kg/kg), RFI residual feed intake (kg), BW body weight (kg), BMY breast meat yield (%), WS walking score (1 to 6)

Structural equation model analysis

The causal network that was obtained based on the 85% HPD interval (Fig. 1c) was used to fit the SEM. Posterior means of the variance components and of the genetic and residual correlations, along with their respective PSD for the MTM and the SEM, are in Table 2. Links between traits with their respective structural coefficients that were inferred using the SEM are shown in Fig. 2. For some traits, slight differences were observed in the variance component estimates obtained using the SEM versus the MTM. While both FCR and BMY had similar estimates to those from the MTM, genetic variances were lower based on the SEM than based on the MTM for the other three traits. The SEM resulted in smaller estimates of residual variances for all traits, except for FCR and BW. The traits BW and WS also showed different estimates of heritability based on the SEM versus the MTM: from 0.35 ± 0.07 and 0.24 ± 0.04, respectively, for the MTM, to 0.32 ± 0.07 and 0.21 ± 0.04 when the SEM was applied. This suggests that these parameters were over-estimated when causality between traits was ignored. The upstream traits FCR and BMY did not show similar differences in estimates between the SEM and MTM. Such changes in variance component estimates are expected [6, 7] because the traits RFI, BW, and WS are conditioned on at least one of the other two traits, FCR and BMY in the DAG, (see Fig. 1c).

Fig. 2

Links and posterior means ± posterior standard deviations of structural coefficients between the studied traits based on results of the inductive causation algorithm based on 85% of highest posterior density intervals of the posterior distribution of the residual partial correlation (Fig. 1c). Links represent causal relationships toward arrowheads. FCR feed conversion ratio (kg/kg), RFI residual feed intake (kg), BW body weight (kg), BMY breast meat yield (%), WS walking score (1 to 6)

Posterior means of genetic correlations from the SEM also differed from those obtained with the MTM; the SEM resulted in higher posterior means of genetic correlations among all traits, except for the correlations of FCR with RFI (0.69 ± 0.09) and WS (− 0.12 ± 0.07), as well as for the correlation between BMY and WS (− 0.46 ± 0.13). Because genetic effects could be direct or indirect, compared to the MTM, the SEM has the ability to separately identify these types of effects [17]. Although estimates of genetic correlations differed between the two models, these estimates cannot be directly compared, because their definitions differ [6, 17].

Differences in estimates of variance components parameters and of genetic and residual correlations between MTM and SEM have been previously reported. In a study on bovine milk fatty acid, Bouwman et al. [6] reported that unconditioned traits showed lower reductions in estimates of variance components based on the SEM versus the MTM, than the conditioned traits. Similar results were reported by Inoue et al. [7] in their study on the inference of the causal relationship between six meat quality traits in Wagyu beef.

Posterior means of the structural coefficients obtained from the SEM are shown in Fig. 2. All coefficients were negative, except that between FCR and RFI (1.52 ± 0.05). For traits with direct links (i.e., FCR → RFI, FCR → BW, and BMY → WS), the signs of the posterior means of the structural coefficients are expected to be analogous to the sign of the corresponding posterior means of the residual covariances between the same traits obtained from the MTM. The traits FCR and BMY did not have other traits as causal parents and, therefore, the sign of their direct effects on other traits should follow those based on the MTM. Although this is not guaranteed, posterior means of the indirect structural coefficients between traits (i.e., RFI → BW and BW → WS) had the same sign (negative) as their counterpart residual covariances in the MTM. However, the converse can also be true, as indirect structural coefficients are inferred from a conditional association and the residual covariances are marginal associations [5].

Results indicated that RFI is positively affected by FCR, which was inferred at a magnitude of 1.52 ± 0.05 (Fig. 2). The connection between these two feed efficiency traits has previously been reported for turkeys [12, 21] and other species, e.g. [22, 23]. The main difference between them is that FCR is expressed as a ratio of feed intake and growth rate, while RFI is a linear index of feed intake and growth rate. However, as a ratio of two component traits, genetic selection for FCR may not account for how selection is working on each trait directly [24]. Moreover, FCR is not independent of production traits and also does not account for them. Such problems are not encountered with RFI but animals that have favorable RFI could grow slowly [25]. Regardless, Case et al. [12] compared FCR and RFI in a turkey population and reported that both could be integrated into a turkey selection index, but RFI may be advantageous because it is more independent of performance traits than FCR. In general, both traits are important to improve feed efficiency and both can be considered as “expensive-to-measure” phenotypes. Investigating the direction and the magnitude of the association between FCR and RFI seems critical for selection indexes that aim at improving simultaneous multiple traits through genetic selection.

Estimates of the structural coefficients of FCR with BW (− 0.60 ± 0.14) and of RFI with BW (− 0.04 ± 0.00) were both negative. Slightly lower genetic correlations of FCR (0.65 ± 0.05) and RFI (0.09 ± 0.06) with BW have been reported for turkeys [12] compared to our estimates (0.65 ± 0.05 and 0.09 ± 0.06, respectively). Although the results of this study suggest that FCR has more influence on BW than RFI based on a higher estimate of the structural coefficient, the posterior distribution for the structural coefficient of RFI on BW showed a relatively wider range (SD = 2.51; Table 1) and, therefore, its true value could be quite different from the posterior mean. Moreover, RFI is expected to increase by 1.52 for each unit increase in FCR, which implies that RFI is expected to improve as a result of selection for FCR. Thus, selection on either trait may have similar effects on BW. Although RFI is calculated by adjusting for BW, it may not guarantee that the genetic correlation is zero. Thus, independence of RFI from BW could be confounded by the change in growth rate [26, 27]. Depending on the ultimate goal of the breeding program, the causal effects between BW and these two feed efficiency traits suggest favorable conditions for the joint genetic improvement of these three traits. Selection for lower FCR is expected to lead to better RFI and BW. It is well known that RFI and FCR are genetically strongly related [9, 12] and have weak relationships with BW, given that efficiency traits are implemented to reduce feed consumption and increase/maintain body weight gain. Uncovering causal effects provides insights and hypotheses about how causal relationships between traits may contribute to better selection decisions.

Based on the estimated structural coefficients, BW and BMY are expected to influence WS negatively, which indicates that for each kg of BW and for each 1-unit of BMY, WS is expected to drop by 0.27 ± 0.06 and 0.03 ± 0.00, respectively (Fig. 2). These three phenotypes are key traits in turkey breeding programs. Body weight and BMY are among the most important phenotypes in turkey populations raised for meat, while adequate walking ability ensures that birds maintain good health and fitness. Genetically, BW and BMY are unfavorably correlated with WS (e.g., [9, 26, 27]), which could be due to the relatively faster rate of improvement in BW and BMY as a result of direct selection for these traits, than the rate of improvement in muscles and bones of the legs. The close relationship between body size and the relative proportion of body parts is expected to cause a decline in the proportion of leg muscle and bone as BW and BMY increase [28, 29]. This disproportionate relationship is associated with genetic increases in BW [30].

Nestor et al. [28] reported that leg abnormalities starting at the age of 16 weeks has increased in turkeys as a result of years of selection for body and breast meat. Our study shows that interventions (e.g., management practices [5, 17]) on BW or BMY may block the indirect genetic effects through them on WS. Previous studies have reported that walking ability can be improved by genetic selection without negatively affecting BW [29, 31]. From a six-generation study, Emmerson et al. [29] reported that a single-trait selection for increased shank width resulted in improved walking ability in large-bodied turkeys, while BW continued to increase. These findings are consistent with results reported in Nestor et al. [32] from two selection experiments that aimed at increasing the relative proportions of leg muscle and bone in turkeys. Although the magnitude of the causal coefficient of BMY on WS was low (− 0.03 ± 0.001), the influence of BMY on WS could be strong, given that the phenotype of BMY is large (24.37 ± 2.33; Table 1). The influence of BMY on waking ability could also be due to a physical pressure of the breast on the legs and, as a result, on walking ability.

Accuracy and bias

Estimates of prediction accuracy and bias of EBV from MTM (PBLUP and ssGBLUP; Abdalla et al. [9]) and SEM (SEPBLUP and SEssGBLUP) are in Tables 3 and 4, respectively. As expected, there was no difference in the accuracy of EBV for the upstream traits (FCR and BMY) between MTM and SEM since they were unconditioned in the SEM. However, estimates of the accuracy of EBV for the three downstream traits (RFI, BW and WS) were higher with SEM than with MTM. The accuracy estimates for RFI, BW, and WS increased by approximately 5, 6, and 19%, from 0.21, 0.36 and 0.26 under PBLUP to 0.22, 0.38 and 0.31, respectively, under SEPBLUP. Similar increases in accuracy were also observed between ssGBLUP and SEssGBLUP, with increases by approximately 7, 5, and 20% for RFI, BW, and WS, respectively. Estimates of bias showed slight differences between the MTM and SEM models, with BW and WS having the largest differences (Table 4). The regression coefficients for BW were 0.86 ± 0.03 and 0.89 ± 0.03 with SEPBLUP and SEssGBLUP, respectively, and 0.82 ± 0.03 and 0.83 ± 0.03 with PBLUP and ssGBLUP, respectively. With SEM, estimates of bias for WS also tended to reach 1 for SEPBLUP (0.73 ± 0.04 to 0.78 ± 0.04) and SEssGBLUP (0.75 ± 0.04 to 0.82 ± 0.04).

Table 3 Accuracy of estimated breeding values for the studied traits based on the pedigree-based best linear unbiased prediction (PBLUP), single-step genomic best linear unbiased prediction (ssGBLUP), structural equation PBLUP (SEPBLUP), and the structural equation ssGBLUP (SEssGBLUP)
Table 4 Bias of estimated breeding values for the studied traits based on pedigree-based best linear unbiased prediction (PBLUP), single-step genomic best linear unbiased prediction (ssGBLUP), structural equation PBLUP (SEPBLUP), and the structural equation ssGBLUP (SEssGBLUP))

Our results indicate that the accuracy of EBV could be enhanced by incorporating causal relationships between traits. When traits share causal effects among them, breeding strategies based only on MTM analysis could lead to wrong selection decisions [5], which would result in slower selection progress than expected and the indirect effects between traits would be ignored. As discussed in Abdalla et al. [9], in turkey breeding programs selection is not expected to be among selection candidates from different generations. As a result, the biases in EBV can be neglected. However, biases should be adjusted for and carefully examined when selection candidates are from different generations. Incorporating the causal structures may slightly reduce the biases in EBV, but other techniques should be used to account for any potential bias of predictions that such models may yield.

The MTM and SEM resulted in similar rank correlations of EBV for all traits for both pedigree-based and genomic analyses. While the EBV for the upstream traits (FCR and BMY) showed complete agreement between MTM and SEM, with rank correlations of 1.0, as expected, rank correlations for RFI, BW, and WS were 0.88, 0.93 and 0.86, respectively. Although RFI was conditioned by only one trait (FCR), it showed a lower rank correlation of EBV than BW (conditioned by FCR and RFI) and similar to WS (conditioned by all traits). This could be due to the number of selection candidates for each trait. Abdalla et al. [32] compared rank correlations between linear and threshold animal models using different numbers of selection candidates and reported that the similarity in rankings increased as the number of selection candidates increased. Depending on breeding plans, the highest-ranking animals are more likely to be selected for breeding. Hence, selection of the best parents to produce the next generation may not be as accurate when ignoring causal relationships between traits.

In this study, males were phenotyped for all traits whereas females were only phenotyped for body weight and walking ability. As a result, analysis of only the female population would be limited to the connection between body weight and walking ability. Nevertheless, including data on females enhances the accuracy of EBV and of estimates of genetic parameters [1]. This was even more beneficial in this study because a large portion of the genotyped animals were females, which should improve the identification of the relationships between animals leading to a higher predictive ability for the genetic merit for the selection candidates [33]. It should be noted that EBV obtained from SEM are adjusted for both systematic effects and structural coefficients. In practice, it is quite important for turkey breeders to investigate the causality network between all traits in females and perform more accurate selection decisions, including traits that are not measured in the female population.

Descriptive statistical models that lack information about causal relationships among traits may not detect the changes in dependent phenotypes due to external interventions on causal parents traits [5, 17]. This is where the inevitable ambiguity of relying on correlation information in making inferences about the association among a set of traits lies. One of the most common strategies to assess dependencies and causal effects of interest is conditioning [34]. This can be done, as in this study, by using covariate adjustment to include realized values of explanatory variables in the linear predictor of a statistical model [35]. Investigating the phenotypic networks among physiological traits that may exert causal effects on each other is absolutely relevant for accelerating breeding programs. For instance, increased feed intake in high producing dairy cows enhances liver blood flow and metabolism, which in turn, influences the concentrations of circulating critical innate reproductive hormones, such as estradiol and progesterone, leading to negative effects on their reproductive performance [36]. Thus, ignoring the structural relationships between traits and applying selection based only on MTM models may result in slower genetic progress in economically important traits.


In this study, we investigated the functional relationships between five traits in a turkey line and their effects on the accuracy of EBV under pedigree-based and single-step genomic evaluation models. Applying a 95% posterior density interval for the posterior distribution of the residual partial correlation uncovered links between the traits, but some of them were not directed. A fully DAG was obtained with the use of a narrower (85%) HPD interval. The SEM based on this graph allowed the potential indirect genetic effects between the traits and their magnitudes to be estimated. Differences in the estimates of genetic variance and the ranks of the EBV of selection candidates between the MTM and the SEM suggest that the rate of genetic improvement for the breeding goal could be reduced if selection decisions are based only on EBV derived from the MTM. These findings were supported by the higher prediction accuracies for conditioned traits compared to assuming no connections between phenotypes. Our results also suggest that interventions on BW or BMY may block the indirect genetic effects through them on WS. Using structural equation models can accelerate genetic progress and enhance the accuracy of EBV for the selection of turkey candidates in both pedigree-based and single-step genomic models.

Availability of data and materials

Data that support the findings of this study are available from Hybrid Turkeys upon reasonable request, but restrictions apply to the availability of these data, which were used under license for the current study, and thus are not publicly available.


  1. 1.

    Henderson CR, Quaas RL. Multiple trait evaluation using relatives’ records. J Anim Sci. 1976;43:1188–97.

    Article  Google Scholar 

  2. 2.

    Mrode RA. Linear models for the prediction of animal breeding values. Wallingford: CABI; 2014.

    Google Scholar 

  3. 3.

    Pearl J. Causality: Models, reasoning, and inference. 2nd ed. New York: Cambridge University Press; 2009.

    Google Scholar 

  4. 4.

    Gianola D, Sorensen D. Quantitative genetic models for describing simultaneous and recursive relationships between phenotypes. Genetics. 2004;167:1407–24.

    Article  Google Scholar 

  5. 5.

    Rosa GJM, Valente BD, de los Campos G, Wu XL, Gianola D, Silva MA. Inferring causal phenotype networks using structural equation models. Genet Sel Evol. 2011. 43:6.

  6. 6.

    Bouwman AC, Valente BD, Janss LLG, Bovenhuis H, Rosa GJM. Exploring causal networks of bovine milk fatty acids in a multivariate mixed model context. Genet Sel Evol. 2014;46:2.

    Article  Google Scholar 

  7. 7.

    Inoue K, Valente BD, Shoji N, Honda T, Oyama K, Rosa GJM. Inferring phenotypic causal structures among meat quality traits and the application of a structural equation model in Japanese Black cattle. J Anim Sci. 2016;94:4133–42.

    CAS  Article  Google Scholar 

  8. 8.

    Quinton CD, Wood BJ, Miller SP. Genetic analysis of survival and fitness in turkeys with multiple-trait animal models. Poult Sci. 2011;90:2479–86.

    CAS  Article  Google Scholar 

  9. 9.

    Abdalla EA, Schenkel FS, Emamgholi Begli H, Willems OW, van As P, Vanderhout R, et al. Single-step methodology for genomic evaluation in turkeys (Meleagris gallopavo). Front Genet. 2019;10:1248.

    Article  Google Scholar 

  10. 10.

    Emamgholi Begli H, Wood BJ, Abdalla EA, Balzani A, Willems O, Schenkel F, et al. Genetic parameters for clutch and broodiness traits in turkeys (Meleagris gallopavo) and their relationship with body weight and egg production. Poult Sci. 2019;98:6263–9.

    CAS  Article  Google Scholar 

  11. 11.

    Tu X, Du S, Tang L, Xin H, Wood B. A real-time automated system for monitoring individual feed intake and body weight of group housed turkeys. Comput Electron Agric. 2011;75:313–20.

    Article  Google Scholar 

  12. 12.

    Case LA, Wood BJ, Miller SP. The genetic parameters of feed efficiency and its component traits in the turkey (Meleagris gallopavo). Genet Sel Evol. 2012;44:2.

    Article  Google Scholar 

  13. 13.

    Valente BD, Rosa GJM, de los Campos G, Gianola D, Silva MA. Searching for recursive causal structures in multivariate quantitative genetics mixed models. Genetics. 2010;185:633–44.

  14. 14.

    Misztal I, Legarra A, Aguilar I. Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information. J Dairy Sci. 2009;92:4648–55.

    CAS  Article  Google Scholar 

  15. 15.

    Misztal I, Tsuruta S, Lourenco D, Aguilar I, Legarra A, Vitezica Z. Manual for BLUPF90 family of programs. Athens: University of Georgia; 2014.

    Google Scholar 

  16. 16.

    Geweke J. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. Staff Report 148. Vol. 196. Research Department Minneapolis: Federal Reserve Bank of Minneapolis; 1991.

  17. 17.

    Valente BD, Rosa GJM. Mixed effects structural equation models and phenotypic causal networks. Methods Mol Biol. 2013;1019:449–64.

    Article  Google Scholar 

  18. 18.

    R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2019.

  19. 19.

    Valente BD, Rosa GJM, Silva MA, Teixeira RB, Torres RA. Searching for phenotypic causal networks involving complex traits: an application to European quail. Genet Sel Evol. 2011;43:37.

    Article  Google Scholar 

  20. 20.

    Valente BD, Rosa GJM, Gianola D, Wu X-L, Weigel K. Is structural equation modeling advantageous for the genetic improvement of multiple traits? Genetics. 2013;194:561–72.

    Article  Google Scholar 

  21. 21.

    Willems OW, Miller SP, Wood BJ. Assessment of residual body weight gain and residual intake and body weight gain as feed efficiency traits in the turkey (Meleagris gallopavo). Genet Sel Evol. 2013;45:26.

    Article  Google Scholar 

  22. 22.

    Schenkel FS, Miller SP, Wilton JW. Genetic parameters and breed differences for feed efficiency, growth, and body composition traits of young beef bulls. Can J Anim Sci. 2004;84:177–85.

    Article  Google Scholar 

  23. 23.

    Hoque MA, Kadowaki H, Shibata T, Oikawa T, Suzuki K. Genetic parameters for measures of the efficiency of gain of boars and the genetic relationships with its component traits in Duroc pigs. J Anim Sci. 2007;85:1873–9.

    CAS  Article  Google Scholar 

  24. 24.

    Gunsett FC. Problems associated with selection for traits defined as a ratio of two component traits. In Proceedings of the 3rd World Congress on Genetics Applied to Livestock Production: 16–22 July 1986; Lincoln. 1986. p. 437–42.

  25. 25.

    Berry DP, Crowley JJ. Residual intake and body weight gain: a new measure of efficiency in growing cattle. J Anim Sci. 2012;90:109–15.

    CAS  Article  Google Scholar 

  26. 26.

    Aggrey SE, Karnuah AB, Sebastian B, Anthony NB. Genetic properties of feed efficiency parameters in meat-type chickens. Genet Sel Evol. 2010;42:25.

    Article  Google Scholar 

  27. 27.

    Arthur PF, Archer JA, Johnston DJ, Herd RM, Richardson EC, Parnell PF. Genetic and phenotypic variance and covariance components for feed intake, feed efficiency, and other postweaning traits in Angus cattle. J Anim Sci. 2001;79:2805–11.

    CAS  Article  Google Scholar 

  28. 28.

    Nestor KE, Bacon WL, Moorhead PD, Saif YM, Havenstein GB, Renner PA. Comparison of bone and muscle growth in turkey lines selected for increased body weight and increased shank width. Poult Sci. 1987;66:1421–8.

    CAS  Article  Google Scholar 

  29. 29.

    Emmerson DA, Anthony NB, Nestor KE, Saif YM. Genetic association of selection for increased leg muscle and increased shank diameter with body composition and walking ability. Poult Sci. 1991;70:739–45.

    CAS  Article  Google Scholar 

  30. 30.

    Nestor KE, Anderson JW, Patterson RA. Genetics of growth and reproduction in the turkey. 14. Changes in genetic parameters over thirty generations of selection for increased body weight. Poult Sci. 2000;79:445–52.

    CAS  Article  Google Scholar 

  31. 31.

    Nestor KE, Bacon WL, Saif YM, Renner PA. The influence of genetic increases in shank width on body weight, walking ability, and reproduction of turkeys. Poult Sci. 1985;64:2248–55.

    CAS  Article  Google Scholar 

  32. 32.

    Abdalla EA, Rosa GJM, Weigel KA, Byrem T. Genetic analysis of leukosis incidence in United States Holstein and Jersey populations. J Dairy Sci. 2013;96:6022–9.

    CAS  Article  Google Scholar 

  33. 33.

    Christensen OF, Madsen P, Nielsen B, Ostersen T, Su G. Single-step methods for genomic evaluation in pigs. Animal. 2012;6:1565–610.

    CAS  Article  Google Scholar 

  34. 34.

    Elwert F, Winship C. Endogenous selection bias: The problem of conditioning on a collider variable. Annu Rev Sociol. 2014;40:31–53.

    Article  Google Scholar 

  35. 35.

    Dohoo IR, Martin SW, Stryhn H. Veterinary epidemiologic research. 2nd ed. Charlottetown: VER Inc.; 2009.

    Google Scholar 

  36. 36.

    Wiltbank M, Lopez H, Sartori R, Sangsritavong S, Gümen A. Changes in reproductive physiology of lactating dairy cows due to elevated steroid metabolism. Theriogenology. 2006;65:17–29.

    CAS  Article  Google Scholar 

Download references


The authors extend their gratitude to Jeff Mohr and personnel of the Hybrid Turkeys pedigree farms (Kitchener, Canada) for collecting and providing data used in this study.


This study was conducted as part of the project entitled ‘Application of genomic selection in turkeys for health, welfare, efficiency and production traits’ funded by the government of Canada through the Genome Canada Genomic Application Partnership Program and administered by Ontario Genomics (recipients: C.F. Baes (Academic), B.J. Wood (Industry)). This study was also financially supported by Hybrid Turkeys (Kitchener, Canada).

Author information




EAA, BJW, CFB conceived and designed the experiments. EAA analyzed the data. EAA, BJW, CFB interpreted results and wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Emhimad A. Abdalla.

Ethics declarations

Ethics approval and consent to participate

This research project was conducted with full adherence to the principles espoused by the Canadian Council on Animal Care, the Hendrix Genetics Animal Welfare Policy, and the University of Guelph Animal Care Committee (Animal Use Protocol #3782).

Consent for publication

The authors consent that all parts of this study can be freely available.

Competing interests

The authors declare they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Abdalla, E.A., Wood, B.J. & Baes, C.F. Accuracy of breeding values for production traits in turkeys (Meleagris gallopavo) using recursive models with or without genomics. Genet Sel Evol 53, 16 (2021).

Download citation