Heritability of longevity in Large White and Landrace sows using continuous time and grouped data models
© Mészáros et al. 2010
Received: 20 October 2009
Accepted: 13 May 2010
Published: 13 May 2010
Skip to main content
© Mészáros et al. 2010
Received: 20 October 2009
Accepted: 13 May 2010
Published: 13 May 2010
Using conventional measurements of lifetime, it is not possible to differentiate between productive and non-productive days during a sow's lifetime and this can lead to estimated breeding values favoring less productive animals. By rescaling the time axis from continuous to several discrete classes, grouped survival data (discrete survival time) models can be used instead.
The productive life length of 12319 Large White and 9833 Landrace sows was analyzed with continuous scale and grouped data models. Random effect of herd*year, fixed effects of interaction between parity and relative number of piglets, age at first farrowing and annual herd size change were included in the analysis. The genetic component was estimated from sire, sire-maternal grandsire, sire-dam, sire-maternal grandsire and animal models, and the heritabilities computed for each model type in both breeds.
If age at first farrowing was under 43 weeks or above 60 weeks, the risk of culling sows increased. An interaction between parity and relative litter size was observed, expressed by limited culling during first parity and severe risk increase of culling sows having small litters later in life. In the Landrace breed, heritabilities ranged between 0.05 and 0.08 (s.e. 0.014-0.020) for the continuous and between 0.07 and 0.11 (s.e. 0.016-0.023) for the grouped data models, and in the Large White breed, they ranged between 0.08 and 0.14 (s.e. 0.012-0.026) for the continuous and between 0.08 and 0.13 (s.e. 0.012-0.025) for the grouped data models.
Heritabilities for length of productive life were similar with continuous time and grouped data models in both breeds. Based on these results and because grouped data models better reflect the economical needs in meat animals, we conclude that grouped data models are more appropriate in pig.
Length of productive life is important from economical, herd-health and animal welfare points of view in sustainable animal production. Intensive selection on production and reproduction traits without considering functional and exterior traits can lead to decreased longevity [1, 2]. In Austria, exterior traits are taken in account during selection of replacement gilts before the first insemination. At this stage only a negative selection is carried out, without any official recording for later use. However, data on length of productive life and number of piglets born/weaned are routinely collected and available for Herdbook sows. The total number of piglets born or weaned can also be used to express the lifetime production of sows, but genetic evaluation of litter size is already implemented in the Austrian system.
Length of productive life measured as the number of days between first farrowing and culling has been analyzed in several studies using either Cox  or Weibull models . During the productive life of sows, the period between weaning and conception can be non-productive and optimally, it should be kept as short as to ensure the highest number of litters. Using conventional measurements of lifetime (i.e. number of days between first farrowing and culling), it is not possible to differentiate between productive and non-productive days during a sow's lifetime, which can lead to less pertinent results in breeding value estimation . For this reason, a sow's productive life would be better expressed as the number of completed parities.
This approach requires rescaling of the time axis from a continuous scale into several discrete classes. The consequence of this approach is that Cox and Weibull models will no longer be valid, because these usual approaches assume continuity of the baseline hazard distribution and/or absence of ties between ordered failure times . Instead, grouped survival data (discrete survival time) models introduced by Prentice and Gloeckler  can be used. Grouped data models have been used in beef cattle , rabbits  and dairy cattle to evaluate fertility traits , but not for length of productive life in pigs.
The aim of this study is to compare the performance of Weibull and grouped data models and to estimate heritabilities using different genetic models for Large White and Landrace sows.
For the Weibull model, length of productive life was defined as the number of days between first farrowing and culling. We assumed that culling took place either at the last weaning of the sow or 28 days after the last farrowing if the number of weaned piglets was known, but if both weaning date and number of weaned piglets were missing, culling date was set one day after the last farrowing. The choice of setting the culling date at 28 days after the last farrowing for sows with incomplete weaning date was based on the average nursing period in the whole dataset.
Two intervals per parity were used for the grouped data model: from farrowing to weaning and from weaning to the next farrowing. Intervals were numbered sequentially from 1 up to 18 (i.e. sow alive after the 9th weaning). Animals were censored either at the date of the last weaning, if they were alive at the time of data collection or at the 9th weaning if they were alive at the 10th farrowing. Only sows born after 1995 were included in the evaluation and age at first farrowing between 250 and 550 days was required.
The analysis was carried out using a proportional hazards model (assumed to be either the Weibull model or a grouped data model) with the Survival Kit v6 program package .
where h*y i is the time-dependent random effect of herd and year of farrowing assumed to follow a log-gamma distribution, aff j the fixed time independent effect of class of age at first farrowing, par*pigl k the time-dependent effect of interaction between parity and relative number of piglets (see below for detailed description), hs l the time-dependent effect of annual herd size change. The random genetic component g m differed, defining an animal, sire, sire-maternal grandsire, sire-dam or sire-maternal grandsire-dam within a maternal grandsire genetic model.
For age at first farrowing, 33 classes were created with one-week intervals, where the first group contained animals up to 43 weeks of age and the last group contained animals older than 75 weeks at first farrowing.
Parity and classes for piglets born alive relative to the annual herd's mean were combined into an interaction term and included into the model (similar to ). This was done in several steps:
where cnp is the corrected number of piglets and n is the number of the parity ranging between 2 and 6.
where cnp is the corrected number of piglets born alive and m is the number of the parity ranging between 1 and 9 (maximal parity after censoring).
Ten classes (relative piglet classes or RPC) were created according to percentage deviation from the herd mean, as follows: <75%, 75-85%, 85-90%, 90-95%, 95-100%, 100-105%, 105-110%, 110-115%, 115-125% and >125%. RPC were inserted in the model as an interaction term with the parity number. Classes were recoded as numbers with three digits, where the first digit denoted the parity number (from 1 to 9), and the last two digits the RPC class (from 1 to 10).
Similarly the annual herd size changes were grouped into eight classes, according to number of farrowing per herd and year, where January 1st of each year was considered as cut point. In case the number of farrowings was equal or below 10, no change was accounted for that particular year. The bounds for classes were: decrease by more than 50%, decrease by 30-50%, 30-10%, between decrease by 10% and increase by 10%, increase by 10-30%, 30-50%, 50-100% and increase by 100% and more.
Longevity of sows can also be influenced by index values on growth traits but since these indexes are not routinely saved in Austria, we could not include them into the models.
Grouped data models are a special case of proportional hazards models, where failure times are grouped into intervals Ai = [ai-1, ai), i = 1, ..., r with a0 = 0, ar = +infinite and failure times in Ai are recoded as ti. Therefore the regression vector is assumed to be time-dependent but fixed within each time interval .
For the grouped data models, the same effects as for the Weibull model were used.
For both Large White and Landrace databases, the same structure of fixed and random effects was used. All models accounted for pedigree information up to the third generation of ancestors. The genetic variance was estimated as the mode of its approximate posterior density after Laplace integration of the other parameters . At the same time, the mean, variance and skewness of this posterior density were obtained. Knowing these three parameters, makes it possible to draw the posterior density of the variance component using a Gram-Charlier approximation.
The standard deviation of the posterior density can be interpreted as a conservative estimate of the standard error. From this, the standard error of the heritabilities was computed using the Delta method (see e.g. ).
In this case, the sow's sire was included in the model, accounting for 1/4 of the genetic variance. To be correct, the model implicitly assumes that mates are non-related, non-inbred, non-selected and that each dam has one recorded progeny only. The pedigree file contained the sires' sire and sires' maternal grandsire.
where is the genetic variance, var(h × y) the herd year variance, p the proportion of uncensored animals and genetic variance = 4 * sire variance.
This was similar to the sire model, but the sow's maternal grandsire was also included in the model and recoded jointly with the sires. This model accounts for 1/4 + 1/16 = 5/16 of the genetic variance under the same assumptions as the sire model (i.e. mates are non-related, non-inbred, non-selected and each dam has one progeny only). Additionally dams can be related and selected through their sire (i.e. the maternal grandsire of the progeny).
where is the genetic variance, var(h × y) the herd year variance, p the proportion of uncensored animals and genetic variance = 4 * sire variance. The additional 1/16 genetic variance in the denominator stands for the maternal grandsire's variance.
where is the genetic variance, var(h × y) the herd year variance, p the proportion of uncensored animals and genetic variance = 4 * sire variance. The genetic variance was multiplied by 2 in the denominator (compared to the sire model) because the sire and dam variances are assumed equal.
This model is in some sense a compromise between sire-maternal grandsire and sire-dam models, because the relationship matrix involves only males and it still accounts for repeated records. The sow's sire and maternal grandsire are recoded together in the data preparation step. In the final model the sire, maternal grandsire and dam of the sow are included as separate random effects.
This model does not account for the Mendelian sampling term of the animal but in contrast with the sire-maternal grandsire model, sisters can have different genetic values and more than one progeny each in which case a non genetic maternal effect is also accounted for. The main difference with a sire-dam model including a maternal effect is that dams are considered as related only through their sire (i.e. maternal grand dams are supposed to be unselected and to have only one progeny each).
where is the genetic variance, var(h × y) the herd year variance, p the proportion of uncensored animals, the dam within maternal grandsire variance and genetic variance = 4 * sire variance. The additional 1/16 genetic variance in the denominator stands for the maternal grandsire variance.
where is the genetic variance, var(h × y) the herd year variance, p the proportion of uncensored animals.
Large White a
LPL at last litter (days)
Piglets born alive per parity
Age at first farrowing (days)
Within parities the risk was highest for sows with a litter size below 75% of the herd's average in a given year, with only a slight decrease for sows with a higher number of piglets. First parities of both Large White and Landrace sows seemed to be exceptions from this pattern. In Large White sows, only a slight decrease of the risk ratio was observed throughout the classes and the peak value for the worst class was much lower. In Landrace sows, the risk ratio was much higher for sows with a litter size below that of the farm's average, but for the other classes no clear tendency was observed.
For Large White sows, the risk ratios were similar for farrowings 2 to 4 in both Weibull and grouped data models and from parity 6 (not shown) it increased. However, during the last parity the risk dropped for the grouped data model, compared to the Weibull model. For Landrace sows, the risk ratios were similar only for the first three parities in grouped data models, and increased from parity 4 onwards. Again during the last parity (not shown) the risk ratio dropped considerably. Unlike the other cases, the risk of culling decreased between parities for the Weibull model in the Landrace breed.
Genetic variances and heritabilities for Large White (herd*year var = 0.325)
Grouped data scale
variance (std deviation)
heritability (std deviation)
variance (std deviation)
heritability (std deviation)
Sire mgs dam
0.169 a (0.018)
0.166 b (0.017)
Genetic variances and heritabilities for Landrace (herd*year var = 0.233)
Grouped data scale
Variance (std deviation)
Heritability (std deviation)
Variance (std deviation)
Heritability (std deviation)
Sire mgs dam
0.074 a (0.011)
For the Landrace breed, heritabilities range between 0.05 and 0.08 (s.e. 0.014 - 0.020) for the continuous time and between 0.07 and 0.11 (s.e. 0.016 - 0.023) for the grouped data models. Heritabilities for Large White sows range between 0.08 and 0.14 (s.e. 0.012 - 0.026) for the continuous time and between 0.08 and 0.13 (s.e. 0.012 - 0.025) for the grouped data models. Whatever the breed, heritabilities are highest for the sire and sire-dam models. The sire-mgs and the sire-mgs-dam within mgs models gave similar, but smaller heritabilities. For the latter model, the dam within mgs variance is supposed to include 3/16 of the genetic variance plus the maternal non genetic effect variance if such an effect exists. In fact, for the Large White breed, the estimated within mgs variance of the dam (0.016/0.015) was smaller than 3/16 of the estimated genetic variance (0.031 for both models). This inconsistency was not observed in the Landrace breed. Hence, if a maternal non genetic effect exists, it should be very small. Finally, the lowest genetic variances and heritabilities were obtained with the animal model.
The average length of productive life in the whole dataset was 503 days for Large White and 596 days for Landrace sows, with large standard deviations. Results for the Landrace breed were comparable, but for the Large White breed, they were lower than those reported in the literature i.e. 617 days in  and 602 days in . Such results are heavily dependent on the amount of censored records and the length of the study period.
Prolonged productive life is important for two main reasons: 1. in general, the number of piglets born during farrowing 3 and 4 is higher than during the first farrowings, which means that with a higher proportion of older sows, piglet production increases. 2. On breeding farms with short generation intervals, it is especially important that only a low proportion of sows be culled for health and fertility problems, and the remaining ones be selected according to production traits, like litter size, fattening or carcass traits.
Our results show that age at first farrowing affects the risk of culling only for animals that have their first litter very early or late in life. A very young sow is not prepared to give birth because of its body development is not sufficient. This particular problem seems to affect only sows farrowed before 43 weeks of age. This result is in agreement with  in Danish Landrace herds for which in the case of an early first mating i.e. before 210 days of age, the risk for culling was higher than for sows mated later. After the 43rd week, the risk ratio dropped to a level around that of the reference class. Age at first farrowing increased the risk of culling again, for sows older than 60 weeks. If we assume that all sows are supposed to be put into reproduction at the same age, then it is likely that these sows had certain problems preventing them to conceive earlier. If these problems had persisted, they could have been culled early based on the higher risk ratios. Similar results have been published by [2, 16, 17].
When the production level of the animals is included in the statistical model, the genetic value of the "functional" length of productive life can be approximated, as production is usually the main source of voluntary culling. Hopefully, selection on functional longevity would lead to a reduction in involuntary culling because of reproduction or health problems. When the production of the animal is not taken into account in the model, the genetic effect reflects the "true" longevity, which means that voluntary (i.e. for low production) and involuntary culling reasons are considered together.
In this paper, we have decided to focus on functional longevity, and therefore, we have included the number of piglets born alive relative to the annual herd's mean in our model, to account for phenotypic selection on litter size. Working only with absolute numbers of piglets would not be appropriate, because production at a young age is generally lower than at an age when the body is fully developed (milk production in cows , goats , litter size in sheep , litter size in pigs ). Also culling decisions based on litter size may vary with herd management and year. In other words, the same litter size can be treated differently in different farms or on the same farm in different years. It is therefore necessary to evaluate the farmers' decisions based on time and place, when and where they take place.
The risk ratios for the interaction between parity and RPC decreased within parity. This means that sows with a higher number of piglets born alive are clearly favoured, regardless of age of the sow or farrowing number. In most cases the risk of culling for sows with litters 25% above the herd's average is two to three times lower than for sows 25% under the herd's average. This was not the case at first parity, when the risk was only slightly lower for Large White sows, and without any clear tendency for nearly all classes of Landrace sows. The reason can be that the farmer did not want to cull the younger sows with a low number of piglets immediately, but rather wanted to give them another chance. Our results are similar to those of  for risk ratios between parities. Although in our case the parity number was included in an interaction term, the risk ratios followed a similar pattern.
The results can be compared better with the studies by [2, 3] who also evaluated the risk of culling as an interaction term with parity number. In both cases, they found an increased risk of culling for sows with poor performance, which is similar to our results. There was a difference when comparing risk of culling between parities. Engblom et al.  have found that the risk of culling was relatively low for parities 2 to 7, while in our study, risks of culling were similar throughout the first parities and increased later in life. Brandt et al.  have used standardized values for the number of piglets born and have reported a relatively stable risk at the beginning of the productive life. They have concluded that culling decisions based on performance are made between parities 4 and 5. These results are in agreement with our findings, which show that the risk of culling was low for the first parities, and increased from parity 4 in Landrace sows. In Large White sows, the risk of culling was higher in parities 2 to 6, compared to the first parity, with an even more rapid increase from parity 7 onwards.
The risk of culling for effect of herd size change was similar for both breeds. The risk was highest for sows from herds that dramatically decreased in numbers with more than 50%. Decrease of farm size to such an extent could mean, that some extraordinary event happened (major financial problems or disease outbreak), which led to closure of the farm or extreme shrinkage of the herd. In the future, these records should be treated as right censored, because most likely the majority of the animals would live longer in normal circumstances.
In Large White herds for the second worst class, the risk was twice the one of the reference class, but it was stable in the case of only a slight decrease or increase in herd size. For the Landrace dataset the risk was extremely high for the worst class, but decreased rapidly in the second worst category, when the risk of culling was only 1.5 times higher for the Weibull model and 1.3 times higher for the grouped data model, in comparison with the reference class. Only a slight reduction in risk of culling for Large White herds increasing in size by more than 50% was detected. These results suggest that the farmer's decision to increase his herd size has a lower impact on keeping existing animals in the herd compared to the culling as a result of herd shrinkage. This indicates that the culling process on these farms continues in its usual way, probably because the expansion is done by introducing younger animals either from their own production or from other farms.
Growth performance undoubtedly influences the length of the productive life in pigs, and as such it should be included in the evaluation. New indexes for growth are calculated every two weeks in Austria and selection is based on these indexes. Including these in our models would be desirable, but unfortunately these indexes are not routinely saved. Nevertheless, the functional length of productive life could be modelled even better if they are added in the model.
Most estimations of genetic variance in survival models have been based on sire or sire-maternal grand sire models [4, 16, 22, 23]. Two reasons have justified such choices: first, the Laplace approximation of the posterior density of the genetic variance requires the repeated inversion of the Hessian matrix of the log-posterior density, which quickly becomes too time-consuming for large (animal model) applications. Due to the existence of time-dependent variables, this matrix is often less sparse than the usual mixed model coefficient matrix. Second, the quality of the Laplace approximation has been shown to depend on the number of observations per level of genetic effects . Indeed, for many years, it was believed that animal models could not fit with the Survival Kit because of this alleged poor performance of the Laplace approximation. It has been shown recently (see  for references) that this concern was not justified when the data structure is adequate (several generations of related females). Increase in computing power has also made estimation easier for animal models on larger populations.
In this study, the sire model systematically led to the largest estimates of genetic variances (140 to 180% of the animal model genetic variance). However, it can be noted that there is a large uncertainty associated with the estimation of the genetic variance and that the overlap of its credible set with those of the other models is large. It has been claimed that the sire survival model is not consistent  because the error term of the survival models is not normally distributed and therefore does not properly include the remaining 3/4 of the genetic variance. One potential reason for the overestimation of the genetic variance may be a poor partition of the genetic variance between the sire variance and the error term.
In contrast, animal models gave low estimates of genetic variances and heritabilities. The other models gave estimates of genetic parameters very similar to the animal model in Landrace sows, but intermediate between the sire and animal models in Large White sows. The reason of this difference between breeds is unclear. Whatever the breed, the sire-mgs-dam within mgs does not present much advantage: the genetic variance estimates are almost exactly those obtained with the sire-mgs model and the dam component is small, not even representing the expected 3/16 of the genetic variance in Large White sows. The use of an animal model or a sire-mgs model to estimate the genetic variance seems advisable to account for all (or most) of the relationships in the population. The sire-mgs is less satisfactory from a modelling point of view but is favourable when the data structure is not adequate for the use of the animal model and also for easier computing.
Heritabilities in our study ranged between 0.08 and 0.14 for Large White and between 0.04 and 0.11 for Landrace sows depending on the model type. In general, heritabilities showed some variation, but after considering the standard error of estimates it seems that none of the genetic models is clearly superior to another.
To compare our results with those of other authors, we considered only the studies using survival analysis and not those using linear models, because estimates from these methods are not comparable .
Yazdi et al.  have published a rather wide range of heritabilities for longevity in Landrace populations ranging from 0.11 to 0.27. Serenius and Stalder  have found heritabilities around 0.16-0.17 for Landrace and 0.17-0.19 for Large White populations. It is important to note that, in both these papers, heritabilities were computed assuming that the residual effects followed an extreme value distribution with the variance π2/6, while we used the effective heritability according to . This could potentially lead to differences in results depending on parameters of the Weibull distribution. For more details see . In other words the direct comparison of heritabilities on log or original scale and the effective heritability is not possible.
To obtain more comparable results, we used the variances of [16, 23] and inserted them in the equation of  (the same as we used for the sire model in our study). We used the sire variance of 0.037 in  using a model scenario similar to that in our study (herd*year treated as random, time dependent variable with 1 year intervals, with gamma parameter 5.76). Based on the estimated variance of the random herd*year effect and computed from the estimate of the gamma parameter, their effective heritability was 0.09. This result is very similar to our result of 0.07 from the sire model in the Landrace breed.
Using the same approach, we recalculated the heritability from  using the original sire's variances 0.068 to 0.081, resulting in effective heritabilities ranging between 0.18 and 0.21. These newly computed results were close to those in the paper (0.16-0.19) and were higher than our results in all cases. The herd*year effect was treated as a time-dependent fixed effect in this study, so no further adjustment was possible.
Engblom et al.  have used the same heritability definition as in our study, and found values of 0.06 for Landrace and 0.12 for Yorkshire (Large White) breeds with a sire survival model. Our study supports the previous findings that heritabilities for longevity in different breeds are not necessarily the same, even when using identical models. Their results were very similar to h2 = 0.07 for Landrace and h2 = 0.14 for Large White in our sire models.
The production cycle (interval between two farrowings) in sows consists of three periods: gestation period, suckling period and days open (service period). The heritability for gestation length is 0.25-0.3 [26, 27], but altering its length is not a breeding objective. In the Austrian Large White population, the mean gestation length is 114.8 ± 1.5 days, and in the Landrace 115.3 ± 1.5 days. The nursing period is heavily affected by breeders' decision for herd management. It is usually four weeks in Austria for both Large White (28.2 ± 5.0 days) and Landrace (28.2 ± 6.6 days) populations. The interval from weaning to first ovulation is optimally 5 to 6 days. If fertilisation is not successful, it can be longer with cycles repeated every 21 days. The time intervals for days open show much wider variation compared to gestation length and nursing period, with 16.0 ± 17.2 days in Large White and 14.0 ± 17.3 days in Landrace populations. From the average values and standard deviations of these three major components of the farrowing interval, it is obvious that the larger part of the variation is caused by days open. When considering days of productive life as the dependent variable, sows with longer reproductive cycles will tend to have higher estimated breeding values. This inconsistency is lifted when using grouped data models [6, 8]. More reproductive cycles increase the breeding value of an animal not because it lives longer, but because it produces more.
In this study, alternative models for the genetic evaluation of longevity in sows using continuous time and grouped data models in Large White and Landrace sows were examined. Risk of culling associated with length of productive life was influenced by all fixed and random effects. An increased risk was observed for animals having their first litter before 43 weeks or after 60 weeks of age, for sows from herds rapidly decreasing in size and sows with litter size under the herd's average in the given year. Some differences between the two breeds were observed, but in general the risks of culling for fixed effects in all models were comparable.
Heritabilities for the length of productive life were estimated using sire, sire-maternal grandsire, sire-dam, sire-maternal grandsire-dam within maternal grandsire and animal models with Weibull and grouped data models in both breeds. They ranged from 0.08 to 0.14 (s.e. 0.012-0.026) in Weibull and from 0.08 to 0.13 (s.e. 0.012-0.025) in grouped data models in Large White sows, and from 0.05 to 0.08 (s.e. 0.014-0.020) for the Weibull and from 0.07 to 0.11 (s.e. 0.016-0.023) for the grouped data models in Landrace sows. Sire-mgs models offer a good compromise between accurate modelling of the genetic part and computation requirements in case the data structure is not adequate to use an animal model. Heritabilities are low as for functional traits in general, but given the high economic importance of the length of productive life, it is really worthwhile to consider it as a selection criterion in pig breeding.
The grouped data models in pig tend to focus on the exact number of reproduction cycles rather than on the duration of productive life. Therefore sows with a higher number of parities are favoured, in contrast with sows which live longer just because of longer intervals between parities. The performance of grouped data models was comparable to continuous time models. Based on these results and because grouped data models reflect better the economical needs in meat animals, we conclude that grouped data models are more appropriate in pig.
The final outcome of the study is to provide basic information for routine genetic evaluation of the length of productive life in Austrian pig populations. The grouped data model will be used for its favourable contribution to deal with the non-productive days during a sow's lifetime.
The authors would like to acknowledge Austrian Pig Producers, the Federal Ministry of Agriculture, Österreichische Schweineprüfanstalt GesmbH for founding the project "Genetic Analysis of Lifetime Production (Longevity and Fertility) of Sows" and Austrian Science Found for funding the project "Correlated random effects in survival analysis applied to problems in genetics and epidemiology of dairy cattle and human population", which made the publication of this paper possible. The constructive criticisms and valuable suggestions of both reviewers are greatly appreciated.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.