Skip to main content

Identification of homozygous haplotypes carrying putative recessive lethal mutations that compromise fertility traits in French Lacaune dairy sheep



Homozygous recessive deleterious mutations can cause embryo/fetal or neonatal lethality, or genetic defects that affect female fertility and animal welfare. In livestock populations under selection, the frequency of such lethal mutations may increase due to inbreeding, genetic drift, and/or the positive pleiotropic effects of heterozygous carriers on selected traits.


By scanning the genome of 19,102 Lacaune sheep using 50 k single nucleotide polymorphism (SNP) phased genotypes and pedigree data, we identified 11 Lacaune deficient homozygous haplotypes (LDHH1 to LDHH11) showing a highly significant deficit of homozygous animals ranging from 79 to 100%. These haplotypes located on chromosomes 3, 4, 13, 17 and 18, spanned regions from 1.2 to 3.0 Mb long with a frequency of heterozygous carriers between 3.7 and 12.1%. When we compared at-risk matings (between carrier rams and daughters of carrier rams) and safe matings, seven of the 11 haplotypes were associated with a significant alteration of two fertility traits, a reduced success of artificial insemination (LDHH1, 2, 8 and 9), and/or an increased stillbirth rate (LDHH3, 6, 8, 9, and 10). The 11 haplotypes were also tested for a putative selective advantage of heterozygous carrier rams based on their daughter yield deviation for six dairy traits (milk, fat and protein yields, fat and protein contents and lactation somatic cell score). LDHH1, 3, 4, 5, 7, 9 and 11 were associated with positive effects on at least one selected dairy trait, in particular milk yield. For each haplotype, the most probable candidate genes were identified based on their roles in lethality of mouse knock-out models and in mammalian genetic disorders.


Based on a reverse genetic strategy, we identified at least 11 haplotypes with homozygous deficiency segregating in French Lacaune dairy sheep. This strategy represents a first tool to limit at-risk matings in the Lacaune dairy selection scheme. We assume that most of the identified LDHH are in strong linkage disequilibrium with a recessive lethal mutation that affects embryonic or juvenile survival in sheep but is yet to be identified.


Most of the individuals in a population are likely to be heterozygous for several loss-of-function mutations. When these mutations are homozygous, they very often lead to early embryo death (embryonic lethal mutations), or to developmental defects that affect fetuses and subsequently young individuals with varying degrees of severity [1]. Advances in genomic approaches and whole-genome sequencing in humans or in species of agronomic interest have shown that an individual can carry about a hundred of these mutations [2, 3].

In livestock under selection, the effective size of populations (Ne), which is used as an indicator of genetic diversity, is limited (Ne ~ 100–300) compared with that of the human population (Ne ~ 10,000); and as a result, the number of reproducers is rather small, particularly given the widespread use of artificial insemination (AI) [2, 4]. Even if genetic diversity and inbreeding parameters are managed, selection programs provide a fairly favorable context for the emergence of homozygous individuals with genetic defects that increase in frequency with inbreeding and/or overuse of certain sires and can finally jeopardize fertility in the whole population [5]. This has been observed in cattle where about 1% of the embryos die due to their homozygosity at one of the 10 identified lethal embryonic mutations [1]. In addition, the frequency of recessive lethal alleles could also increase in a population if they are associated with heterozygous advantages due to positive pleiotropic effects on selected production traits such as milk production in dairy cattle [6, 7], although in the homozygous state they are responsible for embryonic losses. Identification of these causal mutations has become a major issue with the emergence of genetic defects with obvious consequences on animal welfare and also have major economic implications. Indeed, in France these disorders cause losses that range from 50 to 100 million euros per year in cattle populations when their impact on fertility (about 5% decrease), loss of calves, and veterinary procedures are included in the calculation [8].

In recent decades, several genomic tools have been developed to help improve fertility in dairy cattle [9]. Among these tools, two methods have enabled the identification and characterization of recessive genetic defects and lethal mutations that affect fertility. First, homozygosity-mapping is an efficient way to map genetic defects based on a case/control approach using only a few biological samples (e.g. DNA or tissues) from affected and non-affected live animals [10]. However, embryonic and fetal lethal mutations, which are more frequently associated with fertility, have not been identified using this approach due to the difficulty to obtain biological samples. These mutations are more efficiently detected by a reverse genetic screen approach using large sets of single nucleotide polymorphism (SNP) chip genotyped animals and fertility records, such as those provided by genomic selection. In cattle, the original works of VanRaden et al. [11] and Fritz et al. [12] were based on the identification of haplotypes for which homozygous carrier animals are absent or show a more significant homozygous haplotype deficiency (HHD) than expected. Their strategy used phased 50 k SNP genotypes from trios (offspring, sire, dam or maternal grand-sire), and the search for statistically significant HHD based on sliding windows of 20 to 100 SNPs. The underlying hypothesis is based on the linkage disequilibrium between these haplotypes and deleterious recessive mutations located nearby. This reverse genetic screen strategy has led to the identification of HHD regions that harbor 14 causal mutations in seven dairy cattle breeds. Among these, 11 HHD are associated with embryonic lethal mutations in Holstein [11,12,13,14,15,16,17,18], Jersey [19], Fleckvieh [20], Montbéliarde [12, 21], and Normande [22], and three are associated with juvenile mortality in Ayshire [23], Brown Swiss [24], and Fleckvieh [20]. With the recent increased use of genomic selection, the accumulation of genotyping data has enabled the identification of recessive lethal mutations by reverse genetic screening in other species such as pig and chicken [25, 26]. However, to date there are no such studies in sheep.

Compared to cattle, the management of genetic diversity in dairy sheep takes advantage of their more local selection and breed management and of the use of a wider range of rams to produce fresh semen during a short reproductive campaign (May to August) [27]. For example, the efficiency of the management of genetic diversity in Lacaune dairy sheep was explained by an effective population size of 336 [28]. However, since the implementation of genomic selection in 2015, the number of Lacaune rams that enter the AI program was reduced to balance the cost of genotyping [27]. Thus, the widespread use of a limited number of AI rams could favor the emergence of recessive alleles and possibly embryonic or fetal lethal mutations that affect fertility.

In order to discover such mutations, a reverse genetic screen method was applied to the large genome-wide SNP dataset available from a genomic selection program in Lacaune dairy sheep. The specific objectives of this study were to identify haplotypes with a deficit of homozygous animals, to test the hypothesis of a negative impact of these haplotypes on fertility traits in the case of at-risk matings, to test their putative pleiotropic effects on milk production traits, and to propose candidate genes that could harbor the causal mutations.


Animal and genotyping data

The genotyped dairy Lacaune animals [n = 19,102 born between 1996 and 2019 (see Additional file 1: Figure S1)] were obtained from the selection schemes of two breeding companies, OVITEST (Saint-Léon, France) and the Confédération Générale de Roquefort (Millau, France). Table 1 lists the details on all the animals used in the study (mainly rams) that were genotyped either on a medium-density (MD) SNP chip (Illumina Ovine SNP50 BeadChip, 54,241 SNPs, n = 12,600 genotyped animals between 1 and 12 months of age, born since 1996) or a low-density (LD) SNP chip (SheepLD v.1, 15,000 SNPs, n = 6502 genotyped animals between 1 and 5 months of age, born since 2017).

Table 1 Description of genotyped animals

Both SNP chips (LD and MD) purchased from Illumina Inc. (San Diego, USA) were used to genotype the animals at Labogena ( or Aveyron Labo ( genotyping facilities. Genotype data were obtained within the framework of different research programs before 2015, and subsequently from the ongoing Lacaune dairy sheep genomic selection program [29]. The pedigree of the genotyped animals was extracted from the official French livestock data system (Systèmes Nationaux d’Information Génétique, France Génétique Elevage, Paris, France).

Genotype quality control, imputation and phasing

The quality control of each SNP was based on three criteria: (i) a call frequency higher than 97% (% of genotyped animals for each SNP), (ii) a minor allele frequency higher than 1%, and (iii) accordance with the Hardy–Weinberg equilibrium \(({\text{P}} > 10^{ - 5} )\). LD to MD genotype imputation and phasing of all genotypes were implemented with the FImpute v2.2 software [30]; the LD and MD chips had 11,342 common SNPs. The accuracy of LD to MD imputation of Lacaune genotypes was previously assessed and showed a concordance rate per animal of 99.05%, a concordance rate per SNP of 99.12%, and a squared Pearson’s correlation coefficient of 94.95% between imputed and observed SNP genotypes [31]. For subsequent identification of HHD, 38,696 SNPs on the 26 autosomal sheep chromosomes were available and mapped to the Ovis aries genome assembly Oar_v2.0 ([32], GigaDB, Oar_v2.0 coordinates are available at, the version used for the current genomic evaluation.

Detection of homozygous haplotype deficiency

Based on phased MD genotype data, the ovine genome was scanned using a sliding window approach of 20 consecutive SNPs to identify HHD on the 26 sheep autosomes by comparing the observed and expected number of homozygous animals using the method developed by Fritz et al. [12] in French dairy cattle, and adapted to ovine data as described below.

For each window of 20 consecutive SNPs, the frequencies of all observed haplotypes were calculated from the maternal phase, which is associated with a greater diversity of haplotypes. Only haplotypes with a frequency higher than 1% were selected. The choice of a sliding window of 20 consecutive SNPs, representing approximatively 1.0 to 1.5 Mb (50 k SNP chip with an informative SNP every 60 kb, 3 Gb genome), and of a haplotype frequency higher than 1% was based on a previous simulation to estimate the frequency of recessive lethal mutations in breeding populations [2]. With an effective population size ranging from 100 to 500, as in Lacaune sheep (Ne = 336 [28]), the frequency of recessive lethal mutations is expected to range from 1 to 3%.

For each selected haplotype \(k\), the number of observed homozygous animals (\(N_{{{\text{Obs}}}} \left( k \right)\)) was compared to the expected number of homozygous animals (\(N_{{{\text{Exp}}}} \left( k \right)\)). The number of expected homozygous animals \(N_{Exp} \left( k \right)\) was estimated using the within-trio transmission probability with the formula described in Fritz et al. [12]. Two types of trios were considered, 38 progeny-sire-dam trios (transmission probability of haplotype \(k\) is estimated based on the genotyped sire and dam) and 15,530 progeny-sire-maternal grandsire trios (transmission probability of haplotype \(k\) is estimated based on the genotyped sire and maternal grandsire).

The probability of observing “\(q\)” homozygotes with an expectation “\(lambda\)” was estimated using the Poisson distribution and calculated with the ppois function \(\left( {ppois\left( {q = N_{Obs} \left( k \right),lambda = N_{Exp} \left( k \right)} \right)} \right)\) in the RStudio software (Version 1.1.456), as previously described in Mesbab-Uddin et al. [22]. Each \(k\) haplotype was assumed to be significantly deficient in homozygotes when the P-value was lower than 1.9 × 10−4. This threshold was obtained by a Bonferroni correction for multiple testing at a 5‰ level of significance assuming that the number of independent tests was equal to the number of chromosomes (n = 26). Among the significant haplotypes, only those with a severe deficiency that ranged from 75 to 100%, were retained as HHD \(\left( {N_{Exp} \left( k \right) - N_{Obs} \left( k \right))/N_{Exp} \left( k \right) \ge 0.75} \right).\)

When the significant HHD of 20 SNPs were consecutive (i.e., shifted from the previous one by a single SNP) and showed the same minimum number of homozygous animals, they were clustered together to define a larger haplotype (all these primary HHD are in total linkage disequilibrium with each other), which we refer to as ‘Lacaune deficient homozygous haplotype’ (LDHH) (see Additional file 2: Tables S1 and S2). The homozygous, heterozygous, and non-carrier status of each haplotype constituting an LDHH region was then determined for each animal in the studied population (n = 19,102).

Linkage disequilibrium was estimated between two LDHH regions on the same chromosome by the r2 coefficient measure that was introduced by Hill and Robertson [33]. For each LDHH region, a bi-allelic locus was defined as allele 1, i.e., the detected LDHH showing a deficit in homozygotes, and as allele 2, i.e., all other haplotypes identified in the same region. The coordinates of the SNPs included in each haplotype were obtained for the ovine genome assembly Oar_v2.0 and were repositioned on the genome assembly Oar_v3.1 [32] (available from GenBank, GCA_000298735.1) for further genetic analyses.

Analysis of fertility and dairy production traits

Analysis of fertility traits

Trait records of Lacaune matings between 2006 and 2018 were obtained from the national database. We studied a first set of two fertility traits, i.e. artificial insemination success (AIS) and stillbirth rate (SBR). Among all the records, we focused only on matings between ewes with a genotyped sire and a genotyped ram, the sire and ram both having a known status at each LDHH (n = 1,155,835 matings). AIS was coded “1” for success and “0” for failure based on lambing date according to the gestation length starting from the day of AI (147 ± 5 days). SBR was determined only in the AI success group, and coded “1” if there was at least one stillbirth in the litter or “0” if all lambs were born alive (n = 804,577 matings). Four different types of mating are possible for each LDHH: (i) non-carrier ram × ewe from a non-carrier sire, (ii) non-carrier ram × ewe from a carrier sire, (iii) carrier ram × ewe from a non-carrier sire, and (iv) carrier ram × ewe from a carrier sire. Mating type (iv) was considered as at-risk mating, and the cluster of mating types (i), (ii) and (iii) were considered as safe mating. A logistic threshold binary model with a logit link function was used to compare AIS and SBR between at-risk and safe matings, using the GLIMMIX procedure in the SAS software (version 9.4; SAS Institute Inc., Cary, NC). The model used is \({\mathbf{y}} = {\mathbf{X}{\varvec{\upbeta}}} + {\mathbf{Z}\varvec{\upgamma }} + {\mathbf{e}}\), where \({\mathbf{y}}\) is a vector of “0” or “1” coding for AIS or SBR by considering the corresponding observations \(Z\) (Z = {0,1} for AIS, Z = {0,1,2+} for SBR) and a threshold for each variable, so that \({\mathbf{y}} = 1 \;if\; Z \ge 1 \;or\; {\mathbf{y}} = 0 \;{\text{otherwise}}\); \({\mathbf{X}}\) is the incidence matrix of fixed effects; \({{\varvec{\upbeta}}}\) is a vector of fixed effects; \({\mathbf{Z}}\) is an incidence matrix of random effects; \({{\varvec{\upgamma}}}\) is a vector of random effects, and \({\mathbf{e}}\) is a vector of residual error effects. The fixed effects for AIS and SBR were mating type (safe or at-risk), month of AI (March to September), and lactation number (L1, L2, L3 and L4+). For SBR only, prolificacy of the ewe (1, 2, 3+ lambs/litter) was added as a fixed effect. For AIS and SBR, the random effect was interaction herd × year (n = 470 herds between 2006 and 2018). Traits were considered to differ significantly when the fixed effect mating type had a P-value lower than 6.3 × 10–4. This threshold was obtained by Bonferroni correction for multiple testing at a 1% level of significance by correcting the number of independent tests assumed to be the number of significant LDHH regions multiplied by the two independent fertility traits studied.

Analysis of milk parameters

Daughter yield deviations (DYD) for milk parameters from genotyped sires with known status at each LDHH were computed from genetic evaluations (GenEval, Jouy-en-Josas, France). The DYD corresponds to the average performance of the daughters of each sire, corrected for environmental effects and the average genetic value of the mothers [29]. The six parameters studied were milk yield (MY), fat (FC) and protein (PC) contents, fat (FY = MY × FC) and protein (PY = MY × PC) yields, and lactation somatic cell score (LSCS). LSCS corresponds to the average SCS per lactation, i.e. the log-transformation of test-day somatic cell count (SCC) defined by \(SCS = log_{2} \left( {\frac{SCC}{{10000}}} \right) + 3\) [34,35,36]. To compare all the traits on the same scale, each DYD was divided by its genetic standard deviation and referred to as standardized DYD (sDYD). Only genotyped rams with records from at least 20 daughters were included in the analysis in order to obtain sufficiently accurate DYD values (n ~ 5400 rams). Each trait was tested by variance analysis comparing LDHH carrier and non-carrier rams using the GLM procedure in the SAS software (version 9.4; SAS Institute Inc., Cary, NC). The fixed effect model is \({\mathbf{y}} = {\mathbf{X }\varvec{\upbeta}} + {\mathbf{e}}\), where \({\mathbf{y}}\) is a vector of sDYD data for each trait; \({\mathbf{X}}\) is an incidence matrix of fixed effects; \({{\varvec{\upbeta}}}\) is a vector of the fixed effects and \({\mathbf{e}}\) is a vector of residual error effects. The fixed effects are the genetic status (carrier, non-carrier) and year of birth (2000 to 2016) to correct for annual genetic gain. Traits differ significantly between carrier and non-carrier rams when the effect of the genetic status is significant, i.e. with a P-value lower than 3.1 × 10–4. This threshold was obtained by Bonferroni correction for multiple testing at a 1% level of significance by correcting the number of independent tests assumed to be the number of significant LDHH regions multiplied by the four studied traits (MY, FC, PC and LSCS).

Identification of positional and functional candidate genes

The coordinates of each LDHH region were obtained from the ovine genome assembly Oar_v3.1. and extended 1 Mb upstream and 1 Mb downstream to obtain gene annotations from the Ensembl release 99 using the Biomart tool [37] (accessed on 20/03/2020; see Additional file 3: Table S3). Annotations and genome organization within these regions were found to be the same as those of the most recent ovine genome assembly Rambouillet v1.0 (NCBI Ovis aries Annotation Release 103, 2019-02-06, GCF_002742125.1).

Gene information (gene description, biological process, and molecular function) was extracted from several databases (NCBI Entrez:; MGI:; IMPC: The list of identified genes was then sorted to identify the most relevant candidate genes according to (i) their known implication in lethal phenotypes in knockout/loss-of-function mouse models based on viability information in the IMPC database and mortality/aging (embryonic, prenatal, perinatal, neonatal, postnatal, preweaning, premature death) information in the MGI database, and (ii) their association with abortion/death/autosomal recessive disorders from Online Mendelian Inheritance in Man (OMIM) ( and OMIA: Online Mendelian Inheritance in Animal (


Identification of HHD in Lacaune dairy sheep

By screening the genome of 16,346 animals (belonging to trios) from real/imputed 50 k genotyping data, we detected 266 highly significant HHD of 20 consecutive SNPs, each with a frequency higher than 1% and a deficit of homozygous animals higher than 75%. The location of these haplotypes along the ovine genome is shown as a Manhattan plot (see Additional file 4: Figure S2). As explained in “Methods” section, when significant HHD of 20 SNPs were consecutive (shifted by a single SNP) and showed the same minimum number of homozygous animals, they were clustered to define 11 larger haplotypes containing 23 to 48 SNPs and named LDHH (Table 2).

Table 2 List of Lacaune deficient homozygous haplotypes

Among these haplotypes, five LDHH presented a complete deficit of observed homozygous animals (LDHH1 to 5) and six LDHH presented a partial deficit ranging from 79 to 96% of the expected number of homozygous animals (LDHH6 to 11). The length of the identified haplotypes ranged from 1.2 to 3.0 Mb on the ovine genome v3.1. LDHH3, 4, 5, 6 and 11 are located on Ovis aries (OAR) chromosome 3. Only LDHH4 (24 SNPs) and LDHH5 (29 SNPs) are in high linkage disequilibrium (71%). Although they share nine SNPs at their ends, LDHH4 and LDHH5 were not originally clustered together since some consecutive 20-SNP HHD between the two LDHH did not follow our clustering rule (they were neither significant nor had the same minimum number of homozygous animals). LDHH8 is located on OAR18, 3.7 Mb from LDHH9 and 5.4 Mb from LDHH10, and shows a moderate linkage disequilibrium of 55% and 40% with these two haplotypes, respectively. Likewise, LDHH9 (37 SNPs) and LDHH10 (26 SNPs) are in high linkage disequilibrium (72%), share 11 SNPs, but were not clustered together for the same reason as LDHH4 and 5. Other LDHH are located on OAR4 (LDHH1), OAR13 (LDHH2) and OAR17 (LDHH7). Consequently, the 11 LDHH identified are most probably associated with only eight independent causal mutations in the eight following genomic regions on OAR3 (LDHH3, 32.0–34.9 Mb; LDHH11, 128.9–131.1 Mb; LDHH4–5, 131.1–133.9 Mb and LDHH6, 136.2–137.4 Mb), OAR4 (LDHH1, 43.4–46.3 Mb), OAR13 (LDHH2, 44.8–46.8 Mb), OAR17 (LDHH7, 0–1.6 Mb) and OAR18 (LDHH8–9–10, 25.7–34.6 Mb). When calculated based on the whole genotyped population, the frequency of carriers ranged from 3.7 to 6.7% for LDHH with a total deficit of homozygotes, and from 4.4 to 12.1% for LDHH with a partial deficit.

Impact of LDHH on the success rate of AI and on stillbirth rate

To check for a putative effect of the 11 LDHH on embryonic, fetal and/or perinatal lethality in the dairy Lacaune population, two fertility-associated traits that could reflect the consequences of these mutations were analyzed: AI success (AIS: 1,155,835 matings) and stillbirth rate (SBR: 804,577 matings) (Fig. 1).

Fig. 1

Effects of LDHH on the success rate of artificial insemination and on the stillbirth rate in at-risk matings compared to safe matings. AIS artificial insemination success, SBR stillbirth rate. For each LDHH, the frequency of at-risk mating is shown in parentheses. Significant effects are indicated by the corrected P-value for multiple tests with a threshold set at α = 1%: *P < 6.3 × 10−4; **P < 6.3 × 10−5 and ***P < 6.3 × 10−6

AIS could be a good proxy for embryonic losses during the first weeks after AI. In our study population, the average AIS was 69.6%. Interestingly, at-risk matings for four LDHH (1, 2, 8 and 9) had a significant negative effect on AIS leading to a decreased success rate of 2.2% for LDHH1 (P = 3.2×10−6) up to 3.2% for LDHH2 (P = 2.6×10−9).

In parallel, evaluating the stillbirth rate could be a useful way to identify mutations that cause perinatal lethality. In our study population, 5.1% of litters included at least one stillbirth. Five significant LDHH (3, 6, 8, 9 and 10) were associated with a 1.1% increase in SBR for LDHH6 (P = 5.9×10−7) up to 3.4% for LDHH9 (P = 8.5×10−12) in at-risk compared to safe matings.

Pleiotropic effects of LDHH on milk production traits

For the 11 LDHH regions detected in dairy Lacaune, we tested whether a putative selective advantage existed at the heterozygous state for six dairy traits routinely included in genomic evaluations. sDYD of five milk production traits (milk, fat and protein yields, and fat and protein contents) and lactation somatic cell score (a proxy for mastitis) were compared between carrier and non-carrier rams of each LDHH (n ~ 5400 genotyped rams with DYD). Figure 2 shows the relative differences in sDYD calculated between heterozygous and non-carrier rams. For all milk production traits, positive values indicate an improvement of the selected traits, whereas for the lactation somatic cell score, a positive value indicates a deterioration of udder health in the progeny of heterozygous rams.

Fig. 2

sDYD relative difference between heterozygous and non-carrier rams for 6 selected traits. MY milk yield, FY fat yield, PY protein yield, FC fat content, PC protein content, LSCS Lactation somatic cell score, sDYD standardized daughter yield deviation (DYD divided by genetic standard deviation). Significant effects are indicated by the corrected P-value for multiple tests with a threshold set at α = 1%: *P < 3.1 × 10−4; **P < 3.1 × 10−5 and ***P < 3.1 × 10−6. Error bars indicate standard errors. Significant favorable effects of heterozygous are in green and significant unfavorable effects are in red

Among the 11 haplotypes, seven had a significant effect on sDYD for at least one trait. Daughters of LDHH1 carrier rams had a higher protein content (sDYD + 0.11), whereas those of LDHH3 carriers had a significantly lower protein yield (sDYD − 0.12) and protein content (sDYD − 0.13). Moreover, LDHH3 was significantly associated with a lower somatic cell score in milk (sDYD − 0.10). Both LDHH4 and LDHH5 carriers showed a significant increase in milk production (sDYD + 0.22 and + 0.21, respectively) and in protein yield (sDYD + 0.14 and + 0.13, respectively) but had negative effects on fat (sDYD − 0.19 and − 0.21, respectively) and protein content (sDYD − 0.20 in both). LDHH7 and LDHH9 carriers showed a higher sDYD for milk production with an increase of + 0.17 and + 0.11, respectively. LDHH11 was also associated with a + 0.12 increase in sDYD for milk production, and a wider effect spectrum, a positive effect on fat and protein yields sDYD (with + 0.09 and + 0.14, respectively), but a negative effect on LSCS (+ 0.12) and fat content sDYD (− 0.12).

Candidate genes from LDHH regions

Among the 340 protein coding genes identified in the 11 LDHH regions extended by 1 Mb on each side (see Additional file 3: Table S3), 59 are associated with lethality when invalidated in mice (all homozygous mutant alleles lead to lethality, complete penetrance) and 40 show sub-viable phenotypes (higher deficit of homozygous mutant animals than expected according to Mendelian inheritance, incomplete penetrance). Among these genes, several are listed in the OMIM and OMIA databases as being associated with mammalian autosomal recessive disorders (Table 3). Most of these 99 genes are involved in nucleic acid synthesis/DNA replication/transcription (PCNA, CAD, HOXC13, and ARNT2), or encode structural or signaling proteins (MAGI2, RELN, PRNP, FERMT1, IFT172, CEP83, KRT8, WNT1, CCDC65, and TLL1) or are involved in basal metabolic processes (PMPCB, IDI1, POMC, HADH-A/B, EIF2B4, SCN8A, PFKM, FAH, MPI, CYP1A2, STRA6, and CRADD). Knock-out mouse models of these genes have mainly revealed defects that affect offspring soon after birth (perinatal to weaning stages) and jeopardize the survival of the young.

Table 3 Most probable candidate genes affecting viability in mouse knockout models and associated with mammalian autosomal recessive disorders


We identified eight independent genomic regions (named LDHH) in Lacaune dairy sheep showing complete or partial deficit of homozygous haplotypes, which suggests that several independent recessive deleterious mutations segregate in this population. To successfully identify HHD in dairy sheep, we used the same criteria as those used in dairy cattle, i.e. sliding windows of 20 consecutive SNPs and considered significant haplotypes with a frequency higher than 1% [12]. Using about 20,000 related animals genotyped with the 50 k SNP chip, the number of HHD identified and the carrier frequencies (between 4 and 12%) in dairy sheep are in line with the estimations reported in cattle [17]. Detection of HHD segregating at a lower frequency would have required more animals, but this will be possible in the near future thanks to the ongoing accumulation of genotyping data. Even if the length of these haplotypes depends on population structure, genome size, or the density of the SNP chip used, the length (from 1.2 to 3.0 Mb) of the LDHH identified here is in line with previously reported observations in other livestock with haplotypes spanning regions of less than 5 Mb [1]. One notable difference between our study on sheep and the literature on cattle is that less than 1% of the genotyped Lacaune animals have their two parents genotyped. Indeed, in Holstein and Normande cattle for which HHD were identified, this number reaches 30%. However, in sheep, dams are not routinely genotyped for genomic selection programs, which reduces the HHD detection power of our analysis since, in the trios included here, most of the dam genotypes are estimated by the status of the maternal grandsire and the frequency of the haplotype in the population.

In our population, we estimated that the frequencies of LDHH carriers ranged from 3.7 to 12.1%, which is in line with those of previous studies on cattle and pig (recently reviewed in Georges et al. [1]). Considering that these haplotypes are associated with deleterious mutations in the homozygous state, such frequencies could be considered as being quite high compared to the estimated frequency in humans, which is less than 1% [2]. In livestock populations with a small effective size (Ne = 336 in dairy Lacaune [28]), allele frequencies fluctuate randomly from one generation to the next due to small sampling of reproducers. Thus, the frequency of a deleterious recessive mutation can increase sharply by genetic drift from an initial value close to 0 up to several percent, as we observed in Lacaune (from 2 to 6%), due to the spread of very influential carriers and their progeny [5]. The Lacaune sheep population has a complex history with the creation of two lines, one for meat and one for dairy purposes, and four independent selection schemes. Based on this particular population structure, further simulation studies would be useful to estimate the effect of drift on the frequency of LDHH carriers. Apart from genetic drift, maintaining or increasing the frequency of deleterious alleles in a livestock population can be accomplished by balancing selection when deleterious alleles provide a heterozygous advantage on selected traits [5]. Several examples of homozygous deleterious mutations have been reported with a heterozygous advantage in livestock [38], such as the 660-kb deletion (spanning four genes) identified in Nordic Red cattle that leads to higher milk production in heterozygous carriers [7], the 2-pb deletion in the MRC2 gene leading to the Crooked Tail syndrome in Belgian Blue cattle, but enhances muscular development in heterozygous carriers [39], or the 212-kb deletion affecting the BBS9 and BMPER genes associated with higher growth rates in pig [40]. In the current study, seven of the 11 LDHH (i.e. six of the eight independent regions) are associated with positive effects on at least one milk trait (milk, protein and fat yields, protein and fat contents and LSCS) in the heterozygous state. The observed difference between carrier and non-carrier rams corresponds mainly to 1 or 2 years of genetic gain for the given traits [34]. For example, the observed difference in milk yield corresponds to + 4.0 L up to + 8.1 L for LDHH9 and LDHH4 carriers, respectively. These results are relatively similar to the effect observed for lethal mutations detected in US dairy cattle [6]. Surprisingly, LDHH6 which is the most frequent haplotype in our study (carrier frequency of 12.1%) had no pleiotropic effect on milk traits, which was also the case of the less frequent LDHH2, LDHH8 and LDHH10. Apart from the hypotheses of genetic drift or association with selected traits, the high frequency observed for LDHH6 could be explained by balancing selection of other traits that are obviously not implemented in the selection scheme, such as morphological phenotypes that match breed criteria (stature, fleece type and color, hornless).

Based on linkage disequilibrium, the 11 LDHH identified delimit only eight independent genomic regions probably associated with eight causal mutations. Within these regions, we searched for candidate genes that may be affected by these mutations in accordance with the characteristics of the genotyped population, the deficit of homozygous animals and the observed impact on AIS and SBR (assuming lethality from the embryonic to the juvenile stage). We reviewed mouse knockout models and report genetic disorders that are listed in human and animal databases (Table 3). Knowledge based on bovine studies shows that recessive lethal alleles are caused by loss-of-function mutations (stop-gain, frameshift, missense, splice site and deletion) affecting genes mainly involved in cell division, DNA replication, transcription, RNA processing, or coding for structural proteins or in essential metabolic processes [2, 41]. Based on AIS and SBR analyses, we were able to classify LDHH into four patterns, each with a hypothesized associated effect. The first pattern groups LDHH1 on OAR4 and LDHH2 regions on OAR13. Both haplotypes have a significant impact on AIS that we associate with early embryonic loss. Accordingly, we found that PMPCB and PCNA are strong candidate genes in the LDHH1 and LDHH2 regions, respectively. Indeed, homozygous invalidation of both genes in mouse causes embryonic lethality around the implantation stage [MGI:1920328, MGI:97503]. The second pattern corresponds to LDHH that have an impact on both AIS and SBR and that we associate with embryo/fetal loss throughout the gestation period, i.e. LDHH8, 9 and 10 on OAR18. In this case, we suspect a unique lethal mutation that affects one of the four following candidate genes FAH, ARNT2, MPI and STRA6, which are reported to be homozygous lethal in knock-out mouse models (Fah [MGI:95482], Arnt2 [MGI:107188], Mpi [MGI:97075] [42] and Stra6 [MGI:107742]). The third pattern groups LDHH3 and LDHH6 on OAR3 and affects only SBR. We assume that these LDHH harbor lethal mutations with effects that occur at the end of gestation and/or very soon after birth. Within the LDHH3 region in complete homozygous deficiency, the CAD gene is the most obvious candidate. Indeed, invalidation of the Cad gene in mouse causes preweaning lethality with complete penetrance [MGI:1916969], and a missense mutation in CAD (g.72399397T>C; p.Tyr452Cys) associated with the NH7 homozygous deficient haplotype [OMIA 002201-9913] causes late abortion during the last months of gestation in Normande cattle [22]. In the LDHH6 region, three genes Wnt1, Ccdc65, and Pfkm, are associated with perinatal, neonatal or preweaning lethality in mouse knock-out models [MGI:98953, MGI:2146001, MGI:97548]. Among these, a mutation in CCDC65 encoding a nexin-dynein regulatory complex, is reported to cause “ciliary dyskinesia, primary 27” [CILD27, OMIM #615504], which is a neonatal respiratory distress syndrome in humans [43]. Interestingly, a respiratory syndrome is also reported for the homozygous deficient Braunvieh haplotype 2 (BH2) [OMIA 001939-9913] associated with perinatal and juvenile mortality in cattle due to a mutation in TUBD1 (tubulin delta 1) that disorganizes the microtubules in airway cilia [24]. Finally, the fourth pattern groups LDHH4–5, LDHH7 and LDHH11 for which we failed to produce evidence for an alteration of fertility traits when comparing at-risk and safe matings. This could be due to the relatively low frequency of at-risk matings in our dataset for LDHH4–5 (0.2%) and LDHH7 (0.1%), which made it impossible to reach statistical significance. Alternatively, these haplotypes could also host mutations that affect lamb survival in a later growth period, as observed for heart malformation in mouse [44] and in humans (“atrial septal defect 6” [ASD6, OMIM #613087]; [45]) associated with the TLL1 gene located in the LDHH7 region. The last hypothesis for these LDHH in the fourth pattern concerns an effect related to morphological defects or breed standards that is possibly counter-selected in selection schemes.

Particular attention should also be focused on OAR3 and the region spanned by LDHH4–5 and LDHH11. These haplotypes are located in the region of the p.R96C mutation (OAR3, g.129722200C>T, genome assembly v3.1) that has been previously detected in the suppressor of cytokine signaling 2 (SOCS2) gene in Lacaune dairy sheep and is associated with sensitivity to mastitis [46]. Interestingly, in our study, these LDHH, and particularly LDHH11 that includes SOCS2, had positive effects on milk production, on fat and protein yields, but negative effects on LSCS and fat content, as already observed for the C variant of the p.R96C mutation. Since 2017, genotypes for OAR3:129722200C>T are available for all animals genotyped on the LD chip when candidate rams are selected to enter the breeding center. Although LDHH4 and LDHH5 are not in linkage disequilibrium with LDHH11, all three are associated with the SOCS2 mutation. Indeed, 69% of the LDHH4, 92% of the LDHH5 and 97% of the LDHH11 heterozygous carriers were C/T for the SOCS2 mutation but only 16%, 18% and 23% of the heterozygotes for the SOCS2 mutation carried one of these haplotypes, respectively (see Additional file 5: Table S4). This could be explained by the fact that the SOCS2 mutation is carried by two different haplotypes, one in strong linkage disequilibrium with LDHH4–5 and the other with LDHH11. Thus, linkage disequilibrium with the SOCS2 mutation may explain the observed effect of LDHH4–5 and LDHH11 on milk traits. However, OAR3:129722200C>T in SOCS2 is not a candidate lethal mutation associated with these haplotypes that could explain their deficit of homozygotes. We hypothesize that a recessive lethal allele located in one of those haplotypes is in partial linkage disequilibrium with the mutant allele of SOCS2. Maintaining frequencies of LDHH4–5 (4%) and LDHH11 (7%) in the Lacaune population would then not be explained by a heterozygous advantage of deleterious mutations but by genetic hitchhiking due to SOCS2 and its association with selected dairy traits [5].


In this study, we demonstrated the possible segregation of recessive lethal alleles in a sheep population by applying a reverse genetic screen approach on a large genotyping dataset and searching for homozygous haplotype deficiency. Among the 11 LDHH genomic regions detected, we hypothesize that at least eight independent recessive mutations cause early embryonic loss, peri/neonatal lethality or severe health defects in young lambs. We identified the most obvious candidate genes that are assumed to be altered by these mutations, and provide strong working hypotheses to identify them by whole-genome sequencing of heterozygous carriers of the different LDHH. Identification of causal mutations, and of the corresponding altered genes, is important to accurately identify the phenotype they control and to improve our knowledge of the fundamental mechanisms underlying the phenotype. As already observed for haplotypes associated with deleterious recessive mutations, particularly in dairy cattle, most of the LDHH evidenced in dairy sheep were associated with decreased fertility, but also had positive pleiotropic effects on milk production. Management of these haplotypes—or of their causal mutations once they are discovered—in the Lacaune dairy sheep selection scheme through reasoned mating of carrier rams and putative carrier ewes could improve overall fertility and lamb viability. Moreover, if these mutations segregate more widely in other ovine populations, the consequences would apply more broadly to sheep breeding in general.

Availability of data and materials

The datasets analyzed during this study are available from the corresponding author on reasonable request.


  1. 1.

    Georges M, Charlier C, Hayes B. Harnessing genomic information for livestock improvement. Nat Rev Genet. 2019;20:135–56.

    Google Scholar 

  2. 2.

    Charlier C, Li W, Harland C, Littlejohn M, Coppieters W, Creagh F, et al. NGS-based reverse genetic screen for common embryonic lethal mutations compromising fertility in livestock. Genome Res. 2016;26:1333–41.

    Google Scholar 

  3. 3.

    MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science. 2012;335:823–8.

    Google Scholar 

  4. 4.

    Danchin-Burge C, Danvy S, Laloë D, Verrier E. Création d’un observatoire de la VARiabilité génétique des RUMinants et des Equidés (VARUME). Innov Agron. 2017;55:235–45.

    Google Scholar 

  5. 5.

    Boichard D, Grohs C, Danchin-Burge C, Capitan A. Genetic defects: definition, origin, transmission and evolution, and mode of action. INRAE Prod Anim. 2016;29:297–306.

    Google Scholar 

  6. 6.

    Cole JB, Null DJ, VanRaden PM. Phenotypic and genetic effects of recessive haplotypes on yield, longevity, and fertility. J Dairy Sci. 2016;99:7274–88.

    Google Scholar 

  7. 7.

    Kadri NK, Sahana G, Charlier C, Iso-Touru T, Guldbrandtsen B, Karim L, et al. A 660-Kb deletion with antagonistic effects on fertility and milk production segregates at high frequency in Nordic Red cattle: additional evidence for the common occurrence of balancing selection in livestock. PLoS Genet. 2014;10:e1004049.

    Google Scholar 

  8. 8.

    Hozé C, Fritz S, Baur A, Grohs C, Danchin-Burge C, Boichard D. Accounting for genetic conditions in breeding objectives in dairy cattle. In: Proceedings of the 24ème Rencontres Autour des Recherches sur les Ruminants: 5–6 December, Paris; 2018.

  9. 9.

    Capitan A, Michot P, Baur A, Saintilan R, Hozé C, Valour D, et al. Genetic tools to improve reproduction traits in dairy cattle. Reprod Fertil Dev. 2015;27:14–21.

    Google Scholar 

  10. 10.

    Charlier C, Coppieters W, Rollin F, Desmecht D, Agerholm JS, Cambisano N, et al. Highly effective SNP-based association mapping and management of recessive defects in livestock. Nat Genet. 2008;40:449–54.

    Google Scholar 

  11. 11.

    VanRaden PM, Olson KM, Null DJ, Hutchison JL. Harmful recessive effects on fertility detected by absence of homozygous haplotypes. J Dairy Sci. 2011;94:6153–61.

    Google Scholar 

  12. 12.

    Fritz S, Capitan A, Djari A, Rodriguez SC, Barbat A, Baur A, et al. Detection of haplotypes associated with prenatal death in dairy cattle and identification of deleterious mutations in GART, SHBG and SLC37A2. PLoS One. 2013;8:e65550.

    Google Scholar 

  13. 13.

    Daetwyler HD, Capitan A, Pausch H, Stothard P, van Binsbergen R, Brøndum RF, et al. Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nat Genet. 2014;46:858–65.

    Google Scholar 

  14. 14.

    McClure MC, Bickhart D, Null D, VanRaden P, Xu L, Wiggans G, et al. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3. PLoS ONE. 2014;9:e92769.

    Google Scholar 

  15. 15.

    Adams HA, Sonstegard TS, VanRaden PM, Null DJ, Van Tassell CP, Larkin DM, et al. Identification of a nonsense mutation in APAF1 that is likely causal for a decrease in reproductive efficiency in Holstein dairy cattle. J Dairy Sci. 2016;99:6693–701.

    Google Scholar 

  16. 16.

    Schütz E, Wehrhahn C, Wanjek M, Bortfeld R, Wemheuer WE, Beck J, et al. The Holstein Friesian lethal haplotype 5 (HH5) results from a complete deletion of TBF1M and cholesterol deficiency (CDH) from an ERV-(LTR) insertion into the coding region of APOB. PLoS One. 2016;11:e0154602.

    Google Scholar 

  17. 17.

    Hozé C, Escouflaire C, Mesbah-Uddin M, Barbat A, Boussaha M, Deloche MC, et al. Short communication: a splice site mutation in CENPU is associated with recessive embryonic lethality in Holstein cattle. J Dairy Sci. 2020;103:607–12.

    Google Scholar 

  18. 18.

    Fritz S, Hozé C, Rebours E, Barbat A, Bizard M, Chamberlain A, et al. An initiator codon mutation in SDE2 causes recessive embryonic lethality in Holstein cattle. J Dairy Sci. 2018;101:6220–31.

    Google Scholar 

  19. 19.

    Sonstegard TS, Cole JB, VanRaden PM, Van Tassell CP, Null DJ, Schroeder SG, et al. Identification of a nonsense mutation in CWC15 associated with decreased reproductive efficiency in Jersey cattle. PLoS One. 2013;8:54872.

    Google Scholar 

  20. 20.

    Pausch H, Schwarzenbacher H, Burgstaller J, Flisikowski K, Wurmser C, Jansen S, et al. Homozygous haplotype deficiency reveals deleterious mutations compromising reproductive and rearing success in cattle. BMC Genomics. 2015;16:312.

    Google Scholar 

  21. 21.

    Michot P, Fritz S, Barbat A, Boussaha M, Deloche M-C, Grohs C, et al. A missense mutation in PFAS (phosphoribosylformylglycinamidine synthase) is likely causal for embryonic lethality associated with the MH1 haplotype in Montbéliarde dairy cattle. J Dairy Sci. 2017;100:8176–87.

    Google Scholar 

  22. 22.

    Mesbah-Uddin M, Hozé C, Michot P, Barbat A, Lefebvre R, Boussaha M, et al. A missense mutation (p.Tyr452Cys) in the CAD gene compromises reproductive success in French Normande cattle. J Dairy Sci. 2019;102:6340–56.

    Google Scholar 

  23. 23.

    Venhoranta H, Pausch H, Flisikowski K, Wurmser C, Taponen J, Rautala H, et al. In frame exon skipping in UBE3B is associated with developmental disorders and increased mortality in cattle. BMC Genomics. 2014;15:890.

    Google Scholar 

  24. 24.

    Schwarzenbacher H, Burgstaller J, Seefried FR, Wurmser C, Hilbe M, Jung S, et al. A missense mutation in TUBD1 is associated with high juvenile mortality in Braunvieh and Fleckvieh cattle. BMC Genomics. 2016;17:400.

    Google Scholar 

  25. 25.

    Derks MFL, Megens H-J, Bosse M, Lopes MS, Harlizius B, Groenen MAM. A systematic survey to identify lethal recessive variation in highly managed pig populations. BMC Genomics. 2017;18:858.

    Google Scholar 

  26. 26.

    Derks MFL, Megens H-J, Bosse M, Visscher J, Peeters K, Bink MCAM, et al. A survey of functional genomic variation in domesticated chickens. Genet Sel Evol. 2018;50:17.

    Google Scholar 

  27. 27.

    Buisson D, Astruc J-M, Barillet F. Overview and perspectives of the management of genetic diversity in French dairy sheep. INRAE Prod Anim. 2018;31:1–12.

    Google Scholar 

  28. 28.

    IDELE. VARUME. 2016. Accessed 2 Apr 2021.

  29. 29.

    Astruc J-M, Baloche G, Buisson D, Labatut J, Lagriffoul G, Larroque H, et al. Genomic selection in French dairy sheep. INRAE Prod Anim. 2016;29:41–55.

    Google Scholar 

  30. 30.

    Sargolzaei M, Chesnais JP, Schenkel FS. A new approach for efficient genotype imputation using information from relatives. BMC Genomics. 2014;15:478.

    Google Scholar 

  31. 31.

    Larroque H, Chassier M, Saintilan R, Astruc J-M. Imputation accuracy from a low density SNP panel in 5 dairy sheep breeds in France. In: Proceedings of the 68th annual meeting of the European association for animal production: 28 August–1 September, Tallinn; 2017.

  32. 32.

    Jiang Y, Xie M, Chen W, Talbot R, Maddox JF, Faraut T, et al. The sheep genome illuminates biology of the rumen and lipid metabolism. Science. 2014;344:1168–73.

    Google Scholar 

  33. 33.

    Hill WG, Robertson A. Linkage disequilibrium in finite populations. Theor Appl Genet. 1968;38:226–31.

    Google Scholar 

  34. 34.

    Barillet F, Lagriffoul G, Marnet P-G, Larroque H, Rupp R, Portes D, et al. Breeding objectives and reasoned strategy of implementation at the population level for French dairy sheep breeds. INRAE Prod Anim. 2016;29:19–40.

    Google Scholar 

  35. 35.

    IDELE. Comment sont évalués génétiquement les ovins laitiers en France. 2016. Accessed 2 Apr 2021.

  36. 36.

    Rupp R, Lagriffoul G, Astruc JM, Barillet F. Genetic parameters for milk somatic cell scores and relationships with production traits in French Lacaune dairy sheep. J Dairy Sci. 2003;86:1476–81.

    Google Scholar 

  37. 37.

    Yates AD, Achuthan P, Akanni W, Allen J, Allen J, Alvarez-Jarreta J, et al. Ensembl 2020. Nucleic Acids Res. 2020;48:D682–8.

    Google Scholar 

  38. 38.

    Hedrick PW. Heterozygote advantage: the effect of artificial selection in livestock and pets. J Hered. 2015;106:141–54.

    Google Scholar 

  39. 39.

    Fasquelle C, Sartelet A, Li W, Dive M, Tamma N, Michaux C, et al. Balancing selection of a frame-shift mutation in the MRC2 gene accounts for the outbreak of the crooked tail syndrome in Belgian Blue cattle. PLoS Genet. 2009;5:e1000666.

    Google Scholar 

  40. 40.

    Derks MFL, Lopes MS, Bosse M, Madsen O, Dibbits B, Harlizius B, et al. Balancing selection on a recessive lethal deletion with pleiotropic effects on two neighboring genes in the porcine genome. PLoS Genet. 2018;14:e1007661.

    Google Scholar 

  41. 41.

    Fritz S, Michot P, Hozé C, Grohs C, Boussaha M, Boichard D, et al. Anticipate the emergence of genetic defects from genomics data. INRAE Prod Anim. 2016;29:339–50.

    Google Scholar 

  42. 42.

    DeRossi C, Bode L, Eklund EA, Zhang F, Davis JA, Westphal V, et al. Ablation of mouse phosphomannose isomerase (Mpi) causes mannose 6-phosphate accumulation, toxicity, and embryonic lethality. J Biol Chem. 2006;281:5916–27.

    Google Scholar 

  43. 43.

    Austin-Tse C, Halbritter J, Zariwala MA, Gilberti RM, Gee HY, Hellman N, et al. Zebrafish ciliopathy screen plus human mutational analysis identifies C21orf59 and CCDC65 defects as causing primary ciliary dyskinesia. Am J Hum Genet. 2013;93:672–86.

    Google Scholar 

  44. 44.

    Clark TG, Conway SJ, Scott IC, Labosky PA, Winnier G, Bundy J, et al. The mammalian tolloid-like 1 gene, Tll1, is necessary for normal septation and positioning of the heart. Development. 1999;126:2631–42.

    Google Scholar 

  45. 45.

    Stańczak P, Witecka J, Szydło A, Gutmajster E, Lisik M, Auguściak-Duma A, et al. Mutations in mammalian tolloid-like 1 gene detected in adult patients with ASD. Eur J Hum Genet. 2009;17:344–51.

    Google Scholar 

  46. 46.

    Rupp R, Senin P, Sarry J, Allain C, Tasca C, Ligat L, et al. A point mutation in suppressor of cytokine signalling 2 (Socs2) increases the susceptibility to inflammation of the mammary gland while associated with higher body weight and size and higher milk production in a sheep model. PLoS Genet. 2015;11:e1005629.

    Google Scholar 

Download references


The authors thank the Phenofinlait, SheepSNPQTL, Roquefort’In, Degeram and GOLD project consortia for genomic selection implementation in French dairy Lacaune. We are grateful to the UPRA Lacaune, CNBL, FGE and Valogène who made the genotyping data available for this study. We thank also Sébastien Fritz (Allice) for helpful discussions.


This project has received funding from the European Union’s Horizon 2020 research and innovation program under the Grant Agreement No. 772787 (SMARTER). MB was supported by a Ph.D. grant for the HOMLET program co-funded by APIS-GENE and Région Occitanie.

Author information




MB performed the analyses, interpreted the results, and drafted the manuscript. CH designed the script to identify homozygous haplotype deficiency, MB and CMR adapted the script in sheep. JMA managed access to genotype and phenotype data. CMR and SF conceived and designed the research. CMR and SF supervised the analyses, helped interpret the results, wrote, reviewed and edited the final manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Stéphane Fabre.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Distribution of genotyped animals. Figure S1 shows the number of genotyped animals according to sex and year of birth.

Additional file 2: Table S1.

Clustering HHD in LDHH regions. Table S1 shows all significant haplotypes of 20 markers (i.e., 266 HHD with frequency > 1%, P-value < 1.9 × 10−4 and deficit ≥ 75%). As described in the “Methods” section, the 266 HHD could be associated with 11 LDHH regions which are shown in green. Table S2. SNPs defining the LDHH regions. Table S2 gives the position of each SNP within LDHH regions according to the sheep reference genome v2.0 and v3.1, and the phased alleles of each deficient haplotype.

Additional file 3: Table S3.

Positional candidate genes within LDHH regions. Table S3 summarizes all the protein coding genes that are located within the 11 LDHH regions extended by 1 Mb on each side. Genomic coordinates refer to the sheep reference genome v3.1. When available, gene information and association with autosomal recessive disorders in mammals are reported for each protein coding gene based on the following databases: NCBI Entrez:; MGI:; IMPC:; OMIM: Online Mendelian Inheritance in Man ( and OMIA: Online Mendelian Inheritance in Animal (

Additional file 4: Figure S2.

Manhattan plot of HHD. Each point represents one haplotype of 20 markers with a frequency > 1% in the maternal phase. The red line represents the P-value threshold (1.9 × 10−4) to consider a haplotype significantly deficient in homozygotes. Only HHD with a deficit in homozygotes ≥ 75% were selected and resulted in the identification of 266 significant HHD (represented by green dots).

Additional file 5: Table S4.

Contingency table between LDHH4, 5 and 11 status and genotypes for the SOCS2 mutation OAR3:129722200C>T. Ram genotypes at both SOCS2 OAR3:129722200C>T and LDHH loci.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Ben Braiek, M., Fabre, S., Hozé, C. et al. Identification of homozygous haplotypes carrying putative recessive lethal mutations that compromise fertility traits in French Lacaune dairy sheep. Genet Sel Evol 53, 41 (2021).

Download citation