Skip to main content

How do introgression events shape the partitioning of diversity among breeds: a case study in sheep



From domestication to the current pattern of differentiation, domestic species have been influenced by reticulate evolution with multiple events of migration, introgression, and isolation; this has resulted in a very large number of breeds. In order to manage these breeds and their genetic diversity, one must know the current genetic structure of the populations and the relationships among these. This paper presents the results of a genetic diversity analysis on an almost exhaustive sample of the sheep breeds reared in France. Molecular characterization was performed with a set of 21 microsatellite markers on a collection of 49 breeds that include five breed types: meat, hardy meat, dairy, high prolificacy and patrimonial breeds.


Values of expected heterozygosity ranged from 0.48 to 0.76 depending on the breed, with specialized meat breeds exhibiting the lowest values. Neighbor-Net, multidimensional analysis or clustering approaches revealed a clear differentiation of the meat breeds compared to the other breed types. Moreover, the group that clustered meat breeds included all the breeds that originated from the United Kingdom (UK) and those that originated from crossbreeding between UK breeds and French local breeds. We also highlighted old genetic introgression events that were related to the diffusion of Merino rams to improve wool production. As a result of these introgression events, especially that regarding the UK breeds, the breeds that were clustered in the ‘meat type cluster’ exhibited the lowest contribution to total diversity. That means that similar allelic combinations could be observed in different breeds of this group.


The genetic differentiation pattern of the sheep breeds reared in France results from a combination of factors, i.e. geographical origin, historic gene flow, and breed use. The Merino influence is weaker than that of UK breeds, which is consistent with how sheep use changed radically at the end of 19th century when wool-producing animals (Merino-like) were replaced by meat-producing breeds. These results are highly relevant to monitor and manage the genetic diversity of sheep and can be used to set priorities in conservation programs when needed.


It is generally accepted that sheep domestication occurred about 11 000 years ago in a region of the Middle East along the Taurus-Zagros arc, probably through several domestication events [1, 2]. Since then, domesticated sheep have spread throughout the world, following human migration roads, and have been selected for different purposes and environments [2, 3]. As a consequence of reticulated evolution (i.e. multiple episodes of migration, introgression, and isolation), the current genetic structure of domesticated sheep is rather complex [4] with 1129 breeds reported by the FAO [5]. With 57 sheep breeds officially recognized, France is an interesting example on how sheep populations are genetically structured. Until the 19th century, local sheep populations were differentiated according to their regions of origin (mainly, the northern part of France, and the Alps, Pyrénées and Massif Central mountains) [6]. Since the end of the 18th century, several introgression events have been very well documented. The first major event was the use of Merino rams from a national flock that is still maintained at the Bergerie Nationale of Rambouillet. It was imported from Spain in 1786 and promoted by the Napoleonian administration to improve the French breeds’ wool quality (e.g. [7]). Then, because at the national level wool demands decreased from the 1860s [8], sheep breeding aimed at improving meat production by two successive episodes of sheep importation from United Kingdom. The first rams imported belonged to the “Longwool” group while the breeds imported in the second episode belonged to the “Down” group. At the end of the 19th century, flock-books were created and since then, selection programs have been implemented in the sheep populations that had been impacted by those introgression events at different levels.

Several studies have investigated the genetic structure of sheep breeds based on molecular markers [4, 913] and showed that sheep breed differentiation depends on geographical origin, Merino introgression and/or breed use. Molecular tools are also useful to investigate conservation issues and the contribution of various populations to genetic diversity at different scales [14, 15]. However, although there are many sheep breeds in France, few molecular-based studies have been carried out to analyze their diversity.

Our aim was to investigate the genetic structure of sheep breeds in France using a near complete sample of the populations that are maintained over the country. For this purpose, 1826 individuals from 49 breeds were genotyped using 21 microsatellite markers, which allowed us to assess the genetic diversity of sheep breeds in France in relation to their history and conservation policy issues.


Sample collection and genotypes

Fifty-one populations belonging to 49 sheep breeds raised in France were sampled (Table 1). These populations belonged to five breed types (Table 1): (i) 15 meat breeds (M), (ii) five dairy breeds (D) among which, the Lacaune dairy breed comprised two subpopulations, (iii) three high prolificacy breeds (P), (iv) 25 hardy meat breeds (H) among which, the Lacaune meat breed comprised two subpopulations, and (v) one patrimonial breed (Pa), i.e. the Mérinos de Rambouillet breed (MeRa). For the sake of clarity, the four subpopulations of the Lacaune breed will be considered hereafter as separate breeds. The geographical coordinates (latitude and longitude) relative to the region of origin were determined for each breed. A total of 1826 individuals were sampled and the number of animals per breed ranged from 12 (Romanov) to 55 (Roussin de la Hague). When pedigree data were available, animals that were as little related as possible were chosen. For the six breeds (BeIl, Land, LaBr, MoNo, RoRo and ThMa) for which there was no pedigree data, animals were sampled from as many different birth flocks and birth periods as possible.

Table 1 Name, sample size and region of origin for the 48 sheep breeds

Twenty one microsatellite markers were used to perform the analysis. Eight of these microsatellites came from the French panel for parentage testing (CSRD0247, HSC, INRA49, McM42, MAF65, McM527, MAF0214, OaRFCB20). The 13 other microsatellites (HUJ616, ILSTS005, ILST011, MAF209, MAF70, OarFCB128, OaRCP34, OaRFCB193, OaRFCB304, OaRJMP29, OarJMP58, SR-CRSP9, BM8125) were chosen from those available in the UE Econogen project [16]. Thirteen of the 21 selected loci were part of the ISAG-FAO recommended microsatellite markers. Amplifications and analyses of all the samples were performed by the same laboratory (Labogena, France), using a capillary sequencer (ABI PRISM 3100 Genetic Analyzer, Applied Biosystems).

Statistical analysis

The presence of null alleles was tested using FreeNA [17] i.e. loci with an estimated frequency of null alleles (r) higher than 0.2 were considered as potentially problematic for calculations [9]. Allele frequencies, numbers of alleles, observed heterozygosity (Ho), non-biased expected heterozygosity (He), effective number of alleles (Ae) and F-statistics [18] were estimated with GENETIX 4.05.2 [19]. GENEPOP 4.07 [20] was used to evaluate departure from Hardy-Weinberg equilibrium and pairwise genic differentiation among breeds [21]. Allelic richness (Ar) was computed with the rarefaction method using FSTAT [22]. Significance levels of the tests were corrected with sequential Bonferroni correction on loci. Potential hierarchical genetic structure was investigated with the AMOVA procedure implemented in ARLEQUIN [23]. Breeds were divided into different groups according to: (i) type (dairy, meat, hardy meat, and high prolificacy; see Table 1) or (ii) original geographic location (Massif Central, South-West, South-East, North-West, Plain from Center/Northern part of France, and the United-Kingdom (UK); see Table 1). The Mérinos de Rambouillet breed (MeRa) was excluded from all the AMOVA analyses because this breed is the unique representative of its type (patrimonial). Romanov (Roov) and Texel (Texe) breeds were also excluded because they were the only representatives of their geographical groups (respectively, Russia and The Netherlands). Significance levels were determined after 16 000 permutations.

The matrix of Reynolds distances (D R [24]) was computed using PHYLIP 3.69 [25] and used to draw a Neighbor-Net [26] network with SPLITSTREE 4.5 [27]. A principal component analysis was also performed using PCAGEN [28]. The significance of the axis was evaluated using permutation tests (1000 randomizations of the genotypes).

Clustering approaches were performed on the 51 breeds using a Bayesian clustering procedure implemented in STRUCTURE [29] with the number of K clusters ranging from 1 to 10 and then equal to 15, 20, 25, 30, 35, 40, 45, 48, 51, and 55. For each value of K, 50 runs were performed with 1 000 000 iterations following a burn-in period of 100 000, under the admixture and correlated allele frequency model. Since consistency across runs seems to be an informative method for assessing species structure across breeds [30, 31], we used CLUMPP [32] to estimate the similarity function G’ over runs for the different values of K, using the LARGEKGREEDY algorithm. We selected a subset of runs that included the run with the highest number of similar runs (symmetric similarity coefficients (SSC) greater than 0.90) grouped with the corresponding similar runs. We used this subset to compute a mean Q-matrix. Breed assignment was performed as in Leroy et al. [31]. Animals were considered as correctly assigned to their breed if they were primarily associated to the cluster that included the largest number of animals belonging to the breed, using results for K = 51. For clusters that comprised two breeds, runs were performed for K = 2 using only the breeds that were associated within the sub-cluster.

The contribution of each breed to the diversity of the whole set of breeds was computed according to the method of Caballero and Toro [33]. Let p ki be the average frequency of allele k in breed i, then, the average coancestry between breeds i and j is:

$$ {f}_{ij}={\varSigma}_k\kern0.5em {p}_{ki}{p}_{kj} $$

When several markers are used, coancestry is averaged over loci. The total genetic diversity (GD T ) is assumed to be the sum of the within-breed genetic diversity (GD WS ) and the between-breed genetic diversity (GD BS ):

$$ G{D}_T= 1\mathit{\hbox{-}}{\varSigma}_i\kern0.5em {\varSigma}_j\kern0.5em {f}_{ij}/{n}^2\kern0.5em , $$
$$ G{D}_{WS}= 1-{\varSigma}_i\kern0.5em {f}_{ii}/n, $$
$$ G{D}_{BS}={\varSigma}_i\kern0.5em {\varSigma}_i\kern0.5em {D}_{ij}/{n}^2 $$

In these equations, n is the number of breeds and D ij is Nei’s minimum distance between breeds i and j. Contribution of a breed to the diversity of the whole set of breeds was computed by the loss or gain of diversity ∆GD when the breed is removed.


Genetic diversity

For the complete dataset of breeds and markers, 357 alleles were identified. The average number of alleles per locus was 17 and ranged from nine (loci OarCP3 and ILSTS011) to 28 (locus UHJ616).

Heterozygosities, mean number of alleles (MNA), effective number of alleles (Ae) and allelic richness (Ar) are in Table 2. He ranged from 0.48 in the Belle-Île breed (BeIl) to 0.76 in the Corse breed (Cors), with a mean value of 0.66 (±0.07). He were significantly higher in hardy meat breeds (P < 0.0001; Wilcoxon-Mann Whitney test) and dairy breeds (P = 0.0005) than in specialized meat breeds, whereas differentiation between hardy meat and dairy breeds was not significant (P = 0.68). Ae ranged from 2.14 (BeIl) to 5.18 (Cors), with a mean value of 3.64 (±0.72). Ar values (computed for breeds with at least 18 individuals genotyped for each locus) varied from 2.96 (MeRa) to 7.72 (Cors), with a mean value of 5.6 (±1.05). F IS per breed ranged from −0.058 (Rava) to 0.117 (LaBr). The larger He, Ae and Ar values obtained for the Cors breed are probably related to a lower selection intensity than for other dairy breeds, linked to an extensive production system. After sequential Bonferroni correction, five breeds showed a significant deficit of heterozygotes for one locus, and one breed carried one locus with an excess of heterozygotes. Only one locus per breed combination out of 1071 was identified with a potential null allele (r > 0.2; data not shown) i.e. the McM42 locus in the LaBr breed. However, excluding this locus had very minor effects on F IS and He (Wilcoxon test; P-value > 0.05), which suggests that null alleles are not the main cause of significant F IS values (data not shown). Thus, we chose to conserve all 21 loci. Implementation of the pair-wise population differentiation test in the software GENEPOP 4.07 [20] showed that all breed pairs were significantly differentiated, including the six breed pairs that included the four Lacaune subpopulations. F IS , F IT and F ST values were equal to −0.001, 0.12 and 0.12, respectively.

Table 2 Summary of genetic diversity measures across the 51 populations

Breed relationships and clustering

The Neighbor-Net network based on D R distance (Fig. 1) formed a star-like pattern, with several clusters. Meat breeds (M) were clustered within two groups that included a few other breeds. Notably, one group included the four meat breeds that originated from the United Kingdom (Suff, Hamp, DoDo and Sout), three French meat breeds (RoHa, MoCh and MoVe), one high prolificacy breed (BeIl) and one hardy meat breed (LaBr). The other group clustered seven French meat breeds (Avra, BlMa, RoOu, Char, IlFr, Cote and BeCh), one breed from The Netherlands (Texe), one hardy meat breed (Boul) and the two other high prolificacy breeds (Roma and Roov).

Fig. 1
figure 1

Neighbor-Net network for the 51 sheep populations, based on Reynold’s D R distance. Brown = dairy breeds; red = hardy meat breeds; yellow = patrimonial breed; green = meat breeds and blue = high-prolificacy breeds

PCA analysis (Fig. 2) displayed a clear differentiation between the meat breeds and the hardy meat and dairy breeds. All meat breeds except BeCh and IlFr were plotted on the left side of the figure, whereas all dairy and hardy meat breeds (except five) clustered within the bottom right quadrant. Only two hardy meat breeds Boul and LaBr were plotted very close to the meat breeds. One high prolificacy breed (BeIl) was plotted close to the meat breeds, whereas the two other high prolificacy breeds (Roma and Roov) were on the other side (top right quadrant) near the BeCh and IlFr breeds. The patrimonial Mérinos de Rambouillet breed (MeRa), was clearly isolated from all other breeds, probably because of its low level of genetic variability i.e. this population has been maintained as a closed flock since around 230 years).

Fig. 2
figure 2

Principal component analysis for the 51 sheep populations. The projection is shown on the first two axes. Population codes are in Table 1. Brown diamonds = dairy breeds; red crosses = hardy meat breeds; yellow triangles = patrimonial breed; green plus signs = meat breeds; blue circles = high-prolificacy breeds. Axis 1: 9.9 % inertia; P-value = 0.001. Axis 2: 8.6 % inertia; P-value = 0.001

Bayesian clustering methods provided complementary information on the genetic relationships among the populations. For these, we used the Q-matrix averaged over the most similar runs (Fig. 3) for K = 2 to 5 and 51 (or overall runs for K = 2 to 10 and 51 [See Additional file 1: Figure S1]) and a combined analysis of the distribution of membership coefficients according to breed and geographical location (location of origin) of these breeds (Fig. 4; K = 2 to 5). Likelihood values (Ln(P(D))) across runs reached a plateau when K was close to 45 [See Additional file 2: Figure S2]. With K = 2, a group that comprised all the breeds from the UK (South, DoDo, Suff, and Hamp), the Netherlands (Texe) and nine western French breeds (MoVe, LaBr, RoHa, BeIl, Avra, BlMa, Cote, RoOu, and Char) and the Mouton Charollais (MoCh) breed was clearly differentiated from a second group that included all the other breeds. As K increased, this first group segregated in two subgroups, one including the UK breeds and MoCh, MoVe, LaBr, RoHa, and BeIl breeds (i.e. SubGroup 1 or SG1) and the other including Texe, Avra, Cote, RoOu, and Char breeds (SG2).

Fig. 3
figure 3

Estimated membership coefficients of each individual in the inferred K cluster, with K = 2 to 5 and K = 51. In brackets, number of runs with similar solutions (SSC > 0.90) that was used to compute the mean Q-matrix

Fig. 4
figure 4

Geographical interpolation of structure results for K = 2 to 5 using the mean Q-matrix over runs with similar solutions. Breeds are distributed according to their location of origin. Brown = dairy breeds; red = hardy meat breeds; yellow = patrimonial breed; green = meat breeds; blue = high-prolificacy breeds. Each pie shows for a given breed the proportions of membership coefficients relative to clusters (see Fig. 3)

From K = 3 to 4, a third subgroup (SG3) that included IlFr, Boul, BeCh, Roma, Roov, and MeRa breeds was separated from the second group. As K increased (3 to 10), the breeds in SG3 clustered together. However the Mérinos de Rambouillet breed (MeRa) and the Île-de-France breed (IlFr) separated at K = 7 and K = 10, respectively [See Additional file 1: Figure S1]. For the other breeds, two subdivisions occurred as K increased to 5 i.e. SG4 and SG5. SG4 included the MeAr, EsLM, and Mour breeds, the Southwestern breeds (MnTN, MnTR, BaBe, Land, Lour, Cast, AuCa, Bare, Limo, and Tara), two Alpine breeds (Griv and ThMa), one Mediterranean breed (Cors) and two breeds from the Massif Central (NoVe and MoNo; although the MoNo breed is now bred in the Southwest area). The last subdivision (SG5) included all the other breeds i.e. from the Massif Central, except for the Préalpes du Sud breed (PrSu, in the Alps) with a differentiation pattern changing as K increased.

For K = 51, 41 breeds were assigned to a private cluster, i.e. they were primarily associated to the cluster that clustered the largest number of animals that belonged to the breed. For the three breeds LaBr, MoNo, and Mour, each one was associated with two clusters that consisted mainly of individuals that belonged to the same breed, and these pairs of clusters were considered as private breed clusters. Four pairs of breeds shared the same cluster BlMa/RoOu, Hamp/Suff, MoCh/Sout, and Roma/Roov, respectively. Each pair of breeds was analyzed individually (data not shown) and, in each case, individuals of the two breeds were assigned in their own private cluster. In contrast, one breed, the Barégeoise breed (Bare), was not assigned to a specific cluster and all Bare individuals except two pairs were assigned to different clusters. Finally, excluding the Barégeoise breed, 90.1 % of the individuals of the 50 remaining breeds were assigned to their putative breed (Table 2), this percentage ranging from 47 % for the Tara breed to 100 % for 19 breeds.

Partition of diversity

In the hierarchical analysis (Table 3), the “within-breed” component explained the largest part of the total genetic variance (88 to 89 %; P < 0.0001), regardless of the hypothetical breed structure tested. Two models of population structure i.e. breed types and geographical origin were investigated. The greatest variation among groups (1.55 % of the total variance; P < 0.0001) was observed with the geographical model compared to the breed type model (1.14 %; P < 0.0001).

Table 3 Hierarchical partitioning of the genetic variance (AMOVA)

Contributions to the genetic diversity are in Table 4. ΔGD WS ranged from −0.002 (Cors) to 0.0034 (BeIl), while ΔGD BS ranged from −0.0038 (MeRa) to 0.0013 (Bare). The largest decrease in total gene diversity (ΔGD T ) was observed when the Corse (Cors; − 0.0012), Landaise (Land; − 0.0012) or Basco-Béarnaise (BaBe; −0.0011) breeds were removed. In contrast, when the Roussin de La Hague (RoHa), Blanc du Massif Central (BlMa) or Belle-Île (BeIl) breeds were removed, diversity increased by 0.0016, 0.0014 or 0.0013, respectively.

Table 4 Contributions of the different breeds to genetic diversity, according to the method of Caballero and Toro [33]


Investigating the genetic structure of sheep breeds that are raised in France, using an approach based on their geographical origin and not the regions where they are currently raised, provides interesting insights into the recent history of sheep breeding in France.

Results from the Bayesian approach clustered the breeds according to geographical origin and to the impact of the successive crossing events (Fig. 4). We showed that two groups SG1 and SG2 were influenced by genetic introgression from UK breeds. SG1 includes breeds of French (BeIl, LaBr, MoCh, MoVe, RoHa) and UK origins (DoDo, Hamp, Suff, Sout) related to Down meat breeds (SG1). In the SG2 group, for some breeds (Avra, BlMa, Char, Cote, RoOu), introgression of former UK Longwool breeds may still have a dominant influence on population clustering, since they cluster with the Texel breed, which has a Dutch origin and is also considered as genetically similar to the Longwool breeds [11]. Results for the breeds that cluster in the SG3 group show that they are related to the extensive use of Merino rams at the end of 18th and beginning of the 19th century (“Merinization”) to improve wool production [6]. SG3 includes Mérinos de Rambouillet (MeRa), which is the breed that was originally used for merinization in France, breeds (IlFr, Boul and BeCh) that were created by crossing MeRa with UK Longwool breeds such as the Dishley breed, and the Romane breed (Roma), which is a recent breed that was created by crossing the Berrichon-du-Cher (BeCh) and Romanov (Roov) breeds, Romanov being also included in the cluster. Three other breeds of Merino origin, i.e. the current Merino hardy meat breeds (MeAr, EsLM) and the Mourerous breed (Mour), are clustered in another group (SG4). All other breeds, including hardy meat or dairy breeds that originate from the south of France, are clustered according to two main geographical origins, namely South West (Pyrénées) of France for cluster SG4 and Massif Central for SG5. Overall, the results from the Bayesian approach are consistent with what is known on the history of introgression events that took place in French sheep populations during the 19th century [6], even if the Merino SG3 group may appear artificial since it aggregates breeds that have been influenced by Merino and Romane (Roma) breeds (see above). Genetic drift and founder effects within the French Romanov breed (Roov) together with the small number of sampled Romanov individuals, may explain why this breed clustered within the SG3 group. Neighbor-Net (Fig. 1) and PCA (Fig. 2) methods provided results that agree with the theoretical expectation since the Romane breed is placed between the two breeds that were crossed to create it. This is a good example to illustrate the need to combine different approaches when analyzing the genetic history of populations with molecular markers.

Independently from the breeds’ specific histories, our analysis also investigated to what extent breed type or geographical origin may account for genetic breed structure. The overall F ST estimate (11.7 %) was consistent with that reported in previous studies on sheep breeds (~13 %; [9, 12]). Here, geographical origin and breed type explained 1.6 % and 1.1 %, respectively, of the total genetic variation, while Lawson-Handley et al. [9] found 1 % and 2.7 %, respectively in an analysis on European sheep breeds. Obviously, these two parameters are not independent from each other (Fig. 4). For instance, meat breeds, which mainly originate from the UK, were used to create the breeds from the northwestern part from France i.e. through strong introgression from Longwool or Down breeds. Specialization for meat types can be related to the socio-economic background of the northwestern part of France (combination of high demands for meat and availability of rich pastures) [6]. In contrast, dairy breeds are from three distinct origins, i.e. western Pyrénées (BaBe, MaTR, MaTN), Corsica (Cors) and the southern part of Massif central (LaLC, LaLO); these last two breeds have a different genetic background from the four previous breeds. Thus, based on these findings, it can be hypothesized that the genetic differentiation of sheep breeds in France results from a combination of geographical origin, historic gene flow, and breed use, in relation to the socio-economic background and the main farm systems that comprise specialized meat types and more intensive production in the northern part, and hardy (meat or dairy) types and more or less extensive farm systems in the southern part of the country [6].

Among the applications of this study for breed management, our results on breed assignment (90.1 % of individuals assigned to their putative breed) confirm that each breed constitutes a rather homogeneous genetic group. Even the four subpopulations from the Lacaune breed (LaLC, LaVG, LaLO, LaVO) that have been subjected to different selection programs for about 40 years, appeared relatively well differentiated (Fig. 3). Many of the detected misassignments were found for breeds with high levels of genetic diversity. For instance the Barégeoise breed (Bare), which is the breed with the third highest He (0.72), could not be assigned to any specific cluster. The relatively recent creation of this breed (beginning of the 20th century) as a sub-population of the Lourdaise breed (Lour), with a flock-book that started only in 1975, and the former use of crossbreeding with several breeds [34] explains partly the high He as well as the high rate of misassignments. When the Barégeoise breed and the breeds that are considered as historically close (AuCa and Lour) or frequently used in crossbreeding (BeCh and Tara) were analyzed independently, they were each assigned to a private cluster, with Barégeoise individuals showing more heterogeneity (data not shown). Using a larger number of markers would probably improve the assignment results and allow to register individuals that lack a known pedigree within a given selection nucleus.

Regarding diversity partitioning, a high correlation was observed between He and various diversity components. This correlation was negative with ΔGD WS (r = −0.99), positive with ΔGD BS (r = 0.87), and negative with ΔGD T (r = −0.62), which indicates that breeds with a high diversity level contribute more to total genetic diversity. It is interesting to note that the three breeds with the largest contribution to total diversity (Cors, Land and BaBe), i.e. with the most negative ΔGD T , showed a high level of heterozygosity, as expected, but also shared a similar genetic background according Neighbor-Net and Structure methods. Two of these breeds are local breeds (Cors and BaBe) and the other (Land) is a rare breed. The five breeds with the lowest contribution to total genetic diversity (i.e. the highest ΔGD T ) also showed a low genetic diversity and were related either to Down (BeIl, RoHa, Cote) or Longwool (BlMa, Cha) genetic groups (Figs. 3 and 4). These breeds were classified either as endangered breeds (Cote, BeIl, BlMa) or local breeds with limited numbers (Char, RoHa). All these breeds (except BeIl for which there is no pedigree information) were studied by Danchin-Burge et al. [35] who considered them as having acceptable levels of genetic variability as estimated from pedigree data. More generally, for all but one (Texe) of the breeds belonging to the subdivisions SG1 and SG2 (hence originating from or related to British breeds, and undergoing conservation programs for six of them: Avra, Bell, Cote, Char, LaBr, RoHa), we could observe a gain of total diversity (ΔGD T  > 0; Table 4) if one of them was removed from the analysis. The most likely interpretation is that if one of these breeds is removed, a similar allelic combination will still exist within the set of the remaining breeds. This result illustrates quite well the fact that in a large dataset, contribution of a given breed to the total diversity depends both on its within diversity and its position within the genetic architecture of the species [31, 36, 37]. Based on our results, our recommendation would be to focus conservation efforts toward the Landaise breed (Land), although contribution to global diversity is only one of the many methods that can be used to prioritize livestock breeds for conservation. As discussed in Fabuel et al. [38] and in Leroy et al. [36], using other approaches can lead to dissimilar results. One advantage of the Caballero and Toro [33] method is that it does not suffer from the computation time limitation of the Weitzman approach for large datasets [39, 40]. It also gives the same weight to within- and between-breeds components of diversity, which can be discussed, based on what component should be emphasized for conservation purpose.

Based on these genetic diversity measures, specific recommendations can be made on the genetic management and conservation of these French breeds.

Most of the breeds that we analyzed are large breeds for which a selection program is ongoing (mainstream breeds; Table 1) and artificial insemination is used. These breeds display a wide range of genetic diversity. From this point of view, the Corsican dairy breed is clearly apart from the other breeds. As an illustration, it is the only French sheep breed with multiple color patterns since it was never submitted to coat color standardization. Lenstra et al. [41] showed that the French Corsican goat breed was more related to Italian breeds than to French breeds. The strong differentiation of the Corsican sheep breed suggests a similar history. Among this group of breeds with selection programs, the three Pyrenean dairy breeds (BaBe, MaTN and MaTR) all have also a high level of genetic diversity as well as displaying genetic distinction. These characteristics strongly support the efforts made by local organizations to promote these breeds through a PDO (protected designation of origin) product, i.e., the Ossau-Iraty cheese. Moreover, although these breeds have been subjected to fairly high selection pressures for milk production, they retain a high level of genetic diversity, probably because their genetic variability was high to start with and they have benefited from good management practices in terms of genetic variability. However, the genetic diversity of Char and the BeCh breeds is limited although the population numbers are fairly large. For both breeds, the use of artificial insemination with limited awareness of their genetic diversity combined with an erosion of the number of animals (BeCh) or a limited gene pool to start with (Char) led to a decrease in genetic diversity. Thus, we recommend that efficient measures aimed at preserving genetic diversity are taken.

Another group of breeds is composed of local breeds, which, compared to those discussed above, have a smaller population size and are part of less efficient or organized selection programs. For most of these breeds, measures of genetic diversity have intermediate values with moderate levels of inbreeding and little genetic originality. From a genetic point of view, our interpretation is that the absence or weakness of selection that is acting in these breeds preserves them. Nevertheless, for the BLMa and RoHa breeds, we recommend a short-term implementation of specific rules to slow down the rate of the loss of genetic variability.

The last group of breeds includes rare breeds. Clearly, the Land breed stands out by its high contribution to the total genetic diversity and our recommendation is to secure the existing conservation program. For instance, there are only five rams stored in the French national Cryobank ( which is not sufficient to preserve this breed’s genetic variability in case of a disease outbreak. The same recommendation is made for the patrimonial breed, MeRa, which has a level of heterozygosity of only 0.50, which is expected for a breed that has been inbred for over 200 years.

However, regardless of our recommendations based on this work, the French Minister of Agriculture does support a global conservation of French sheep breeds. All these breeds are undergoing a selection or a conservation program, and all except MeRa are bred mostly by farmers for production. Therefore, our results rather than being used to prioritize which breeds to protect should be used as a tool to help the breeders to manage their populations.


The genetic structure of French sheep breeds was shaped by reticulate evolution that involved both genetic drift and several introgression events, which correspond to similar patterns identified in Italian sheep breeds [13] or other domestic species [31, 36, 42, 43]. It is generally assumed that introgression events of Merino genetic material have occurred over the whole French sheep population [6, 7]. In comparison, introgression of UK breeds is easier to follow, probably because it is more recent. Moreover, a large part of the Merino flocks was eliminated during the second part of 19th century and the beginning of the 20th century [8], which resulted in a three-fold reduction of the French sheep population.

Conservation approaches could be applied to a larger number of breeds to assess conservation priorities. However, conservation issues cannot be reduced only to the analysis of genetic diversity and within- and between-population contributions. Other considerations, such as genetic structure and admixture patterns [37], or socio-cultural value and specific use of a breed, should be taken into account when making final conservation decisions [44, 45].


  1. Meadows JRS, Cemal I, Karaca O, Gootwine E, Kijas JW. Five ovine mitochondrial lineages identified from sheep breeds of the near East. Genetics. 2007;175:1371–9.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  2. Zeder MA. Domestication and early agriculture in the Mediterranean basin: Origins, diffusion, and impact. Proc Natl Acad Sci U S A. 2008;105:11597–604.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  3. Chessa B, Pereira F, Arnaud F, Amorim A, Goyache F, Mainland I, et al. Revealing the history of Sheep domestication using retrovirus integrations. Science. 2009;324:532–6.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  4. Kijas JW, Lenstra JA, Hayes B, Boitard S, Neto LRP, San Cristobal M, et al. Genome-wide analysis of the world’s sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol. 2012;10, e1001258.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  5. FAO. The state of the world’s animal genetic ressources for food and agriculture. Rischkowsky B and Pilling D editors. Rome: FAO, Commission on Genetic Resources for Food and Agriculture; 2007.

  6. Charlet P, Bougler J. Races ovines. Tech Agric. 1981;5:1–36.

    Google Scholar 

  7. Wood RJ, Orel V. Genetic prehistory in selective breeding: a prelude to Mendel. New York: Oxford University Press; 2001.

    Google Scholar 

  8. Montmeas L. L’élevage ovin en France de 1945 à nos jours : du plan d’encouragement à l’élevage ovin (1946) à la reconquête ovine (2009). Ethnozootechnie. 2011;91:97–104.

    Google Scholar 

  9. Lawson Handley LJ, Byrne K, Santucci F, Townsend S, Taylor M, Bruford MW, et al. Genetic structure of European sheep breeds. Heredity. 2007;99:620–31.

    Article  CAS  PubMed  Google Scholar 

  10. Peter C, Bruford MW, Perez T, Dalamitra S, Hewitt G, Erhardt G. Genetic diversity and subdivision of 57 European and Middle-Eastern sheep breeds. Anim Genet. 2007;38:37–44.

    Article  CAS  PubMed  Google Scholar 

  11. Kijas JW, Townley D, Dalrymple BP, Heaton MP, Maddox JF, McGrath A, et al. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds. PLoS ONE. 2009;4:e4668.

    Article  PubMed Central  PubMed  Google Scholar 

  12. Blackburn HD, Paiva SR, Wildeus S, Getz W, Waldron D, Stobart R, et al. Genetic structure and diversity among sheep breeds in the United States: Identification of the major gene pools. J Anim Sci. 2011;89:2336–48.

    Article  CAS  PubMed  Google Scholar 

  13. Ciani E, Crepaldi P, Nicoloso L, Lasagna E, Sarti FM, Moioli B, et al. Genome-wide analysis of Italian sheep diversity reveals a strong geographic pattern and cryptic relationships between breeds. Anim Genet. 2014;45:256–66.

    Article  CAS  PubMed  Google Scholar 

  14. Glowatzki-Mullis ML, Muntwyler J, Bäumle E, Gaillard C. Genetic diversity of Swiss sheep breeds in the focus of conservation research. J Anim Breed Genet. 2009;126:164–75.

    Article  CAS  PubMed  Google Scholar 

  15. Dumasy JF, Daniaux C, Donnay I, Baret PV. Genetic diversity and networks of exchange: a combined approach to assess intra-breed diversity. Genet Sel Evol. 2012;44:17.

    Article  PubMed Central  PubMed  Google Scholar 

  16. ECONOGENE Project. [].

  17. Chapuis MP, Estoup A. Microsatellite null alleles and estimation of population differentiation. Mol Biol Evol. 2007;24:621–31.

    Article  CAS  PubMed  Google Scholar 

  18. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–70.

    Article  Google Scholar 

  19. Belkhir K, Borsa P, Chikhi L, Raufaste N, Bonhomme F. GENETIX, logiciel sous Windows TM pour la génétique des populations. Montpellier (France): Laboratoire Génome, Populations, Interactions, CNRS UMR 5171, Université de Montpellier II 1996–2004. Version 4.05.2.

  20. Rousset F. Genepop’007: a complete re-implementation of the genepop software for Windows and Linux. Mol Ecol Resour. 2008;8:103–6.

    Article  PubMed  Google Scholar 

  21. Goudet J, Raymond M, de-Meeus T, Rousset F. Testing differentiation in diploid populations. Genetics. 1996;144:1933–40.

    CAS  PubMed Central  PubMed  Google Scholar 

  22. Goudet J. FSTAT, a program to estimate and test gene diversities and fixation indices. J Hered. 1995;86:485–6 [version 2.9.3: software available from].

    Google Scholar 

  23. Excoffier L, Lischer HEL. Arlequin suite ver. 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Res. 2010;10:564–7.

    Article  Google Scholar 

  24. Reynolds J, Weir BS, Cockerham CC. Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genetics. 1983;105:767–79.

    CAS  PubMed Central  PubMed  Google Scholar 

  25. Felsenstein J. PHYLIP - Phylogeny Inference Package (Version 3.2). Cladistics. 1989;5:164–6.

    Google Scholar 

  26. Bryant D, Moulton V. Neighbor-Net: An agglomerative method for the construction of phylogenetic networks. Mol Biol Evol. 2004;21:255–65.

    Article  CAS  PubMed  Google Scholar 

  27. Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23:254–67 [software available from].

    Article  CAS  PubMed  Google Scholar 

  28. Goudet J. PCA-GEN, (version 1.2) Lausanne, Switzerland. [software available from].

  29. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.

    CAS  PubMed Central  PubMed  Google Scholar 

  30. Wang S, Lewis Jr CM, Jakobsson M, Ramachandran S, Ray N, Bedoya G, et al. Genetic variation and population structure in native Americans. PLoS Genet. 2007;3, e185.

    Article  PubMed Central  PubMed  Google Scholar 

  31. Leroy G, Verrier E, Meriaux JC, Rognon X. Genetic diversity of dog breeds: between-breed diversity, breed assignation and conservation approaches. Anim Genet. 2009;40:333–43.

    Article  CAS  PubMed  Google Scholar 

  32. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801–6.

    Article  CAS  PubMed  Google Scholar 

  33. Caballero A, Toro MA. Analysis of genetic diversity for the management of conserved subdivided populations. Conserv Genet. 2002;3:289–99.

    Article  CAS  Google Scholar 

  34. Perret G. La race Barégeoise. In: Perret G, editor. Races ovines. Paris: ITOVIC; 1986. p. 39–46.

    Google Scholar 

  35. Danchin-Burge C, Palhiere I, Francois D, Bibé B, Leroy G, Verrier E. Pedigree analysis of seven small French sheep populations and implications for the management of rare breeds. J Anim Sci. 2010;88:505–16.

    Article  CAS  PubMed  Google Scholar 

  36. Leroy G, Callede L, Verrier E, Mériaux JC, Ricard A, Danchin-Burge C, et al. Genetic diversity of a large set of horse breeds raised in France assessed by microsatellite polymorphism. Genet Sel Evol. 2009;41:5.

    Article  PubMed Central  PubMed  Google Scholar 

  37. Ginja C, Gama L, Cortes O, Delgado J, Dunner S, Garcia D, et al. Analysis of conservation priorities of Iberoamerican cattle based on autosomal microsatellite markers. Genet Sel Evol. 2013;45:35.

    Article  PubMed Central  PubMed  Google Scholar 

  38. Fabuel E, Barragán C, Silió L, Rodríguez MC, Toro MA. Analysis of genetic diversity and conservation priorities in Iberian pigs based on microsatellite markers. Heredity. 2004;93:104–13.

    Article  CAS  PubMed  Google Scholar 

  39. Thaon d’Arnoldi C, Foulley JL, Ollivier L. An overview of the Weitzman approach to diversity. Genet Sel Evol. 1998;30:149–61.

    Article  Google Scholar 

  40. Ollivier L, Fouley JL. Aggregate diversity: New approach combining within- and between-breed genetic diversity. Livest Prod Sci. 2005;95:247–54.

    Article  Google Scholar 

  41. Lenstra JA, Econogen Consortium. Evolutionary and demographic history of sheep and goats suggested by nuclear, mtdna and y-chromosome markers. In Proceedings of the International Workshop on the Role of Biotechnology for the Characterisation and Conservation of Crop, Forestry, Animal and Fishery Genetic Resources: 5–7 March 2005; Turin. [available at].

  42. Ginja C, Telo Da Gama L, Penedo MCT. Analysis of STR markers reveals high genetic structure in Portuguese native cattle. J Hered. 2010;101:201–10.

    Article  CAS  PubMed  Google Scholar 

  43. Gautier M, Laloë D, Moazami-Goudarzi K. Insights into the genetic history of French cattle from dense SNP data on 47 worldwide breeds. PLoS ONE. 2010;5, e13038.

    Article  PubMed Central  PubMed  Google Scholar 

  44. Reist-Marti SB, Simianer H, Gibson J, Hanotte O, Rege JEO. Weitzman’s approach and conservation of breed diversity: An application to African cattle breeds. Conserv Biol. 2003;17:1299–311.

    Article  Google Scholar 

  45. Lauvie A, Audiot A, Couix N, Casabianca F, Brives H, Verrier E. Diversity of rare breed management programs: Between conservation and development. Livest Sci. 2011;140:161–70.

    Article  Google Scholar 

Download references


We would like to thank the breeders, the technicians of Breeders Associations, and Races-de-France (French federation of Breeders Associations) for their cooperation in providing samples. We also thank Jean-Marc Babillot (LABOGENA) for his laboratory assistance and Claude Chevalet for useful discussion. This study was financially supported by INRA (Animal Genetics Division, UMR GenPhySE, UMR GABI) and AgroParisTech.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Xavier Rognon.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

CDB, EV, IP, MSC and XR conceived the project; CDB, IP and XR planned the selection of samples and data collection; GL, YN and XR analysed the data; CDB, GL and XR wrote the paper, and all the authors participated in the discussion. All authors have read and approved the final manuscript.

Additional files

Additional file 1: Figure S1.

STRUCTURE analysis with the 51 populations for K = 2–10 and 51. Estimated membership fractions for each individual of the 51 populations to the inferred K cluster, using Q-matrix averaged overall 50 runs.

Additional file 2: Figure S2.

STRUCTURE analysis with the 51 populations. Evolution of (a) likelihood Ln(P(D)) and (b) similarity (G') according to the number of clusters K (K = 1–15, 20, 25, 30, 35, 40, 48, 50, 51, and 55).

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit

The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Leroy, G., Danchin-Burge, C., Palhière, I. et al. How do introgression events shape the partitioning of diversity among breeds: a case study in sheep. Genet Sel Evol 47, 48 (2015).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: