Population structure and genetic diversity of 25 Russian sheep breeds based on whole-genome genotyping
- Tatiana E. Deniskova†1Email author,
- Arsen V. Dotsev†1,
- Marina I. Selionova2,
- Elisabeth Kunz3,
- Ivica Medugorac3,
- Henry Reyer4,
- Klaus Wimmers4,
- Mario Barbato5,
- Alexei A. Traspov1,
- Gottfried Brem1, 6 and
- Natalia A. Zinovieva1Email author
© The Author(s) 2018
Received: 8 October 2017
Accepted: 16 May 2018
Published: 24 May 2018
Russia has a diverse variety of native and locally developed sheep breeds with coarse, fine, and semi-fine wool, which inhabit different climate zones and landscapes that range from hot deserts to harsh northern areas. To date, no genome-wide information has been used to investigate the history and genetic characteristics of the extant local Russian sheep populations. To infer the population structure and genome-wide diversity of Russian sheep, 25 local breeds were genotyped with the OvineSNP50 BeadChip. Furthermore, to evaluate admixture contributions from foreign breeds in Russian sheep, a set of 58 worldwide breeds from publicly available genotypes was added to our data.
We recorded similar observed heterozygosity (0.354–0.395) and allelic richness (1.890–1.955) levels across the analyzed breeds and they are comparable with those observed in the worldwide breeds. Recent effective population sizes estimated from linkage disequilibrium five generations ago ranged from 65 to 543. Multi-dimensional scaling, admixture, and neighbor-net analyses consistently identified a two-step subdivision of the Russian local sheep breeds. A first split clustered the Russian sheep populations according to their wool type (fine wool, semi-fine wool and coarse wool). The Dagestan Mountain and Baikal fine-fleeced breeds differ from the other Merino-derived local breeds. The semi-fine wool cluster combined a breed of Romanian origin, Tsigai, with its derivative Altai Mountain, the two Romney-introgressed breeds Kuibyshev and North Caucasian, and the Lincoln-introgressed Russian longhaired breed. The coarse-wool group comprised the Nordic short-tailed Romanov, the long-fat-tailed outlier Kuchugur and two clusters of fat-tailed sheep: the Caucasian Mountain breeds and the Buubei, Karakul, Edilbai, Kalmyk and Tuva breeds. The Russian fat-tailed breeds shared co-ancestry with sheep from China and Southwestern Asia (Iran).
In this study, we derived the genetic characteristics of the major Russian local sheep breeds, which are moderately diverse and have a strong population structure. Pooling our data with a worldwide genotyping set gave deeper insight into the history and origin of the Russian sheep populations.
The sheep (Ovis aries) is one of the economically most important agricultural species and produces a wide range of valuable products including food (meat, milk) and raw materials (wool, sheepskin) . Since their domestication approximately 11,000 years ago (YA) [2, 3], sheep have spread to all continents where they were reared under different environmental, management, and selection conditions. Consequently, diverse local breeds with a unique composition of various traits were developed.
Sheep breeding has always been an important branch of animal husbandry in Russia. The harsh climate conditions, which are characterized by low temperatures and 120 to 240 windy days per year, dictate a steady public demand for wool, sheepskins and felt products. Furthermore, Russia offers more than 75 million hectares of natural grasslands and pastures that are suitable for sheep rearing. Until 1990, Russia, along with Australia, China and New Zealand, was one of the world leaders in wool sheep production. However, the radical reformation of the economy reduced the number of sheep from 58 million in 1990, to 24.7 million in 2014 . This trend was partly associated with a worldwide reduction of the demand of wool. Currently, sheep breeding is recovering and turning its production to meat instead of wool. Thus, the proportion of wool breeds has decreased from 90% in 1990 to 56% in 2014, while that of meat types has increased from 10 to 44% . These developments threaten many wool breeds and they have even abolished several of them . From the 45 breeds that were recorded in 1990, only 28 are still maintained . Wool breeds comprise breeds with coarse wool and breeds with fine and semi-fine wool. The Russian coarse wool breeds originated from local sheep that were well adapted to the local environmental conditions of certain regions, such as the Edilbai and Kalmyk fat-rumped breeds in the hot dry steppe regions in the south of Russia, the Tuva short-fat-tailed breed in the Trans-Baikal area with a harsh continental climate, the Andean and Lezgin breeds in the mountain areas of the North Caucasus with poor forage resources, and Romanov sheep in the Central Russia with cold winters. The coarse wool breeds were created mainly by folk selection practices and were only slightly improved by crossbreeding with high-producing foreign breeds [8, 9]. Furthermore, the Russian coarse wool breeds exhibit a large diversity in tail fat deposition as well as in tail length, and they include the short-thin-tailed Romanov, the long-fat-tailed Kuchugur, Karakul and Caucasian Mountain breeds, the short-fat-tailed Buubei and Tuva, and the fat-rumped Edilbai and Kalmyk breeds.
The Russian semi-fine wool breeds were established from local ewes and were substantially influenced by the Romney and Lincoln breeds [10, 11]. Most of the Russian fine wool breeds were developed during the Soviet period by improving local breeds with low productivity, mainly through crossbreeding with Merino-derived breeds such as Rambouillet and Australian Merino sheep.
The development of high-throughput arrays for genotyping of multiple single nucleotide polymorphisms (SNPs) has revolutionized modern genetic studies [12, 13]. This technology allows unambiguous scoring and the combination of standardized data from different laboratories [14–16], thus providing a powerful tool to address a number of genetic issues [17, 18] including the successful application for studies on population structure in farm animals. During the last decade, detailed studies of the biodiversity and admixture levels in sheep breeds from Asia, Africa, America, Europe, Australia and New Zealand were performed using SNPs [19–23]. To date, only a few Russian sheep breeds have been genotyped using the OvineSNP50K BeadChip , whereas most of them have been analyzed using mitochondrial  and microsatellite markers exclusively [26–28].
In this work, we investigated the patterns of whole-genome diversity and the population structure of 25 local Russian sheep breeds using genome-wide genotype data. Furthermore, we determined the genetic relationship of the studied breeds with other breeds worldwide to elucidate the origin of the Russian sheep breeds.
Descriptive statistics of the genetic diversity of the 25 Russian sheep breeds analyzed
Coarse wool breeds
Semi-fine wool breeds
Fine wool breeds
DNA extraction and whole-genome SNP genotyping
Genomic DNA was extracted using Nexttec columns (Nexttec Biotechnology GmbH, Germany) following the manufacturer’s instructions. The concentrations of DNA solutions were determined using a NanoDrop-2000 (Thermo Fisher Scientific, Wilmington, DE, USA) and a Qubit 3.0 fluorimeter (Life Technologies). DNA concentrations and the OD260/OD280 ratio of DNA solutions were determined by NanoDrop. A Qubit dsDNA HS (high sensitivity, 0.2–100 ng) Assay Kit was used to measure the concentration of dsDNA according to the manufacturer’s protocols. The DNA quality was checked by 1% agarose gel electrophoresis. Whole-genome SNP genotyping was performed using the OvineSNP50 BeadChip (Illumina, San Diego, CA, USA).
Construction of datasets
Two datasets were included in the analyses. The first one comprised 25 Russian sheep breeds (see Additional file 1: Table S1), while the second one included 24 of the 25 Russian sheep breeds mentioned above (except for the Baikal fine-fleeced breed, which was excluded from the combined dataset due to the small number of samples) and 2791 samples from 58 worldwide sheep breeds from publicly available sources [19, 21–23]. To account for the effects of family structures within the subpopulations, the genome-wide relationships between all animal pairs were inferred by estimating a unified additive relationship (UAR) matrix according to Yang et al. . After exclusion of one of 1157 pairs of highly related animals (relationship > 0.25), the combined dataset comprised the SNP genotypes of 1592 relatively unrelated individuals from 82 breeds. Outliers were identified using a neighbor-joining tree based on identical-by-state (IBS) allele-sharing distances (–distance 1-ibs). Three outliers were found and removed from the Stavropol, Tushin, and Altai Mountain datasets.
The worldwide breeds were pooled according to their historical geographic origin and included 13 breeds from the British Isles, five breeds from Northern Europe, six breeds from Central Europe, 22 breeds from Southwestern Europe, three breeds from Asia, three breeds from Southwestern Asia, two breeds from South Africa, and four breeds from the Americas. Breed acronyms and color codes are available in Table S2 (see Additional file 2: Table S2).
SNP quality control
First, the accuracy and efficiency of SNP genotyping were assessed. Valid genotypes for each SNP were determined by applying a cut-off of 0.5 for the GenCall (GC) and GenTrain (GT) scores . Next, PLINK 1.07  was used to exclude SNPs for which less than 90% of the individuals were genotyped (–geno 0.1), that had a minor allele frequency (MAF) lower than 5% (–maf 0.05), that departed from Hardy–Weinberg equilibrium at p < 10−6 (–hwe 1e-6) and that were in linkage disequilibrium (–indep-pairwise 50 5 0.5). Finally, only SNPs that are located on autosomes were kept for further analyses. Individuals with more than 10% missing genotypes (–mind 0.1) were removed. A Hardy–Weinberg equilibrium test was not performed for comparisons with worldwide breeds because too many SNPs would be excluded due to the Wahlund effect .
Whole-genome SNP data processing
The R package ‘diveRsity’  was used to calculate expected heterozygosity (HE) , rarefied allelic richness (AR) and pairwise FST values based on SNP genotypes. Multi-dimensional scaling (MDS) analysis based on pairwise identical-by-state (IBS) distances was performed with PLINK 1.07 (–cluster, –mds-plot 4) and visualized with the R package “ggplot2” . Pairwise Nei’s genetic distances  were calculated using the R package ‘adegenet’ . Neighbor-net graphs both for the Russian and the combined dataset based on pairwise FST values were computed using SplitsTree 4.14.5 .
Genetic admixture calculations were performed using Admixture v1.3  and plotted with the R package “pophelper” . Values of K (the number of assumed ancestral populations) ranging from 1 to 25 for the Russian dataset and from 1 to 74 for the combined dataset as well as their respective cross-validation (CV) errors were evaluated.
Trends of effective population size (Ne) were estimated from linkage disequilibrium (LD) as implemented in SNeP . Default parameters were applied, except for the sample size correction, occurrence of mutation (α = 2.2; ), and recombination rate between a pair of genetic markers according to Sved and Feldman . The most recent estimate of Ne was taken five generations back (Ne5). Furthermore, Ne estimates for c = 1 Mb (~ 50 generations ago; Ne50), where c is the distance between the SNPs in Morgans, were used for comparison with results from Kijas et al. [19, 23, 46]. A ‘Ne changing ratio’ (NeC) analysis was used as a proxy of the speed in Ne changes in the 20 most recent generations. The slope of each segment that links a pair of neighboring Ne estimates was calculated and normalized using the median of the most recent 20 Ne estimates.
R version 3.3.2 was used to create input files .
Analysis of genetic diversity, population structure and genetic differentiation within 25 Russian sheep breeds
Descriptive statistics of the genetic diversity of the 25 Russian sheep breeds analyzed are in Table 1. Estimates of expected heterozygosity (HE) and rarified allelic richness (AR) in the Russian breeds under study were higher than 0.358 and 1.900, respectively. Only the Romanov breed had a lower level of genetic diversity with an HE of 0.354 and AR of 1.890.
The mean Ne5 value was around 228, with the Karakul and Kuchugur breeds displaying the highest (543) and lowest (65) values, respectively. The recorded Ne 50 values showed a similar trend i.e. 2171 for the Karakul and 357 for the Kuchugur breeds.
Phylogenetic relationships between Russian and global sheep breeds
The global admixture analysis revealed that the genetic backgrounds that predominate in Chinese and Iranian sheep are present in all Russian coarse wool breeds except for the Romanov and Kuchugur breeds. In addition, the fat-rumped Edilbai and Kalmyk as well as the short-fat-tailed Buubei and Tuva breeds shared a significant common genetic ancestry with Chinese (Tibet) sheep. We detected similar patterns for the Russian Karakul and the Iran Afshari breeds. Most of the Russian sheep breeds analyzed here revealed a complex ancestry, but two Russian indigenous breeds (Romanov and Kuchugur) formed specific genetic patterns that were not detected in the other studied sheep populations. We observed a high level of consolidation for the Romanov breed, while the extent of admixture for the Kuchugur breed was more obvious.
Due to their vast extension and unique Eurasian geographical position, Russian local livestock are of special interest [26, 48–50]. The first key point of interest for us was to investigate the whole-genome diversity of the breeds under study. This was crucial since no Russian sheep breeds were included in the OvineSNP50 BeadChip (Illumina) discovery panel. We found that the levels of variability of Russian breeds were similar to those reported for other sheep breeds [19, 21–23].
Regarding the slope changes in the Ne trend lines (see Additional file 4: Figure S1), the major peak of Ne decline for 24 of the 25 breeds analysed occurred about eight generations ago. This decline is most likely due to the beginning of the restructuring of the Soviet economy, the so-called Perestroika, which resulted in the destruction of the planned economy system and in a deep crisis of the agricultural sector. The subsequent lack of forage and food resources led to a considerable decrease in the number of all livestock populations including sheep, which can be detected in the evolution of the Ne. The negative consequences continued during the next decade of the post-soviet times, which could explain the shifts of the peaks in the Ne slopes of some breeds between 6 and 8 generations ago. However, one breed i.e. the Dagestan Mountain breed did not follow this trend and maintained its population size during the Perestroika. A possible explanation for this trend might be the great popularity of the Dagestan Mountain sheep in their breeding region because of their combined good meat and wool productivity. In addition, we observed that the coarse wool breeds did not display any further recent significant peaks, whereas fine and semi-fine wool breeds do. This could be indirectly associated with the growing interest of farmers in local coarse wool breeds that are highly adapted to specific regions.
We observed a decline in Ne over time for the breeds analyzed (Fig. 5). The most rapid decline in Ne occurred over the last 200 to 400 generations in all breeds. In general, this decrease corresponded to the results obtained by Kijas et al.  on sheep breeds included in the HapMap Project data . However, some breeds showed interesting patterns regarding changes in ancestral Ne. Until 250 generations ago, the Ne curve of the Tsigai breed was almost parallel to the x-axis. The same tendency towards smooth curves until 200 to 250 generations ago was also observed for the Tuva, Karachaev, Kalmyk, Edilbai, Karakul and Lezgin breeds. This pattern most likely reflects their ancient origin and wide geographic distribution. In addition, all mentioned breeds currently have large Ne (Table 1). However, in their latest study, Prieur et al.  suggested that the 50K SNP BeadChip is not suitable for estimating the Ne more than 100 generations ago. Consequently, these inferences onto many generations ago based on a 50K DNA array data should be treated with caution.
Overall, the current effective population size estimates (Ne50) for the Russian sheep groups were larger than those of the other worldwide sheep breeds [19, 23, 46]. The Kuchugur breed recorded the smallest Ne5 and Ne50 values (65 and 357, respectively), which most likely reflect the low management conditions of the breed, for which no precise information on the population size is available . However, although the Ne50 values are not as critical as those for Dorset Horn (Ne50 = 134) and Wiltshire (Ne50 = 100) breeds , the most recent Ne5 estimate for the Kuchugur breed is around 50, which is considered as the threshold risk of extinction in the short term . This implies that the breed should be monitored closely as a relevant candidate for conservation efforts.
On the history of the Russian coarse wool sheep breeds
The analysis of a combined dataset of local and worldwide sheep genotypes allowed us to gain insight into the history and ancestry of the Russian sheep population. The Russian coarse wool breeds are characterized by differences in tail phenotypes and included sheep with thin tails and sheep with fat tails and fat rumps. Among these different tail types, the thin tail is likely to be the ancestral trait, since it is present in the mouflon, which is the most probable wild ancestor of modern sheep. According to archaeological findings, fat-tailed sheep were developed from thin-tailed sheep and were first mentioned about 5000 years ago . In this regard, fat deposition in the tail is an important genetic trait that is considered one of the major post-domestic adaptations to harsh environments (drought seasons, extreme cold winters and food shortages) as well as an energy source for long migrations [56, 57]. In our study, the tail types of the Russian coarse wool breeds could provide valuable information on their origin.
Here, we recorded a strong differentiation between the thin-tailed Romanov and the local fat-tailed and fat-rumped groups (Figs. 2, 3, 4, 6, and 7). A further subdivision was detected within the group with fat deposition in the tail. This group comprised the long-fat-tailed Kuchugur breed and two subclusters: Karakul (long-fat-tailed), Buubei and Tuva + Edilbai + Kalmyk (short-fat-tailed and fat-rumped), and Andean Black + Lezgin + Tushin + Karachaev (long-fat-tailed). For a better understanding of the results, some aspects of the origin of each breed are discussed below.
The Romanov breed, which is the only short-thin-tailed Russian coarse wool breed, was created by local farmers in the seventeenth century in the Yaroslavl region. Today, the Romanov breed is famous worldwide for its extraordinary prolificacy, early sexual maturity and out-of-season breeding ability . Compared with the other coarse wool breeds, the Romanov breed clearly showed different ancestry, which was well demonstrated by the results at the local level (Figs. 2, 3, and 4). Neighbor-net (Fig. 6) and admixture graphs (Fig. 7) confirmed the North European genetic roots of the breed. Indeed, the Romanov breed clustered outside the other Russian coarse wool breeds and formed a group with the Finnsheep and Norwegian Spaelsau breeds (Fig. 6). Romanov and Finnsheep are the most well known and numerous representatives of the Northern European short-tailed breeds [49, 58]. It is believed that Norse Vikings spread these northern sheep to several countries from the late eighth century to the middle of the eleventh century AD . The patterns obtained at K = 5, 6 and 7 (Fig. 7) also suggested a common ancestry between Romanov and Finnsheep. However, at K = 14 and higher, all breeds clearly differentiated from one another (Fig. 7). Originating from the same ancient Nordic ancestor group, each breed (including Romanov) most likely formed their unique gene pool under different selection, geographical and feed conditions. Such interpretation is in agreement with historical records, which consider the Romanov an independent branch of the Northern European short-tailed breeds .
Neighbor-net and admixture graphs (Figs. 6, 7) suggested a common ancestry between the fat-tailed Russian coarse wool breeds, Asian (Chinese and Indian), and Southwestern Asian (Iran) sheep. The range of the fat-tailed and fat-rumped sheep overlaps with the European and Asian Russian territory, which was proposed to be the consequence of nomadic expansions including invasions and the intensive east–west trading via the Silk Road [57, 61, 62]. Specifically, sheep from the Middle Eastern domestication center were brought to the Caucasus, the area east of the Caspian Sea and Central Asia, and finally arrived in North and Southwest China and the Indian subcontinent via the Mongolian Plateau region [57, 62]. Furthermore, the gene flow could have taken place through the major Turkic migrations and later Mongol invasions [57, 61], which were accompanied by sheep flocks. Indeed, this may explain the admixture of Caucasian Mountain fat-tailed sheep and the Chinese breeds.
The fat-tailed local sheep, Andean, Karachaev, Lezgin, and Tushin formed the Caucasian Mountain fat-tailed cluster. Sheep husbandry has always been of special value to the Russian south regions, especially in mountain regions, and it represents an inseparable part of the local cultural heritage. Andean, Karachaev, Lezgin, and Tushin sheep are versatile breeds that produce meat, wool and milk in equivalent proportions. These sheep easily withstand long marches over great distances and are highly adapted to grazing the mountain and lowland pastures. The wool is used for manufacturing felt shoes and fabrics to sew the traditional men’s clothing. All these breeds were created by folk selection practices during the nineteenth and twentieth century in different mountain parts of the North Caucasus [63, 64].
The second cluster of the fat-tailed local sheep included breeds with more significant Asian ancestry (China and Tibet): Kalmyk, Edilbai, Buubei and Tuva. The fat-rumped Edilbai and Kalmyk sheep combine high meat and grease productivity with excellent adaptability to year-round grazing in extreme semi-desert and desert climatic conditions . Although the breeds are reared mostly in the southern part of Russia (Fig. 1) and (see Additional file 1: Table S1), they are of Asian ancestry. Thus, the Edilbai breed was obtained by crossing Astrakhan rams with Kazakh fat-rumped ewes between the Ural River and the Volga River. The Kalmyk originated from indigenous fat-rumped sheep from China and improved with sheep from the Edilbai and Torgudsk breeds. The close relation between Edilbai and Kalmyk sheep was very well illustrated by the formation of a common branch in the neighbor-net (Fig. 4) and by the low pairwise FST value (FST = 0.007), (see Additional file 33 Table S3).
The Buubei breed is the result of long-term improvement of the indigenous Buryat sheep. This breed is characterized by a high prolificacy and good adaptation to the severe climatic conditions of the Republic of Buryatia [65, 66]. In the middle of the twentieth century, the indigenous Buryat sheep had become extinct . In the 1980s’, a small group of indigenous Buryat sheep was found in China and was later transported to their historic homeland. This is compatible with our findings that the Chinese genetic background significantly contributed to the Buubei breed.
The ancient Tuva breed was raised under the harsh climate of the Republic of Tyva by local nomadic tribes approximately 2000 YA. These sheep can survive on small amounts of forage while accumulating body fat and they can take snow instead of water, which is an important advantage for surviving in steppe and mountain pastures. Their coarse wool, which is composed of down, guard and dead hair, is the feedstock for shoes and felt fabrics for traditional clothing . The Republic of Tyva has a common border with Mongolia across which the gene flow with China could have taken place. Furthermore, both Buubei and Tuva are short fat-tailed and are very similar to Chinese breeds. A study of the demographic history of Chinese native sheep showed that the expansion of short-fat-tailed sheep into China was mainly associated with the invasions of Mongols, who reared the short-fat-tailed sheep, from the Mongolian Plateau during the twelvetieth and thirtieth centuries . Consequently, the Buubei, Tuva and Chinese breeds probably share Mongolian ancestry.
The position within the fat-tailed coarse wool group of the Russian Karakul breed is not perfectly clear. The local neighbor-net (Fig. 4) suggested a closer relation with the Kalmyk, Edilbai and Tuva breeds. However, the global admixture results (Fig. 7) showed significant co-ancestry between the Karakul and Iranian breeds, which is more consistent with the breed’s origin. The history of the creation of the Karakul breed is still in question and there are two main theories. Some scientists believe that the Karakul breed results from crossing the black indigenous sheep of Bukhara (Turkestan) with Afghan and native fat-rumped sheep . Others assumed that the Arabs brought the ancestors of the Karakul breed to Middle Asia in the eighth century . Both theories agree with our findings.
The long-fat-tailed Kuchugur showed a pattern of admixture that was quite similar to that of the other fat-tailed Russian coarse wool breeds at K = 5, 6, 7 and 14 (Fig. 7). However, Kuchugur appeared as an outlier according to the neighbor-net analyses (Figs. 4 and 6), with a branch that is positioned between the Tsigai + Altai Mountain cluster (with lower genetic distance) and the fat-tailed local cluster. This most likely reflects the crossbred origin of the Kuchugur breed. It is assumed that the Kuchugur breed resulted from the cross of indigenous crossbred coarse wool ewes with large Voloshian (Valakhian) rams . Furthermore, the lowest pairwise FST value for the Kuchugur breed was detected with the Tsigai breed (FST = 0.068) (see Additional file 3: Table S3). Since both the Tsigai and Voloshian breeds originated in the Balkans, they are genetically close and have influenced many sheep breeds in Eastern Europe [71–74], which also confirms the European ancestry of Kuchugur. Moreover, historical records suggest that a foreign breed—most likely one of the English Longwool type—was used to improve the local crossbreds towards curly wool and good body conformation .
On the history of the Russian semi-fine wool sheep breeds
Analysis of the phylogeny of the Russian semi-fine wool breeds revealed several ancestry backgrounds. The local neighbor-net analysis indicated the presence of two main clusters of which one includes the Altai Mountain and Tsigai breeds and the other the Kuibyshev, North Caucasian and Russian Longhaired breeds. The history of the creation of these breeds’ provided insight into this differentiation.
Both admixture patterns (Figs. 3, 7) showed a common genetic background for the Tsigai and Altai Mountain breeds. The Roman origin of the Tsigai sheep and its subsequent spread in the Balkans was previously suggested [73, 74, 76]. The history of the Russian Tsigai began when Transylvanian farmers brought Tsigai sheep from Romania to the former Russian Empire in 1914 [75–77]. Since the establishment of the Tsigai herd book, this breed was kept pure. However, possible admixture with fine wool breeds could probably have taken place at the early stages of Tsigai breeding after the breed was imported to Russia. Unfortunately, no original Romanian Tsigai SNP data is available to better evaluate the relationship between Russian and Romanian Tsigai sheep.
The Altai Mountain breed resulted from crossing local coarse wool sheep with the Groznensk breed, as confirmed by the admixture analysis (Figs. 3, 7). Furthermore, the Tsigai breed was involved in the breeding process of the Altai Mountain breed during the period from 1945 to 1970 [53, 70]. Their common ancestry is illustrated by the MDS, admixture plots and neighbor-net analyses (Figs. 2, 3 and 4), and confirmed by the low pairwise FST values (FST = 0.013) (see Additional file 3: Table S3).
The origin of the other semi-fine wool sheep was closely associated with the English long-wool breeds. Thus, the Kuibyshev breed was obtained from an ancestry that involved Romney Marsh rams . At the first stages of the North Caucasian breed creation, both Romney Marsh and Lincoln rams were widely used. Because the Lincoln progeny showed higher growth rates and were characterized by a better external phenotype, only Lincoln rams were maintained in the breeding process [10, 11, 53]. Nevertheless, due to the close genetic relatedness between North Caucasian and Kuibyshev sheep (FST = 0.020), we assume that the Romney Marsh genetic background is still present in the modern North Caucasian sheep. The shared ancestry of both breeds and Romney Marsh was identified by the admixture analysis (Fig. 7). Interestingly, the neighbor-net analysis identified some genetic overlap between the North Caucasian and the Russian longhaired breeds (Fig. 6), which is consistent with the origin of the Russian Longhaired breed that was created with the participation of Lincoln sheep (see Additional file 1: Table S1), and by a relatively large Galway ancestry component, the Galway breed being a long-wool breed as the Lincoln breed (Fig. 7). Finally, Kuchugur is believed to have been involved in the development of the Russian Longhaired breed . Although FST values between these breeds were significant (FST = 0.09), the presence of the Kuchugur background was obvious in the Russian Longhaired at K = 42 in the global admixture plot (Fig. 7).
On the history of the Russian fine wool sheep breeds
Ciani et al.  conducted a study that focused on the Merino influence on the development of new breeds distributed throughout the world; however, the Russian Merino-derived sheep breeds were not included in the analysis. In the former USSR, wool production was one of the most prioritized branches of animal husbandry. In this regard, the majority of Russian fine wool breeds were created between 1920 and 1980. Thus, most fine wool breeds (Groznensk, Stavropol, Soviet Merino and Salsk) result from the improvement of local fine wool Mazaev and Novocaucasian ewes with commercial rams that have a high wool productivity such as the Spanish Merino, French and American Rambouillet, and Merino Précoce breeds [22, 70, 79].
The Manych Merino breed was developed from Stavropol ewes that were improved with Australian Merino rams . The close genetic relationship between Manych Merino and Stavropol was evidenced by both by the neighbor-net analyses (Figs. 4 and 6), and by their low FST value (0.012) (see Additional file 3: Table S3). The Volgograd sheep resulted from a complex crossing that involved Groznensk rams  as suggested by the results of the neighbor-net analysis (Fig. 4) and the FST value (0.018) (see Additional file 3: Table S3).
Later, from 1990 to 2004, Australian Merino sheep were used to improve the quality of the wool of most of the Russian fine wool breeds . However, the genetic background of the Dagestan Mountain and Baikal fine-fleeced breeds is clearly different to that of other local fine wool breeds (Fig. 2). This could most likely be due to the fact that local crossbred coarse wool ewes, specifically Gunib for Dagestan Mountain sheep and Buryat-Mongolian for Baikal fine-fleeced sheep, were used instead of Mazaev and Novocaucasian Merino sheep . Nonetheless, an authentic Russian origin of the fine- and semi-fine-wool sheep is indicated by the K = 42 pattern of the global admixture plot (Fig. 7), in which these breeds share a (violet) ancestral component that is not present in any other breed.
In this study, we investigated the genome-wide diversity and population structure of 25 Russian local sheep breeds for the first time. We identified three clusters corresponding to the wool type. We identified a main discriminating factor within the Russian coarse wool cluster i.e. tail type, with the short-thin-tailed Romanov breed clearly differentiated from the other fat-tailed or fat-rumped breeds. The combination of local Russian sheep data with a worldwide sheep SNP genotyping set provided admixture patterns that gave deeper insights into the origin of the local Russian sheep. Thus, our findings suggest shared ancestry of local fat-tailed coarse wool breeds and Southwestern Asian (Iran) sheep, which may be a consequence of nomadic migrations, including invasions and east–west trading. Although co-ancestry between the Romanov breed and the Northern short-tailed group was clearly confirmed, we also noted that this breed is genetically distinct, which may be clarified by future studies using a larger sample size, denser SNP panels or whole-genome sequencing. The computation of the most recent effective population sizes revealed a few local breeds with critically small values that constitute a warning flag for the implementation of conservation efforts (e.g. the Kuchugur breed). This study is the first step to design a more effective selection and conservation program for Russian local sheep breeds based on whole-genome SNP genotyping data. This is essential for sustainable sheep breeding at the global level and for the future prosperity of sheep breeding at the local level across Russia.
NAZ and GB developed the concept and designed the study. MIS collected sheep samples. TED and HR conducted the molecular genetic work. AVD, HR, MB, EK and IM processed the molecular genetic data. TED, AVD, MIS, MB, KW, GB and NAZ analyzed and discussed the data. AT assisted in data analysis. TED, AVD, MIS and NAZ wrote the manuscript. All authors read and approved the final manuscript.
We thank the staff of the laboratory of molecular bases of breeding for preparation of DNA samples used in the analyses. We would like to express our sincere gratitude to the reviewers and the editors for the attention to our paper and for valuable comments that helped us to improve the manuscript significantly.
The authors declare that they have no competing interests.
Availability of data and materials
The 50K genotypes of the Russian sheep breeds that were used in the current study are available from the corresponding author upon reasonable request.
Consent to participate
Consent for publication
The authors declare that animal tissue samples were collected by trained personnel under strict veterinary rules. Sampling was performed in accordance with the ethical guidelines of the L.K. Ernst Federal Science Center for Animal Husbandry.
This study was supported by the Russian Scientific Foundation (RSF) within Project No. 14-36-00039. The authors declare that the RSF financed the project and did not have any influence on the results and their interpretation. The biomaterials that were used in this study stemmed from the genetic resource collection of the L.K. Ernst Federal Science Center for Animal Husbandry, supported by the Federal Agency for Scientific Organizations.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Chessa B, Pereira F, Arnaud F, Amorim A, Goyache F, Mainland I, et al. Revealing the history of sheep domestication using retrovirus integrations. Science. 2009;324:532–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Zeder MA. Domestication and early agriculture in the Mediterranean Basin: origins, diffusion, and impact. Proc Natl Acad Sci USA. 2008;105:11597–604.View ArticlePubMedPubMed CentralGoogle Scholar
- Vigne JD, Carrère I, Briois F, Guilaine J. The early process of mammal domestication in the near east: new evidence from the pre-neolithic and pre-pottery neolithic in Cyprus. Curr Anthropol. 2011;52:S255–71.View ArticleGoogle Scholar
- IWTO Market Information, FAOSTAT. http://www.fao.org/faostat/en/#home. Accessed 15 Sept 2016.
- Lescheva M, Ivolga A. Current state and perspectives of sheep breeding development in Russian modern economic conditions. Econ Agric. 2015;62:467–80.Google Scholar
- Erokhin AI. Ovtzevodstvo. Voronezh: Voronezhskii GAY; 2014 (in Russian).Google Scholar
- Amerkhanov KhA. Ovtzevodstvo I kozovodstvo Rossiiskoy Federatsii v tsyfrakh. Stavropol: BI; 2015 (in Russian).Google Scholar
- Veniaminov AA. Porody ovets mira. Moskva: Kolos; 1984 (in Russian).Google Scholar
- Zakharov IA. Genefondy sel`skokhozyastvennych zhivotnykh: geneticheskie resursy zhivotnovodsrva Rossii. Moskva: Nauka; 2006 (in Russian).Google Scholar
- Sel’kin II, Sokolov AN. Sozdanie i soversenstvovanie polytonkorunnykh porod ovets. Ovtsy, kosy, sherstyanoe delo. 2002;3:10–2 (in Russian).Google Scholar
- Sel’kin II, Aboneev VV. Severokavkazskay myaso-sherstnaya poroda. Stavropol: BI; 2007 (in Russian).Google Scholar
- LaFramboise T. Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances. Nucleic Acids Res. 2009;37:4181–93.View ArticlePubMedPubMed CentralGoogle Scholar
- Lenstra JA, Groeneveld LF, Edin GH, Kantanen J, Williams JL, Taberlet P, et al. Molecular tools and analytical approaches for the characterization of farm animal genetic diversity. Anim Genet. 2012;43:483–502.View ArticlePubMedGoogle Scholar
- Morin PA, McCarthy M. Highly accurate SNP genotyping from historical and low-quality samples. Mol Ecol Resour. 2007;7:937–46.View ArticleGoogle Scholar
- Smith M, Pascal C, Grauvogel Z, Habicht C, Seeb J, Seeb L. Multiplex preamplification PCR and microsatellite validation allows accurate single nucleotide polymorphism (SNP) genotyping of historical fish scales. Mol Ecol Resour. 2011;11:268–77.View ArticlePubMedGoogle Scholar
- Kawęcka A, Gurgul A, Miksza-Cybulska A. The use of SNP microarrays for biodiversity studies of sheep: a review. Ann Anim Sci. 2016;16:975–87.Google Scholar
- Gill P. An assessment of the utility of single nucleotide polymorphisms (SNPs) for forensic purposes. Int J Legal Med. 2001;114:204–10.View ArticlePubMedGoogle Scholar
- Paschou P, Ziv E, Burchard EG, Choudhry S, Rodriguez-Cintron W, Mahoney MW, et al. PCA-correlated SNPs for structure identification in worldwide human populations. PLoS Genet. 2007;3:1672–86.View ArticlePubMedGoogle Scholar
- Kijas JW, Lenstra JA, Hayes B, Boitard S, Porto Neto LR, San Cristobal M, et al. Genome-wide analysis of the world’s sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol. 2012;10:e1001258.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang L, Mousel MR, Wu X, Michal JJ, Zhou X, Ding B. Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep. PLoS One. 2013;8:e65942.View ArticlePubMedPubMed CentralGoogle Scholar
- Ciani E, Crepaldi P, Nicoloso L, Lasagna E, Sarti FM, Moioli B, et al. Genome-wide analysis of Italian sheep diversity reveals a strong geographic pattern and cryptic relationships between breeds. Anim Genet. 2014;45:256–66.View ArticlePubMedGoogle Scholar
- Ciani E, Lasagna E, D’Andrea M, Alloggio I, Marroni F, Ceccobelli S, et al. Merino and Merino-derived sheep breeds: a genome-wide intercontinental study. Genet Sel Evol. 2015;47:64.View ArticlePubMedPubMed CentralGoogle Scholar
- Beynon SE, Slavov GT, Farré M, Sunduimijid B, Waddams K, Davies B, et al. Population structure and history of the Welsh sheep breeds determined by whole genome genotyping. BMC Genet. 2015;16:65.View ArticlePubMedPubMed CentralGoogle Scholar
- Deniskova TE, Dotsev AV, Wimmers K, Reyer H, Kharzinova VR, Gladyr EA, et al. Genomic evaluation and population structure of eleven Russian sheep breeds. J Anim Sci. 2016;94:834.View ArticleGoogle Scholar
- Tapio M, Marzanov N, Ozerov M, Cinkulov M, Gonzarenko G, Kiselyova T, et al. Sheep mitochondrial DNA variation in European, Caucasian, and Central Asian areas. Mol Biol Evol. 2006;23:1776–83.View ArticlePubMedGoogle Scholar
- Tapio M, Ozerov M, Tapio I, Toro MA, Marzanov N, Cinkulov M, et al. Microsatellite-based genetic diversity and population structure of domestic sheep in northern Eurasia. BMC Genet. 2010;11:76.View ArticlePubMedPubMed CentralGoogle Scholar
- Zinovieva NA, Selionova MI, Gladyr EA, Petrovic MP, Caro Petrovic V, Ruzic MD. Investigation of gene pool and genealogical links between sheep breeds of southern Russia by blood groups and DNA microsatellites. Genetika. 2015;47:395–404.View ArticleGoogle Scholar
- Deniskova TE, Selionova MI, Dotsev AV, Bobryshova GT, Gladyr EA, Kostjunina OV, et al. Variability of microsatellites in sheep breeds raced in Russia. Agric Biol [Sel`skokhozyastvennaya biologia]. 2016;51:801–10.Google Scholar
- Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, et al. Common SNPs explain a large proportion of the heritability for human. Nat Genet. 2010;42:565–71.View ArticlePubMedPubMed CentralGoogle Scholar
- Fan JB, Oliphant A, Shen R, Kermani BG, Garcia F, Gunderson KL, et al. Highly parallel SNP genotyping. Cold Spring Harb Symp Quant Biol. 2003;68:69–78.View ArticlePubMedGoogle Scholar
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.View ArticlePubMedPubMed CentralGoogle Scholar
- Wahlund S. Zusammensetzung von Populationen und Korrelationerscheinungen vom Standpunkt der Vererbungslehre aus betrachtet. Hereditas. 1928;11:65–106.View ArticleGoogle Scholar
- Keenan K, McGinnity P, Cross TF, Crozier WW, Prodohl PA. diveRsity: an R package for the estimation of population genetics parameters and their associated errors. Methods Ecol Evol. 2013;4:782–8.View ArticleGoogle Scholar
- Nei M. Estimation of average heterozygosity and genetic distance from small number of individuals. Genetics. 1978;89:583–90.PubMedPubMed CentralGoogle Scholar
- Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer; 2009.View ArticleGoogle Scholar
- Nei M. Genetic distance between populations. Am Nat. 1972;106:283–92.View ArticleGoogle Scholar
- Jombart T. Ahmed I. adegenet 1.3-1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27:3070–1.View ArticlePubMedPubMed CentralGoogle Scholar
- Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23:254–67.View ArticlePubMedGoogle Scholar
- Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–64.View ArticlePubMedPubMed CentralGoogle Scholar
- Francis RM. POPHELPER: an R package and web app to analyse and visualise population structure. Mol Ecol Resour. 2017;17:27–32.View ArticlePubMedGoogle Scholar
- NatGeo Mapmaker Interactive database. https://mapmaker.nationalgeographic.org/. Accessed 15 Dec 2017.
- maps: Draw Geographical Maps. https://CRAN.R-project.org/package=maps. Accessed 15 Dec 2017.
- Barbato M, Orozco-terWengel P, Tapio M, Bruford MW. SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front Genet. 2015;6:109.View ArticlePubMedPubMed CentralGoogle Scholar
- Corbin LJ, Liu AY, Bishop SC, Woolliams JA. Estimation of historical effective population size using linkage disequilibria with marker data. J Anim Breed Genet. 2012;129:257–70.View ArticlePubMedGoogle Scholar
- Sved J, Feldman M. Correlation and probability methods for one and two loci. Theor Popul Biol. 1973;4:129–32.View ArticlePubMedGoogle Scholar
- Barbato M, Hailer F, Orozco-terWengel P, Kijas JW, Mereu P, Cabras P, et al. Genomic signatures of adaptive introgression from European mouflon into domestic sheep. Sci Rep. 2017;7:7623.View ArticlePubMedPubMed CentralGoogle Scholar
- R Core Team. R: a language and environment for statistical computing. R Foundation for statistical computing. Vienna, Austria; 2012. http://www.R-project.org.
- Tapio I, Tapio M, Grislis Z, Holm LE, Jeppsson S, Kantanen J, et al. Unfolding of population structure in Baltic sheep breeds using microsatellite analysis. Heredity (Edinb). 2005;94:448–56.View ArticleGoogle Scholar
- Tapio M. Origin and maintenance of genetic diversity in North European sheep. PhD thesis, University of Oulu; 2006.Google Scholar
- Tapio M, Tapio I, Grislis Z, Holm LE, Jeppsson S, Kantanen J, et al. Native breeds demonstrate high contributions to the molecular variation in northern European sheep. Mol Ecol. 2005;14:3951–63.View ArticlePubMedGoogle Scholar
- International Sheep Genomics Consortium. http://www.sheephapmap.org/pag.php. Accessed 20 August 2017.
- Prieur V, Clarke SM, Brito LF, McEwan JC, Lee MA, Brauning R, et al. Estimation of linkage disequilibrium and effective population size in New Zealand sheep using three different methods to create genetic maps. BMC Genet. 2017;18:68.View ArticlePubMedPubMed CentralGoogle Scholar
- Dunin IM, Dankvert AG. Spravochnik porod i tipov sel`skokhozyastvennykh zhivotnykh, razvodimykh v Rossiiskoi Federatsii. Moskva: VNIIPLEM; 2013 (in Russian).Google Scholar
- Taberlet P, Valentini A, Rezaei HR, Naderi S, Pompanon F, Negrini R, et al. Are cattle, sheep, and goats endangered species? Mol Ecol. 2008;17:275–84.View ArticlePubMedGoogle Scholar
- Ryder ML. Sheep and man. London: Gerald Duckworth & Co., Ltd.; 1983.Google Scholar
- Moradi MH, Nejati-Javaremi A, Moradi-Shahrbabak M, Dodds KG, McEwan JC. Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition. BMC Genet. 2012;13:10.View ArticlePubMedPubMed CentralGoogle Scholar
- Lv FH, Peng WF, Yang J, Zhao YX, Li WR, Liu MJ, et al. Mitogenomic meta-analysis identifies two phases of migration in the history of eastern Eurasian sheep. Mol Biol Evol. 2015;32:2515–33.View ArticlePubMedPubMed CentralGoogle Scholar
- Ryder ML. A survey of European primitive breeds of sheep. Ann Genet Sel Anim. 1981;13:381–418.PubMedPubMed CentralGoogle Scholar
- Dýrmundsson ÓR, Niżnikowski R. North European short-tailed breeds of sheep: a review. Animal. 2010;4:1275–82.View ArticlePubMedGoogle Scholar
- Ivanov MF. Ovtsevodstvo. 3rd ed. Moskva: Novaya Derevnya; 1935 (in Russian).Google Scholar
- Yunusbayev B, Metspalu M, Metspalu E, Valeev A, Litvinov S, Valiev R, et al. The genetic legacy of the expansion of Turkic-speaking nomads across Eurasia. PLoS Genet. 2015;11:e1005068.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhao YX, Yang J, Lv FH, Hu XJ, Xie XL, Zhang M, et al. Genomic reconstruction of the history of native sheep reveals the peopling patterns of nomads and the expansion of early pastoralism in East Asia. Mol Biol Evol. 2017;34:2380–95.View ArticlePubMedPubMed CentralGoogle Scholar
- Gadzhiev ZK. Grubosherstye ovtsy Dagestana. Makhatchkala: Stavropolskii NII zhivotnobodstva i kormoproizvodstva; 2010 (in Russian).Google Scholar
- Musalaev K. Sostoyanie I perspectivy razvitiya grubosherstnogo ovtsevodstva i kozovodstva Respubliki Dagestan. Sbornik nauchnykh trudov po materialam mezhdunarodnoi nauchno-prakticheskoi konferencii FGBNU VNIIOK. 2014;3:88–91 (in Russian).Google Scholar
- Tayshin VA, Lkhasaranov BB. Aborigennaya buryatskasya ovtsa. Ulan-Ude: BNC SO RAN; 1997 (in Russian).Google Scholar
- Tayshin VA, Lkhasaranov VV, Shabanova RG. Osnovnye prisnaki otbora aborigennyh buryatskikh. Ovtsy, kozy, sherstyanoe delo. 2001;1:12–4 (in Russian).Google Scholar
- Biltuev SI. Sovremennoe sostoyanie polygrubosherstnogo i grubosherstnogo ovtsevodstva v Respublike Byryatia. Materialy Mezhdunarodnoi nauchno-prakticheskoi konferencii. posvyashennoi 60-letiu Zabaikal`skoi porody ovets. 2016;2016:52–7 (in Russian).Google Scholar
- Averyanov IYA. O proiskhozhdenii karakulskoy ovtsy. Ovtsevodstvo. 1968;5:35–6 (in Russian).Google Scholar
- Ivanov MF. Karakulevodstvo na uge Rossii: Opyt zootekh.-eccon. issled.Poltava: Izdatel`stvo Poltavskogo obtshestva sel`skogo khozyaystva; 1914 (in Russian).Google Scholar
- Ernst LK, Dmitriev NG, Paronyan IA. Geneticheskie resursy sel`skokhozyaistvennykh zhivotnykh v Rossii i sopredel`nykh stranakh. SPB: VNIIGRZH; 1994 (in Russian).Google Scholar
- Drăgănsecu C. An attempt to a filetic classification of Valachian (Zackel) and Tsigai breed. Stocarstv. 1994;48:395–400.Google Scholar
- Drăgănsecu C. Origin and relationships between Valachian and Tsigai sheep from the Danube area. Stocarstvo. 1995;49:321–7.Google Scholar
- Porter V, Alderson L, Hall SJG, Sponenberg DP. Mason’s world encyclopedia of livestock breeds and breeding. 1st ed. Wallingford: CAB International; 2016.View ArticleGoogle Scholar
- Ilişiu E, Dărăban S, Radu R, Pădeanu I, Ilişiu VC, Pascal C, et al. The Romanian Tsigai sheep breed, their potential and the challenges for research. Appl Agric For Res. 2013;2:161–70.Google Scholar
- Ivanov MF. Volosckie Ovta. Moskva: Sochinenie I; 1929 (in Russian).Google Scholar
- Drăgănsecu C. A note on Balkan sheep breeds origin and their taxonomy. Arch Zootech. 2007;10:90–101.Google Scholar
- Kosilov VI, Shkilev PN, Nikonova EA. Produktivnye kachestva ovets raznykh porod na Uzhnom Urale. Moskva: Omega-L; 2014 (in Russian).Google Scholar
- Medvedev MV, Erokhin AI. Otkormocnye i uboinye kachestva ovets kuibyshevskoy porody i ee pomesei s myaso-sherstnymi baranami. Ovtsy, kozy, sherstyanoe delo. 2004;1:29–30 (in Russian).Google Scholar
- Kolosov Y. Sal`skaya poroda ovets–istoria razvitiya i sovershenstvovanie. Sbornik nauchnykh trudov po materialam mezhdunarodnoi nauchno-prakticheskoi konferencii FGBNU VNIIOK. 2014;3:84–8 (in Russian).Google Scholar
- Egorov MV. Sovremennoe sostoyanie ovtsevodstva v Rossiiskoi Federatsii. Mezhdunarodnoi nauchno-prakticheskoi konferencii. posvyashennoi 60-letiu Zabaikal`skoi porody ovets. 2016;2016:13–22 (in Russian).Google Scholar
- Murzina TV, Vershinina VA. Stanovlenie tonkorunnogo ovtsevodsrva i sovremennoe sostoyanie ovets v Zabaikal’skom krae. Informatsionnii bulleten. 2016;1:35–41 (in Russian).Google Scholar