Genome-wide SNP data unveils the globalization of domesticated pigs
- Bin Yang†1,
- Leilei Cui†1,
- Miguel Perez-Enciso3, 4,
- Aleksei Traspov5,
- Richard P. M. A. Crooijmans2,
- Natalia Zinovieva5,
- Lawrence B. Schook6,
- Alan Archibald7,
- Kesinee Gatphayak8,
- Christophe Knorr9,
- Alex Triantafyllidis10,
- Panoraia Alexandri10,
- Gono Semiadi11,
- Olivier Hanotte12,
- Deodália Dias13,
- Peter Dovč14,
- Pekka Uimari15,
- Laura Iacolina16, 17,
- Massimo Scandura17,
- Martien A. M. Groenen2,
- Lusheng Huang1Email author and
- Hendrik-Jan Megens2Email authorView ORCID ID profile
© The Author(s) 2017
Received: 18 January 2017
Accepted: 31 August 2017
Published: 21 September 2017
Pigs were domesticated independently in Eastern and Western Eurasia early during the agricultural revolution, and have since been transported and traded across the globe. Here, we present a worldwide survey on 60K genome-wide single nucleotide polymorphism (SNP) data for 2093 pigs, including 1839 domestic pigs representing 122 local and commercial breeds, 215 wild boars, and 39 out-group suids, from Asia, Europe, America, Oceania and Africa. The aim of this study was to infer global patterns in pig domestication and diversity related to demography, migration, and selection.
A deep phylogeographic division reflects the dichotomy between early domestication centers. In the core Eastern and Western domestication regions, Chinese pigs show differentiation between breeds due to geographic isolation, whereas this is less pronounced in European pigs. The inferred European origin of pigs in the Americas, Africa, and Australia reflects European expansion during the sixteenth to nineteenth centuries. Human-mediated introgression, which is due, in particular, to importing Chinese pigs into the UK during the eighteenth and nineteenth centuries, played an important role in the formation of modern pig breeds. Inbreeding levels vary markedly between populations, from almost no runs of homozygosity (ROH) in a number of Asian wild boar populations, to up to 20% of the genome covered by ROH in a number of Southern European breeds. Commercial populations show moderate ROH statistics. For domesticated pigs and wild boars in Asia and Europe, we identified highly differentiated loci that include candidate genes related to muscle and body development, central nervous system, reproduction, and energy balance, which are putatively under artificial selection.
Key events related to domestication, dispersal, and mixing of pigs from different regions are reflected in the 60K SNP data, including the globalization that has recently become full circle since Chinese pig breeders in the past decades started selecting Western breeds to improve local Chinese pigs. Furthermore, signatures of ongoing and past selection, acting at different times and on different genetic backgrounds, enhance our insight in the mechanism of domestication and selection. The global diversity statistics presented here highlight concerns for maintaining agrodiversity, but also provide a necessary framework for directing genetic conservation.
Domestication of pigs from wild boars occurred independently in Asia and Europe about 10,000 years ago . Due to the biogeographic difference between wild ancestral populations, which results from 1.2 million years of separation, Asian and European pigs are genetically highly divergent [2–5]. Sus scrofa is native to Eurasia and North Africa, but was introduced into other parts of the world, i.e. into the Americas, primarily in its domesticated form, during the time of the European colonization in the sixteenth century, and later in Australia and New Zealand . Both demographic processes, and natural as well as artificial selection, have led to the formation of a multitude of pig breeds around the world that vary in coat color, ear shape, body size, snout bluntness, behavior, growth rate, fatness, and prolificacy and other economically important traits.
In addition to domestication, crossbreeding between Asian and European indigenous pigs mediated by humans are significant landmarks in pig breeding history. Although anecdotal evidence exists even from the classical era, admixture between Western and Eastern pigs only started to become common in the mid- to late eighteenth century . Introduction of Chinese pigs into Britain is documented from then and its aim was to improve the production characteristics of local pigs, which led to the creation of modern breeds such as Yorkshire (i.e., Large White), Berkshire and Hampshire . In the late eighteenth century, Chinese pigs may also have been imported to America, and crossed with local pigs of European ancestry there , although most likely the Asian influence in American village pigs was through crosses with international breeds . Reciprocally, at least since the 1840s, modern breeds such as Berkshire, Hampshire, Russian local pigs, Duroc, Large White and Landrace were introduced into China . Such domestic animals were traded, loaded onto ships and released elsewhere. This is well documented, e.g., during the exploration of the Pacific by Captain Cook, who is credited for having released the first pigs on the New Zealand islands . As in Europe, these imported pigs were used for crossbreeding with local breeds. However, in China, the introduction of pigs from outside and trading of pigs within China, appear to have been less widespread until recently, as is apparent from the high degree of geographic structure that remains in the Chinese traditional pig breeds . Nevertheless, historical records and genetic evidence point to the contribution of European pigs to some East Asian breeds. For instance, the modern Korean Native pig is a cross between a local, traditional Korean pig and Berkshire. More recently, since the 1980s, Chinese pig breeders began programs to improve local breeds using Western stock  by creating synthetic breeds. In Africa, although advocated as an additional center of domestication, most of the evidence points to introgression from foreign breeds. Interestingly, Asian haplotypes predominate in East Africa, whereas European haplotypes predominate in West Africa .
Today, pig is a major livestock species, which in 2012 represented about 36.3% of the total meat production for human societies (www.fao.org), with major contributions from only a few international commercial breeds (i.e. Duroc, Large White, and Landrace). Nevertheless, hundreds of domesticated pig breeds worldwide  are still important for local meat production by small farmers. Many of these pig breeds have unique characteristics that differ from those of the international commercial breeds. Conservation of agrodiversity is one of the pillars to maintain food security, particularly in a rapidly changing world where consolidation of international plant and animal breeds is resulting in an increasingly narrow genetic basis for food production . Thus, indigenous pig breeds, together with their wild relatives, are valued resources for the human society, not only for food, but also as genetic reservoirs. In addition, they constitute cultural and historical value since certain breeds are highly connected to local identity and specific agricultural practices. Finally, breed diversity can be leveraged for understanding the genetic basis of complex traits and adaptive evolution . Large-scale genotyping technologies have enabled the analysis of the genetic ancestry and admixture of many domestic animals, including dogs , cattle , and pigs  and have also enabled the characterization of the genetic basis of phenotypic changes during domestication in chicken , dogs , rabbits  and pigs .
To date, population genetic studies using genomic data in pigs had a limited, usually regional, scope [9, 23–25]. Compared to previous generations of molecular markers, particularly microsatellites, single nucleotide polymorphism (SNP) markers allow for relatively straightforward data integration across studies since SNP genotypes can be compared unambiguously across studies. The aim of the current study was to perform a truly global integration of pig genotype data through the analysis of 1839 domestic pigs from 122 indigenous pig breeds that were collected in 29 different countries, together with 215 wild boars and 39 out-group individuals. As a result, our findings constitute a big leap in understanding the population structure, admixture, demographic history, and characterization of genetic loci involved in the domestication of pigs globally.
Samples and data
Number of populations and samples by continent
We conducted a series of quality control procedures on the raw data using PLINK v1.9 . First, we excluded the breeds with less than five individuals. For the breeds or populations with more than 20 individuals, we randomly removed one individual from a pair of highly related animals (identity by state score > 0.95), and then kept the top 20 samples ranked by the SNP call rate. Next, we removed SNPs with a minor allele frequency (MAF) lower than 0.01, a call rate lower than 90% and individuals with a call rate lower than 90%, which resulted in a dataset of 55,072 SNPs for 2093 individuals that was used to estimate ROH, haplotype diversity and effective population size. We further removed SNPs with a MAF lower than 0.05, and in high linkage disequilibrium (r2 > 0.2) by using—maf 0.05 and—indep-pairwise 50 10 0.2 in PLINK v1.9 , respectively, and generated a dataset of 15,427 SNPs for 2093 individuals for subsequent multi-dimensional scaling (MDS), neighbor joining (NJ) tree and admixture analyses. See Additional file 2: Table S2 for further details.
MDS was carried out using—mds-plot and—cluster options in PLINK v1.9  and visualized by R programming language . The NJ tree was constructed using PHYLIP v3.69  based on the identical by states matrix obtained by PLINK v1.9 , and visualized using FigTree v1.4 (http://beast.bio.ed.ac.uk/figtree). To facilitate visualization, we randomly selected six individuals from each population to build up the NJ tree. The geographical maps were plotted using R package MAPS  and MAPPLOTS (https://cran.r-project.org/web/packages/mapplots/). The coordinates of longitude and latitude of each population were set according to where the pigs were sampled (see Additional file 1: Table S1). The geographical distances between each pair of breeds were computed using distm function in R package GEOSPHERE (https://cran.r-project.org/web/packages/geosphere/). The proportion of mixed ancestry in the populations analyzed was evaluated by the ADMIXTURE 1.22 program . We evaluated different K values with the mixed ancestry model (K = 2 to 17).
Runs of homozygosity, haplotype diversity and effective population size
Runs of homozygosity (ROH) of each breed were identified using PLINK v.1.07 by a 5-Mb sliding window process across the genome with at least 50 SNPs, allowing five missing calls and one heterozygous SNP. The minimum length for ROH was set to 500 kb. ROH statistics were then transformed to FROH. We inferred haplotypes of autosomes for all individuals using SHAPEIT v2 . Haplotype diversity was calculated for populations with a minimum of 10 individuals. For each population, we randomly selected 10 individuals for the analysis. The haplotype diversity of a population was measured as the average number of haplotypes in windows of 5, 10, and 15 SNPs, respectively, which is similar to the method described in . LD between adjacent SNPs was measured by the genotype correlation coefficient (r2) calculated by the—r2—ld-window 99999—ld-window-r2 0 command in PLINK v1.9. We used the same equation to fit the relationship between LD and genetic distance as previously described . The SNPs used to calculate the LD within a population were filtered by applying the following criteria: a MAF higher than 0.05 and a P value for Hardy–Weinberg equilibrium higher than 1 × 10−6. The effective population size was estimated according to Sved , based on the equation r2 = 1/(4Ne c + 1), where r2 is the linkage disequilibrium between a pair of SNPs, Ne is the effective population size, c is the genetic distance in Morgan between a pair of SNPs, which was obtained by multiplying their physical distance and recombination rate . The Ne at generation T, were obtained by the equation T = 1/2c, the same as described in .
To detect loci that may have been selected during domestication, we calculated the fixation index (Fst)  between domestic and wild boars in Asia and Europe, separately. To avoid the influence of introgression, we used 42,808 SNPs (MAF > 0.05) in 782 individuals that have more than 90% of Asian or European ancestry in the analysis. We ranked the Fst values of genome-wide SNPs, and genes within a 100-kb region of high-Fst SNPs were identified as candidate genes that may have been involved in past selection. We selected candidate genes according to their functional relevance to phenotypes, such as e.g. behavior, development, energy metabolism, which may confer differences between domestic pigs and wild boars.
After quality control, a dataset of 2093 samples representing 122 domestic pig breeds (1839 individuals), 19 wild boar populations (215 individuals) and five out-group populations (39 individuals) was available. The 122 domestic breeds included 104 local breeds or synthetic populations and 18 international commercial populations (Duroc, Large White, Landrace and Pietrain from different countries were considered as different breeds) (see Additional file 1: Table S1). Among the 104 local breeds, 39 originated from Europe, 40 from Asia, 22 from the Americas and three from other parts of the world. The wild boars were from widespread regions around the world and the five out-group populations are Babyrousa babyrussa, Sus barbatus, Sus celebensis and Sus verrucosus from the islands of Southeast Asia and Phacochoerus africanus from Africa. A more detailed description of many of these samples was reported in previous studies [9, 24, 25].
Global population structure
The second axis separates domestic pigs from wild boars in both Asia and Europe (Fig. 1) and (see Additional file 3: Figure S1), which indicates that domestication and subsequent artificial selection also resulted in the differentiation between wild and domesticated populations. Regarding the international commercial pig breeds, the Duroc breed tends to cluster with American domestic breeds and to separate from other commercial breeds (Landrace, Large White and Pietrain) that tend to cluster together. These results agree with the fact that Duroc pigs were originally developed in the USA, whereas, the other international commercial breeds (e.g. Large White-England, Landrace-Denmark, and Pietrain-Belgium) originated in Europe.
The results from hierarchical clustering (neighbor joining based on identity by state distance metric) generally agree with those from the MDS analysis. An important observation is that, even on a global scale, all populations of the major commercial breeds, still cluster together, i.e. breed identity has been maintained for both commercial and non-commercial populations (see Additional file 5: Figure S3).
Regional population structures in Asia and Europe
In Europe, the pig breeds from Southern Europe (Italy, Spain and Hungary) and those from North and middle Europe (Netherlands, Sweden, Poland, Germany and the Czech Republic) were genetically distinct. This distinction is represented by the first axis (Fig. 2d, e). Within the UK, British Lop, Leicoma, Middle White and Welsh breeds differ from Large Black, Gloucester Old Spot, Saddleback and Tamworth breeds. However, we observed no correlation with geographical distances among pig breeds in Europe (Fig. 2f), which is consistent with previous results based on microsatellites . The absence of population structure in European pigs is explained, at least in part, by the Asian introgression and subsequent influence of highly productive “international” breeds on local pig diversity. In Europe, many ‘local’ or ‘traditional’ breeds have effectively become (partially) extinct due to such extensive crossbreeding [36, 39, 40].
Global genetic ancestries
Admixture between Asian and European ancestries
An additional interesting pattern is the widespread admixed ancestries that we observed in pig populations across the different continents. Note that many breeds showed evidence of both Asian and European ancestries. Since these two lineages evolved independently, this admixture is a result of human-mediated activities. In Europe, importation of Chinese pigs is well accredited since the eighteenth century.
In Asia, 23 (57.5%) of the 40 breeds analyzed have 5% or more inferred European ancestry (see Additional file 4: Figure S2 and Additional file 6: Figure S4), (K = 2). Most of the European introgression was inferred to be from Duroc, Landrace, Large White, Berkshire and Hampshire breeds (see Additional file 4: Figure S2 and Additional file 6: Figure S4), (K = 13). Eight Asian breeds have more than 20% of genomic introgression from European pigs. These breeds include a Korean local breed (KPKO), a Thailand local breed (THCD), and six breeds from China (Lichahei (CNLC) from Shandong Province, Sutai (CNST) from Jiangsu Province, Kele (CNKL) and Guanling (CNGU) pigs from Guizhou Province, Leanhua (CNLAH) from Jiangxi Province, Minzhu (CNMZ) from Northeast China. Among these breeds with a large degree of European introgression (>20%), the Korean pig, a known East–West synthetic breed, formed the largest European ancestry group (see Additional file 4: Figure S2 and Additional file 6: Figure S4), (K = 2 and K = 13), including ancestry from Berkshire, Hampshire, Landrace and Duroc, which reflects the complex breeding history of this breed (Fig. 3a) and (see Additional file 4: Figure S2) (K = 13 and K = 17). This is largely in line with the known origin of the Korean local pigs. Sutai and Lichahei have been mainly admixed with Duroc, while Min pigs have a considerable contribution from Berkshire (Fig. 3a) and (see Additional file 4: Figure S2 and Additional file 6: Figure S4) (K = 8 and K = 17). It is interesting that the admixture with European pigs occurred mainly in Western and Northern Chinese pig breeds (Gongbujiangda (CNXZ) and Milin Tibetan (CNML) pigs, Kele and Guanling pigs from Guizhou Province, Mingguangxiaoer (CNMG) pigs from Yunnan province, Bamei (CNBM) pigs from Gansu province, Laiwuhei (CNLH) pigs from Shandong Province, Hetaodaer (CNHT) from Inner Mongolia and Min (CNMZ) pigs from Heilongjiang Province); the European ancestries that are involved encompass Large White, Landrace, Berkshire or other European breeds (Fig. 3a–c) and (see Additional file 4: Figure S2 and Additional file 6: Figure S4) (K = 13 and K = 17). In comparison, the pig breeds from South and Central China, including Erhualian, Xiang, Dongshan, Shaziling, Congjiangxiang, Lantang, Jinhua, Litang Tibetan and Luchuan, show no or negligible introgression from European pigs (Fig. 3a, c).
Iberian pigs from Spain, Cinta Senese and Nera Siciliana pigs from Italy, and Mangalica pigs from Hungary showed little evidence of influence from Asian pigs (see Additional file 4: Figure S2 and Additional file 7: Figure S5). By contrast, there is evidence of introgression from Asian pigs for all other pig breeds from Europe, including those from Ukraine and Russia (see Additional file 4: Figure S2 and Additional file 7: Figure S5). These results confirm the widespread Asian influences in European breeds.
The North and South American samples consisted mainly of village and feral pigs from eight countries . Consistent with previous studies on pigs from the Americas using 60K SNP , and mitochondrial DNA data , pig populations from rural areas have mosaic genetic compositions that consist of multiple ancestries from both Europe and Asia. The largest ancestry components were similar to Iberian pigs (ESIB), in agreement with a primigenious origin from the Iberian Peninsula. Other European components are related to Duroc, Landrace, Berkshire, Hampshire, and European Wild boars (Fig. 3a) and (see Additional file 4: Figure S2 and Additional file 8: Figure S6). Intriguingly, a considerable contribution from pigs from both east and south China was observed in most of the American Village pigs (Additional file 8: Figure S6). In general, the village pigs from Brazil, Mexico and Cuba have larger Asian components than the pigs from other American countries (Fig. 3a).
In Africa, the Tunisian wild boar shows a high degree of similarity to the European wild boar, and specifically to the wild boar from the Iberian Peninsula. A local breed from Kenya was inferred to contain both Asian and European ancestries (Fig. 3a) and (see Additional file 4: Figure S2 and Additional file 8: Figure S6). This is in agreement with mtDNA studies, which showed that Asian haplotypes were abundant in East Africa but completely absent in the Northern African pigs (i.e. Tunisian wild boars) .
The four major international commercial pig breeds, i.e. Duroc, Landrace, Large White, and Pietrain, have a considerable percentage of Asian ancestry (see Additional file 4: Figure S2 and Additional file 9: Figure S7) (K = 2). At K = 8, these four breeds form four distinct ancestries. The Landrace and Large White breeds also showed a diversity of genetic ancestries related to various pig populations including Berkshire, Hampshire, South European local pigs or Asian pigs.
Loci involved in domestication
Candidate genes for domestication loci in Asia and Europe
Muscle growth/differentiation 
Growth, energy homeostasis 
Personality traits angry/hostility 
Development of brain 
Insulin signaling pathways 
Food intake and energy balance 
Pubertal delay 
Precocious puberty 
Perception of smell 
Central nervous system 
Central nervous system 
Cardiac LDL-cholesterol 
Protein catabolic process 
Glucose homeostasis 
Vertebrate development 
Autism spectrum disorder 
Immune response 
Cholesterol synthesis 
Pig is one of the most important livestock species for humans as a valued, global, resource for meat production and as an excellent animal model to understand the genetic mechanisms that underlie complex traits [6, 18]. Its long domestication history, originating from a large diversity of wild ancestors throughout Eurasia, and selection for economic and cultural purposes have resulted in a large number of breeds globally, which show a wide phenotypic diversity. Our worldwide survey on SNP data from 122 breeds/populations and 215 wild boars worldwide, reveals genetic ancestries, introgression and inbreeding histories of pigs at a global scale and at an unprecedented detail. Although there are potential issues regarding ascertainment bias  associated with the SNP assay used in this study, admixture analyses using 60K SNP and 30 million SNPs called from whole-genome sequence data provided very similar results (see Additional file 14: Figure S12), which indicates that robust conclusions can be drawn from the 60K SNP assay data for a wide population study as presented here.
The high degree of geographic structure observed here in the Asian domesticated pigs agrees with a previous report based on microsatellite markers , and differs substantially from that observed in pig populations from Europe and the Americas, in which almost no correlation between genetic and geographical distances exist . The strong concordances between genetic and geographical distances for the pig populations in Asia may be attributed to the fact that pig populations within certain eco-geographical regions are more likely to have common ancestries, and that most of the breeds in Asia did not migrate over large distances. Furthermore, introgressions from European populations did not mask the identity of most Asian breeds, at least not to a large extent. Removing the breeds with more than 20% European introgression resulted in an increased correlation between genetic and geographical distances (see Additional file: 15 Figure S13). This indicates that admixture with geographically distant populations could be a major force in breaking regional genetic-geography concordance, as has been the case in Europe. Recent breed interchanges have largely masked an underlying geographic signal. However, it is interesting to note that some breeds have remained relatively unchanged for centuries. For instance, Ramirez et al.  showed that the modern Iberian breed is genetically very similar to a sixteenth century Spanish pig.
Contribution of Chinese populations to worldwide pigs
The Chinese ancestries in European pigs observed in this study confirmed, on a broader population scale, the findings of previous genetic studies [2, 23, 41, 63]. These results are consistent with the historical record that South Chinese pigs were brought to England from Guangzhou in South China, the only treaty port city in China at that time, and have contributed to local British breeds, such as Berkshire and Yorkshire around 200 years ago [7, 8, 63]. Interestingly, our analyses revealed that ancestry represented by Lantang pigs from Guangdong Province is likely the major source of introgression in American pigs (Fig. 3a). The Meishan pigs of Eastern China, a breed famous for its high prolificacy, were imported to Western countries including France, England and USA in the 1980s . Meishan pigs were used in experimental crosses to study the genetic basis of complex traits . Recent studies showed that many Asian alleles with favorable phenotypic effects reached a high frequency in European pigs. These included MC1R alleles that are associated with black coat color , an IGF2 allele for muscle growth , and AHR alleles for sow reproduction traits . These studies underscored the importance of Asian pigs as vital genetic resources for international pig breeding and pork production.
Contribution of European ancestry to worldwide pig populations
Both MDS and admixture analysis showed that European pigs were the major contributors to pig populations in those regions of the world where Sus scrofa does not occur natively (America, Africa, and Oceania). These results are consistent with mitochondrial and Y chromosome polymorphisms . This contribution is due, in part, to the waves of colonization by Europeans since the sixteenth century. In addition, recent increase in worldwide trading of commercial, improved, pigs throughout the globe, and the desire of local farmers to improve their pigs using these western breeds, have likely contributed to this process as well. Populations in the Americas, Africa, and Oceania tend to harbor multiple ancestries of Mediterranean countries and/or international commercial breeds such as Berkshire, Hampshire, and Duroc (Fig. 3), which indicates a very dynamic process of global mixing of populations during several centuries.
More recently, the global process of mixing has become ‘full circle’ by the introduction of European pigs, themselves heavily influenced by Asian pigs, in Asia. In fact, one of the main original findings of our study is the widespread European influence in many Asian populations, the extent of which was mostly unknown until now. In Asia, we observed widespread and complex gene flow from European pigs, which indicates that many Asian indigenous pig breeds are no longer strictly Asian, but also contain a genetic component of European origin. Occurrence of European introgression in Japan , Korea  and Vietnam  was reported before. In China, there are over 80 pig breeds and a high diversity in phenotypes . Historical documents indicate at least three waves of introgression from European pigs since the 1840s. The first wave of introgression may have occurred around the 1840s, when European pig breeds including Berkshire, Large White, Duroc, pigs from Russia, and Tamworth were brought to China by Germans and Japanese . Subsequently, starting from the early twentieth century, probably since the 1910s, large-scale importation of Western European pigs, such as Berkshire in the Hebei, Sichuan, and Jiangsu provinces, and of Russian pigs in Northwest China, took place to improve local breeds. This study demonstrates that many pig breeds from West and North China contain ancestries from European pigs, notably Berkshire, Hampshire, Large White and Russian pigs, which is in agreement with historical records. Since 1937, war and civic and economic upheavals hampered systematic breeding, which may explain why most of the pig populations in China maintained their geographic identity in spite of admixture with European pigs. Since the 1980s, due to changes in Chinese policies regarding the introduction of foreign agricultural germplasms, many international commercial breeds were introduced into China, which gave rise to several synthetic breeds, such as the Lichahei from Shandong Province, and Sutai pigs from Jiangsu Province. Both breeds currently display considerable ancestry from Duroc.
Indications for conservation of indigenous pigs
Inbreeding and decrease in effective population size may reduce the fitness of a population in response to challenges from changing environments or infectious diseases. This study provides an overview on the inbreeding, demography and admixture history of pig populations worldwide. First, our analysis revealed that 40 breeds or populations have substantial cumulative ROH (>200 Mb), and also exhibit low haplotype diversity, which indicate that these populations underwent recent inbreeding. This may reflect the fact that all domesticated and many wild populations are de facto under population management, deliberate or not. Second, we show that many of the indigenous pig breeds have smaller Ne than those of international commercial breeds. Since the commercial breeds are also the breeds that are the most admixed, this is not surprising. Finally, we found prevalent admixture of Asian and European ancestries in the indigenous pig populations, which suggests that many breeds have become less representative of the original local ancestries. The admixture of populations, particularly between East and West, has resulted in a re-shaping of the nucleotide diversity in the genomes of modern pig breeds. Because of that, only some of the current least admixed breeds may represent the original nucleotide and haplotype diversity in Europe. These results could help to make decisions on the conservation and management of pig populations. For example, Mangalica pigs from Hungary (HUMA) and Mora Romagnola pigs from Italy (ITMR) present the most extensive ROH in their genomes and the highest European ancestry, which indicate that these two breeds have undergone intensive inbreeding, and require special attention regarding conservation measures.
Genetic basis of domestication
Domestication of plants and animals has been one of the major transitions in human history. Farming practices have not only altered the human societies but the interactions with nature, especially for domesticated plants and animals. Domestication of pigs has led to dramatic phenotypic changes transforming the wild boar into pigs by altering their behavior, morphology, coat color, reproduction and physiology. The admixture and MDS analyses presented in this study confirm the close relationship between wild and domesticated Sus scrofa in the geographic areas where domestication took place. Therefore, the genomic regions that show a much higher than average differentiation between wild and domesticated pigs should be enriched for loci under selection during domestication. We identified a number of genes that are located near loci with extreme Fst values and that have functions that match the phenotypic changes from wild boars to domestic pigs. For instance, domestic pigs receive a stable feed supply from humans, while wild boars need to endure starvation if they cannot find food in the wild. We identified a number of genes with functions related to energy balance and metabolism (NUM, LEP FOXA1 and INSIG2), which could have contributed to the adaptation of pigs to food scarcity or abundance. Genes involved in growth (MSTN and SOX2-OT) and reproduction (GNRHR, PATZ1, ESR1, and PLSCR4) could be associated with improved meat production and reproduction traits in domestic pigs that have undergone strong artificial selection. Lastly, genes related to nervous system and behavior (TBX19, LRRC4, VEPH1 and CDH9) could be associated with changes in the behavior of domestic pigs compared to wild boars. The absence of signatures of selection found in previous studies [22–24] can be attributed to the higher density of SNPs, the specific selection-detection method, or the application of specific population contrasts in those studies. While further studies are needed to validate the role of the genes that we identified here in the domestication process, our findings confirm that this long-standing genetic experiment—i.e. domestication—is continuing to yield insights into biology and evolution.
We present the largest population study on pigs and their wild ancestors to date, which investigates the population structure and introgression of worldwide pig populations globally. We demonstrate regional and global mixing of pig diversity, which reflect that this species has essentially followed many of the globalization events over the past centuries. Population diversity statistics such as ROH provided insight on inbreeding history and effective population sizes that allow us to recommend guidelines for breeding and conservation programs. Similar to other domesticated species, pigs represent an excellent model to study adaptation. We have identified a number of candidate genes that could have been under positive selection during domestication.
LH, MAMG, HJM and BY conceived and coordinated the study. HJM, LH, MAMG, MPE, ZN, RC, LI, MS, SG, PU, PD DD, AD, OH, GS, CK, GL, AA, LBS, ATri, PA and ATra provided the samples and generated the data. LC, BY and HJM analyzed the data. BY, HJM, LH, MPE and MAMG interpreted the results. BY and LC wrote the manuscript. LH, MAMG, MPE and HJM revised the manuscript. All authors read and approved the final manuscript.
We thank Marco Apollonio from Sassari University for contributing to the sampling of Italian wild boars, and Greger Larson from Oxford University for valuable comments on the manuscript.
The authors declare that they have no competing interests.
Availability of data and materials
All authors declare that animal samples were obtained according to local/national laws at the time of sampling. Exchange of samples and data was in accordance with national and international regulations, and approved by data and sample owners.
This study is supported by National Production Technology System for the Pig Industry in China (nycytx-008) and Outstanding Talents and Innovation Team of Agricultural Science (2011-81) to LH, AGL2010-14822 and AGL2013-41834-R (Ministry of Economy and Science, Spain) to MPE. LI received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No. 656697. HJM received funding from the IMAGE project (Horizon 2020, No. 677353).
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Larson G, Dobney K, Albarella U, Fang M, Matisoo-Smith E, Robins J, et al. Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science. 2005;307:1618–21.View ArticlePubMedGoogle Scholar
- Giuffra E, Kijas JM, Amarger V, Carlborg O, Jeon JT, Andersson L. The origin of the domestic pig: independent domestication and subsequent introgression. Genetics. 2000;154:1785–91.PubMedPubMed CentralGoogle Scholar
- Megens HJ, Crooijmans RP, San Cristobal M, Hui X, Li N, Groenen MA. Biodiversity of pig breeds from China and Europe estimated from pooled DNA samples: differences in microsatellite variation between two areas of domestication. Genet Sel Evol. 2008;40:103–28.PubMedPubMed CentralGoogle Scholar
- Ai H, Fang X, Yang B, Huang Z, Chen H, Mao L, et al. Adaptation and possible ancient interspecies introgression in pigs identified by whole-genome sequencing. Nat Genet. 2015;47:217–25.View ArticlePubMedGoogle Scholar
- Frantz LA, Schraiber JG, Madsen O, Megens HJ, Bosse M, Paudel Y, et al. Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus. Genome Biol. 2013;14:R107.View ArticlePubMedPubMed CentralGoogle Scholar
- Rothschild MF, Ruvinsky A. The genetics of the pig. 2nd ed. Wallingford: CAB International; 2001.Google Scholar
- Darwin C. The variation of animals and plants under domestication. London: John Murray; 1875.Google Scholar
- White S. From globalized pig breeds to capitalist pigs: a study in animal cultures and evolutionary history. Environ Hist Durh NC. 2011;16:94–120.View ArticleGoogle Scholar
- Burgos-Paz W, Souza CA, Megens HJ, Ramayo-Caldas Y, Melo M, Lemus-Flores C, et al. Porcine colonization of the Americas: a 60k SNP story. Heredity (Edinb). 2013;110:321–30.View ArticleGoogle Scholar
- Xu W. Introduction and domestication of European breeds of pig in modern China. Anc Mod Agric. 2004;1:54–62.Google Scholar
- Gascoigne J. Captain cook: voyager between worlds. London: Hambledon Continuum; 2007.Google Scholar
- Noce A, Amills M, Manunza A, Muwanika V, Muhangi D, Aliro T, et al. East African pigs have a complex Indian, Far Eastern and Western ancestry. Anim Genet. 2015;46:433–6.View ArticlePubMedGoogle Scholar
- Chen K, Baxter T, Muir WM, Groenen MA, Schook LB. Genetic resources, genome mapping and evolutionary genomics of the pig (Sus scrofa). Int J Biol Sci. 2007;3:153–65.View ArticlePubMedPubMed CentralGoogle Scholar
- Godfray HC, Beddington JR, Crute IR, Haddad L, Lawrence D, Muir JF, et al. Food security: the challenge of feeding 9 billion people. Science. 2010;327:812–8.View ArticlePubMedGoogle Scholar
- Bosse M, Megens HJ, Madsen O, Paudel Y, Frantz LA, Schook LB, et al. Regions of homozygosity in the porcine genome: consequence of demography and the recombination landscape. PLoS Genet. 2012;8:e1003100.View ArticlePubMedPubMed CentralGoogle Scholar
- Vonholdt BM, Pollinger JP, Lohmueller KE, Han E, Parker HG, Quignon P, et al. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature. 2010;464:898–902.View ArticlePubMedPubMed CentralGoogle Scholar
- Decker JE, McKay SD, Rolf MM, Kim J, Molina Alcala A, Sonstegard TS, et al. Worldwide patterns of ancestry, divergence, and admixture in domesticated cattle. PLoS Genet. 2014;10:e1004254.View ArticlePubMedPubMed CentralGoogle Scholar
- Groenen MA, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, et al. Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012;491:393–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Rubin CJ, Zody MC, Eriksson J, Meadows JR, Sherwood E, Webster MT, et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010;464:587–91.View ArticlePubMedGoogle Scholar
- Axelsson E, Ratnakumar A, Arendt ML, Maqbool K, Webster MT, Perloski M, et al. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature. 2013;495:360–4.View ArticlePubMedGoogle Scholar
- Carneiro M, Rubin CJ, Di Palma F, Albert FW, Alfoldi J, Barrio AM, et al. Rabbit genome analysis reveals a polygenic basis for phenotypic change during domestication. Science. 2014;345:1074–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Rubin CJ, Megens HJ, Martinez Barrio A, Maqbool K, Sayyab S, Schwochow D, et al. Strong signatures of selection in the domestic pig genome. Proc Natl Acad Sci USA. 2012;109:19529–36.View ArticlePubMedPubMed CentralGoogle Scholar
- Bosse M, Megens HJ, Frantz LA, Madsen O, Larson G, Paudel Y, et al. Genomic analysis reveals selection for Asian genes in European pigs following human-mediated introgression. Nat Commun. 2014;5:4392.View ArticlePubMedPubMed CentralGoogle Scholar
- Wilkinson S, Lu ZH, Megens HJ, Archibald AL, Haley C, Jackson IJ, et al. Signatures of diversifying selection in European pig breeds. PLoS Genet. 2013;9:e1003453.View ArticlePubMedPubMed CentralGoogle Scholar
- Ai H, Yang B, Li J, Xie X, Chen H, Ren J. Population history and genomic signatures for high-altitude adaptation in Tibetan pigs. BMC Genomics. 2014;15:834.View ArticlePubMedPubMed CentralGoogle Scholar
- Ramos AM, Crooijmans RP, Affara NA, Amaral AJ, Archibald AL, et al. Design of a high density SNP genotyping assay in the pig using SNPs identified and characterized by next generation sequencing technology. PLoS One. 2009;4:e6524.View ArticlePubMedPubMed CentralGoogle Scholar
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.View ArticlePubMedPubMed CentralGoogle Scholar
- R Core Team. A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2013.Google Scholar
- Felsenstein J. PHYLIP—phylogeny inference package. Cladistics. 1989;5:164–6.Google Scholar
- Becker RA, Wilkes AR. Maps in S. AT\&T Bell Laboratories Statistics Research Report. 1993;96:93.2.Google Scholar
- Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–64.View ArticlePubMedPubMed CentralGoogle Scholar
- Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2012;9:179–81.View ArticleGoogle Scholar
- Amaral AJ, Megens HJ, Crooijmans RP, Heuven HC, Groenen MA. Linkage disequilibrium decay and haplotype block structure in the pig. Genetics. 2008;179:569–79.View ArticlePubMedPubMed CentralGoogle Scholar
- Sved JA. Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theor Popul Biol. 1971;2:125–41.View ArticlePubMedGoogle Scholar
- Tortereau F, Servin B, Frantz L, Megens HJ, Milan D, Rohrer G, et al. A high density recombination map of the pig reveals a correlation between sex-specific recombination and GC content. BMC Genomics. 2012;13:586.View ArticlePubMedPubMed CentralGoogle Scholar
- Herrero-Medrano JM, Megens HJ, Groenen MA, Ramis G, Bosse M, Perez-Enciso M, et al. Conservation genomic analysis of domestic and wild pig populations from the Iberian Peninsula. BMC Genet. 2013;14:106.View ArticlePubMedPubMed CentralGoogle Scholar
- Cockerham BS, Wa CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:14.Google Scholar
- Zhang Z. Pig breeds in China. Shanghai: Shanghai Scientific and Technical Publishers; 1986.Google Scholar
- Herrero-Medrano JM, Megens HJ, Crooijmans RP, Abellaneda JM, Ramis G. Farm-by-farm analysis of microsatellite, mtDNA and SNP genotype data reveals inbreeding and crossbreeding as threats to the survival of a native Spanish pig breed. Anim Genet. 2013;44:259–66.View ArticlePubMedGoogle Scholar
- Herrero-Medrano JM, Megens HJ, Groenen MA, Bosse M, Perez-Enciso M, Crooijmans RP. Whole-genome sequence analysis reveals differences in population management and selection of European low-input pig breeds. BMC Genomics. 2014;15:601.View ArticlePubMedPubMed CentralGoogle Scholar
- Ramirez O, Ojeda A, Tomas A, Gallardo D, Huang LS, Folch JM, et al. Integrating Y-chromosome, mitochondrial, and autosomal data to analyze the origin of pig breeds. Mol Biol Evol. 2009;26:2061–72.View ArticlePubMedGoogle Scholar
- Wagner KR, McPherron AC, Winik N, Lee SJ. Loss of myostatin attenuates severity of muscular dystrophy in mdx mice. Ann Neurol. 2002;52:832–6.View ArticlePubMedGoogle Scholar
- Graham ES, Turnbull Y, Fotheringham P, Nilaweera K, Mercer JG, Morgan PJ, et al. Neuromedin U and Neuromedin U receptor-2 expression in the mouse and rat hypothalamus: effects of nutritional status. J Neurochem. 2003;87:1165–73.View ArticlePubMedGoogle Scholar
- Mankowska M, Szydlowski M, Salamon S, Bartz M, Switonski M. Novel polymorphisms in porcine 3′UTR of the leptin gene, including a rare variant within target sequence for MIR-9 gene in Duroc breed, not associated with production traits. Anim Biotechnol. 2015;26:156–63.View ArticlePubMedGoogle Scholar
- Waraich RS, Weigert C, Kalbacher H, Hennige AM, Lutz SZ, Häring HU, et al. Phosphorylation of Ser357 of rat insulin receptor substrate-1 mediates adverse effects of protein kinase C-delta on insulin action in skeletal muscle cells. J Biol Chem. 2008;283:11226–33.View ArticlePubMedGoogle Scholar
- Wasserman D, Geijer T, Sokolowski M, Rozanov V, Wasserman J. Genetic variation in the hypothalamic-pituitary-adrenocortical axis regulatory factor, T-box 19, and the angry/hostility personality trait. Genes Brain Behav. 2007;6:321–8.View ArticlePubMedGoogle Scholar
- Adachi H, Tsujimoto M, Hattori M, Arai H, Inoue K. cDNA cloning of human cytosolic platelet-activating factor acetylhydrolase gamma-subunit and its mRNA expression in human tissues. Biochem Biophys Res Commun. 1995;214:180–7.View ArticlePubMedGoogle Scholar
- Beneduzzi D, Trarbach EB, Min L, Jorge AA, Garmes HM, Renk AC, et al. Role of gonadotropin-releasing hormone receptor mutations in patients with a wide spectrum of pubertal delay. Fertil Steril. 2014;102:838–46.View ArticlePubMedPubMed CentralGoogle Scholar
- Luo Y, Liu Q, Lei X, Wen Y, Yang YL, Zhang R, et al. Association of estrogen receptor gene polymorphisms with human precocious puberty: a systematic review and meta-analysis. Gynecol Endocrinol. 2015;31:516–21.View ArticlePubMedGoogle Scholar
- Yang WL, Ravatn R, Kudoh K, Alabanza L, Chin KV. Interaction of the regulatory subunit of the cAMP-dependent protein kinase with PATZ1 (ZNF278). Biochem Biophys Res Commun. 2010;391:1318–23.View ArticlePubMedGoogle Scholar
- Young JM, Shykind BM, Lane RP, Tonnes-Priddy L, Ross JA, Walker M, et al. Odorant receptor expressed sequence tags demonstrate olfactory expression of over 400 genes, extensive alternate splicing and unequal expression levels. Genome Biol. 2003;4:R71.View ArticlePubMedPubMed CentralGoogle Scholar
- Amaral PP, Neyt C, Wilkins SJ, Askarian-Amiri ME, Sunkin SM, Perkins AC, et al. Complex architecture and regulated expression of the Sox2ot locus during vertebrate development. RNA. 2009;15:2013–27.View ArticlePubMedPubMed CentralGoogle Scholar
- Shen T, Zhu Y, Patel J, Ruan Y, Chen B, Zhao G, et al. T-box20 suppresses oxidized low-density lipoprotein-induced human vascular endothelial cell injury by upregulation of PPAR-gamma. Cell Physiol Biochem. 2013;32:1137–50.View ArticlePubMedGoogle Scholar
- Wang M, Bridges JP, Na CL, Xu Y, Weaver TE. Meckel-Gruber syndrome protein MKS3 is required for endoplasmic reticulum-associated degradation of surfactant protein C. J Biol Chem. 2009;284:33377–83.View ArticlePubMedPubMed CentralGoogle Scholar
- Kaestner KH, Katz J, Liu Y, Drucker DJ, Schutz G. Inactivation of the winged helix transcription factor HNF3alpha affects glucose homeostasis and islet glucagon gene expression in vivo. Genes Dev. 1999;13:495–504.View ArticlePubMedPubMed CentralGoogle Scholar
- Yabe D, Brown MS, Goldstein JL. Insig-2, a second endoplasmic reticulum protein that binds SCAP and blocks export of sterol regulatory element-binding proteins. Proc Natl Acad Sci USA. 2002;99:12753–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang Q, Wang J, Fan S, Wang L, Cao L, Tang K, et al. Expression and functional characterization of LRRC4, a novel brain-specific member of the LRR superfamily. FEBS Lett. 2005;579:3674–82.View ArticlePubMedGoogle Scholar
- Muto E, Tabata Y, Taneda T, Aoki Y, Muto A, Arai K, et al. Identification and characterization of Veph, a novel gene encoding a PH domain-containing protein expressed in the developing central nervous system of vertebrates. Biochimie. 2004;86:523–31.View ArticlePubMedGoogle Scholar
- Wang K, Zhang H, Ma D, Bucan M, Glessner JT, Abrahams BS, et al. Common genetic variants on 5p14.1 associate with autism spectrum disorders. Nature. 2009;459:528–33.View ArticlePubMedPubMed CentralGoogle Scholar
- Tan J, Pieper K, Piccoli L, Abdi A, Foglierini M, Geiger R, et al. A LAIR1 insertion generates broadly reactive antibodies against malaria variant antigens. Nature. 2016;529:105–9.View ArticlePubMedGoogle Scholar
- Onteru SK, Fan B, Du ZQ, Garrick DJ, Stalder KJ, Rothschild MF. A whole-genome association study for pig reproductive traits. Anim Genet. 2012;43:18–26.View ArticlePubMedGoogle Scholar
- Ramirez O, Burgos-Paz W, Casas E, Ballester M, Bianco E, Olalde I, et al. Genome data from a sixteenth century pig illuminate modern breed relationships. Heredity (Edinb). 2015;114:175–84.View ArticleGoogle Scholar
- Fang M, Andersson L. Mitochondrial diversity in European and Chinese pigs is consistent with population expansions that occurred prior to domestication. Proc Biol Sci. 2006;273:1803–10.View ArticlePubMedPubMed CentralGoogle Scholar
- China National Commission of Animal Genetic Resources. Animal genetic resources in China pigs. Beijing: China Agricultural Press; 2011.Google Scholar
- Rothschild M, Jacobson C, Vaske D, Tuggle C, Wang L, et al. The estrogen receptor locus is associated with a major gene influencing litter size in pigs. Proc Natl Acad Sci USA. 1996;93:201–5.View ArticlePubMedPubMed CentralGoogle Scholar
- Kijas JM, Wales R, Tornsten A, Chardon P, Moller M, Andersson L. Melanocortin receptor 1 (MC1R) mutations and coat color in pigs. Genetics. 1998;150:1177–85.PubMedPubMed CentralGoogle Scholar
- Ojeda A, Huang LS, Ren J, Angiolillo A, Cho IC, Soto H, et al. Selection in the making: a worldwide survey of haplotypic diversity around a causative mutation in porcine IGF2. Genetics. 2008;178:1639–52.View ArticlePubMedPubMed CentralGoogle Scholar
- Murakami K, Yoshikawa S, Konishi S, Ueno Y, Watanabe S, Mizoguchi Y. Evaluation of genetic introgression from domesticated pigs into the Ryukyu wild boar population on Iriomote Island in Japan. Anim Genet. 2014;45:517–23.View ArticlePubMedGoogle Scholar
- Edea Z, Kim SW, Lee KT, Kim TH, Kim KS. Genetic Structure of and evidence for admixture between Western and Korean native pig breeds revealed by single nucleotide polymorphisms. Asian-Aust J Anim Sci. 2014;27:1263–9.View ArticleGoogle Scholar
- Pham LD, Do DN, Nam LQ, Van Ba N, Minh LT, Hoan TX, et al. Molecular genetic diversity and genetic structure of Vietnamese indigenous pig populations. J Anim Breed Genet. 2014;131:379–86.View ArticlePubMedGoogle Scholar