Skip to main content

Identification of functional mutations associated with environmental variance of litter size in rabbits

Abstract

Background

Environmental variance (VE) is partly under genetic control and has recently been proposed as a measure of resilience. Unravelling the genetic background of the VE of complex traits could help to improve resilience of livestock and stabilize their production across farming systems. The objective of this study was to identify genes and functional mutations associated with variation in VE of litter size (LS) in rabbits. To achieve this, we combined the results of a genome-wide association study (GWAS) and a whole-genome sequencing (WGS) analysis using data from two divergently selected rabbit lines for high and low VE of LS. These lines differ in terms of biomarkers of immune response and mortality. Moreover, rabbits with a lower VE of LS were found to be more resilient to infections than animals with a higher VE of LS.

Results

By using two GWAS approaches (single-marker regression and Bayesian multiple-marker regression), we identified four genomic regions associated with VE of LS, on chromosomes 3, 7, 10, and 14. We detected 38 genes in the associated genomic regions and, using WGS, we identified 129 variants in the splicing, UTR, and coding (missense and frameshift effects) regions of 16 of these 38 genes. These genes were related to the immune system, the development of sensory structures, and stress responses. All of these variants (except one) segregated in one of the rabbit lines and were absent (n = 91) or fixed in the other one (n = 37). The fixed variants were in the HDAC9, ITGB8, MIS18A, ENSOCUG00000021276 and URB1 genes. We also identified a 1-bp deletion in the 3′UTR region of the HUNK gene that was fixed in the low VE line and absent in the high VE line.

Conclusions

This is the first study that combines GWAS and WGS analyses to study the genetic basis of VE. The new candidate genes and functional mutations identified in this study suggest that the VE of LS is under the control of functions related to the immune system, stress response, and the nervous system. These findings could also explain differences in resilience between rabbits with homogeneous and heterogeneous VE of litter size.

Background

Understanding the effect of the environment on the phenotype of farm animals is important to improve responses to genetic selection. The environment can affect both the mean of a trait and its variance (environmental variance or \({\text{V}}_{\text{E}}\)). Many studies in various species have provided statistical evidence that \({\text{V}}_{\text{E}}\) is partly under genetic control: pigs [1], mice [2], chickens [3], snails (Helix aspersa) [4] and cattle [5], among others. For instance, the \({\text{V}}_{\text{E}}\) can differ between genotypes under the same environment [6]. Successful divergent selection experiments for \({\text{V}}_{\text{E}}\) support these findings in both livestock and model animals [7,8,9].

Recently, \({\text{V}}_{\text{E}}\) was proposed as a measure of resilience [10], which is the ability of an animal to maintain or quickly recover their performance in spite of environmental perturbations [11, 12]. Previous genome-wide association studies (GWAS) for \({\text{V}}_{\text{E}}\) have identified relevant contributions from candidate genes that are related to important phases of the inflammatory response, such as Hsp90 [13, 14], p22-PHOX, GNG11, and GNGT1 [15], which are triggered by tissue damage and the entry of pathogens [16]. In humans, the FTO gene, which affects the variability of body mass index [17], was also found to be associated with sensitivity to infections [18]. All these results support the role of the immune system in the detection and response to environmental perturbations such as pathogen infections [12].

Unravelling the genetic background of the \({\text{V}}_{\text{E}}\) of complex traits could help to improve resilience of livestock and stabilize their production across farming systems [19]. The aim of this study was to identify genes and functional mutations associated with variation in the \({\text{V}}_{\text{E}}\) of litter size (LS) in rabbits. We performed a GWAS and a whole-genome sequencing (WGS) analysis using data from two rabbit lines that have been divergently selected for high and low \({\text{V}}_{\text{E}}\) of LS [9]. These lines show a remarkable difference in \({\text{V}}_{\text{E}}\) of LS (4.5% of the mean of the base population), as well as differences in mortality, in biomarkers of immune response (plasma cortisol, leukocytes and acute-phase protein levels), and in plasma concentrations of cholesterol and triglycerides [20]. Moreover, the line with a low \({\text{V}}_{\text{E}}\) of LS was found to cope better with environmental stressors such as infections than the line with a high \({\text{V}}_{\text{E}}\) of LS, which suggests that the homogeneous line is more resilient.

Methods

Animals

The rabbits used in this study belong to a divergent selection experiment for high and low \({\text{V}}_{\text{E}}\) of LS over 12 generations at the University Miguel Hernandez of Elche, Spain. Each divergent line had approximately 125 female and 25 male parents per generation (for more details see Blasco et al. [9]). The total number of litters over these generations was equal to 13,788 for 3070 does: 6094 from the line with a low \({\text{V}}_{\text{E}}\) of LS, 6682 from the line with a high \({\text{V}}_{\text{E}}\) of LS, and 1012 from the base population. In total, 1658 records of litter size from generations 11 and 12, and genotypes for 384 does were used for the GWAS: 96 from the base population (404 parities), 149 from the line with a high \({\text{V}}_{\text{E}}\) of LS (649 parities), and 139 from the line with a low \({\text{V}}_{\text{E}}\) of LS (605 parities). The average litter size (total number born; TNB includes live born plus stillborn) for the base population and the lines with a low and high \({\text{V}}_{\text{E}}\) of LS was 8.72 (± 3.05), 7.71 (± 2.38) and 6.51 (± 3.06), respectively.

Phenotype

In this study, we investigated genomic regions that were associated with the \({\text{V}}_{\text{E}}\) of LS, which was the selection criterion in the divergent selection experiment [9]. The \({\text{V}}_{\text{E}}\) of LS was calculated as the within-doe variance of TNB, after correction of TNB by year-season (47 levels) and parity-lactation status (3 levels) to avoid the effect of systematic effects on \({\text{V}}_{\text{E}}\). The mean estimate of residuals for a doe across parities was used to calculate the \({\text{V}}_{\text{E}}\) of LS for a doe, using the minimum quadratic risk estimator:

$${\text{V}}_{\text{E}} = \frac{1}{n + 1}\mathop \sum \limits_{1}^{n} \left( {x_{i} - \bar{x}} \right)^{2} ,$$
(1)

where \(x_{i}\) is the pre-corrected TNB at parity \(i\) of a doe and \(n\) is the total number of parities of the doe (ranging from 2 to 12). \({\text{V}}_{\text{E}}\) was calculated by assuming that the additive genetic and permanent effects are approximately the same for each parity of a doe [21]. The average of the \({\text{V}}_{\text{E}}\) of LS was 4.24 (± 3.41), 2.27 (± 1.97) and 3.84 (± 3.69) for the base population and for the low and high lines, respectively.

Genotypes

Genomic DNA was isolated from blood sampled from does using standard procedures. Genotyping was performed with the 200 K Affymetrix Axiom OrcunSNP array (ThermoFisher Scientific). Quality control of genotypes was performed using the platform Axiom Analysis Suite 3.1 of ThermoFisher Scientific and the PLINK v1.9 software [22]. Animals with a call rate lower than 97% and SNPs with a minor allele frequency lower than 0.05, with missing genotypes higher than 0.05, or with unknown positions on the rabbit reference genome (OryCun v2.0.96) were removed. After quality control, 367 animals (1589 parities) and 96,329 SNPs remained in the dataset. The missing genotypes were imputed with the Beagle v.4.1 software [23]. Finally, we identified outliers and checked the population structure by applying a principal component analysis (PCA) based on the genotypes (see Additional file 1).

GWAS

Two approaches were used for GWAS: single-marker regression (SMR) and Bayesian multiple-marker regression (BMMR). SMR was performed using the linear mixed model method, which is available in the GCTA v1.91.4 beta software [24]. To correct for population stratification, GCTA considers the genomic relationship matrix built, but without SNPs on the chromosome of the tested SNP [25]. The SNPs that were associated with \({\text{V}}_{\text{E}}\) of LS were identified at a conservative p-value threshold of 0.0001 [26]. Using the same method, we also tested the effect of ignoring differences in number of parities between does. In order to do that, we performed the SMR using a weighted linear mixed model method implemented in the R software. Instead of a genomic relationship matrix, the model included the first five principal components of the genomic relationship matrix to correct for population stratification. The \({\text{V}}_{\text{E}}\) of LS was weighted according to Blasco et al. [9]:

$$w_{i} = \frac{{\left( {n_{i} + 1} \right)^{2} }}{{2*\left( {n_{i} - 1} \right)}},$$
(2)

where \(n_{i}\) is the total number of parities for doe \(i\).

BMMR was performed using a Bayes B model that is implemented in the GenSel software [27]. This model assumed that, in a given iteration of the Monte Carlo Markov chain, many SNPs have no effect and variance, with a prior probability of π = 0.999, and approximately 100 SNPs have an effect and a variance on the \({\text{V}}_{\text{E}}\) of LS. The analysis was done using a chain length of 550,000, with a lag of 100 and a burn-in of 150,000. The means of the priors of the genotypic and environmental variances were equal to 5.1 and 4.3, respectively. A Bayes factor was calculated to determine statistical significance of the SNP associations, as:

$$BF = \frac{{\hat{p}_{i} /\left( {1 - \hat{p}_{i} } \right)}}{{\left( {1 - \pi } \right)/\pi }},$$

where \(\pi\) is the prior probability and \(\hat{p}_{i}\) is the posterior probability of a SNP in locus \(i\) having an effect. A threshold for \(BF\) higher than 10 was used to identify SNPs that are associated with \({\text{V}}_{\text{E}}\) of LS [28]. The contribution of each of the 2125 non-overlapping 1-Mb genomic windows to the genetic variance was computed as the posterior mean of the percentage of the genomic variance explained by all markers across the genome (total genomic variance).

Additional evidence for associated SNPs

The SNPs identified to be associated with \({\text{V}}_{\text{E}}\) using both the SMR and BMMR approaches were further tested using a permutation test and a GWAS within each population (base, with a high and with a low \({\text{V}}_{\text{E}}\) of LS) to determine whether they were spurious associations due to drift. Only SNPs that passed these additional tests were considered as displaying a true association with \({\text{V}}_{\text{E}}\) of LS.

The permutation test was performed using the PLINK v1. 9 software [22]. In total, 100,000 random permutations were performed to remove the true association between \({\text{V}}_{\text{E}}\) of LS and the genotype. Each permutated dataset was analysed using a linear-mixed model and the p-value of each SNP was calculated. The resulting distribution of p-values was used to calculate an empirical p-value (EMP1) for each SNP in the original data based on the number of times that the p-value of that SNP was declared to have a significant association with \({\text{V}}_{\text{E}}\) of LS under the null hypothesis of no association in the permutated data. The minimum EMP1 that could be registered was 1/N, where N is the number of permutation tests. Thus, only SNPs with an EMP1 close to 0.00001 were considered to be associated with \({\text{V}}_{\text{E}}\) of LS.

Within-population GWAS was performed for each population using the SMR approach [24]. The same reference alleles were established in the two lines and in the base population to estimate allele substitution effects. Confidence intervals (CI) of the SNP effects within a population were estimated as the allele substitution effect estimate ± 2SE. Overlapping CI for a SNP between lines was declared to signify no evidence of differences in allele effects on the phenotype across populations.

Identification of associated genomic regions

The significant SNPs were used to perform a linkage disequilibrium (LD) study using the PLINK v1.9 software [22]. For this purpose, SNPs within 0.5 Mb of a significant SNP were grouped into blocks, which were then expanded to genomic regions ± 1 Mb for the LD study. Genomic regions associated with the trait were considered to be blocks of SNPs with r2 higher than 0.7 between each other. We established this strong threshold following Vanliere et al. [29], who determined that two SNPs were dependent when r2 was equal or higher than 0.43.

Whole-genome sequencing

To identify which variants were present in one line but not in the other, due to the selection process, a pool of DNA from the breeding males in the 10th generation was created for each line (27 animals per line) and sequenced with Illumina Technology at an average depth of 27×. These males were all fathers of animals from the 11th generation, which were used in the GWAS.

Pre-processing of the WGS data was performed following Elston [30]. The BWA algorithm [31] was used to index the OryCun v2.0.96 reference genome from the raw data. Illumina adapters and low-quality read ends were removed using Trimmomatic v0.39 [32]. The BWA-MEM algorithm was used to align reads to OryCun v2.0.96. The sorted BAM files were obtained by SAMtools [33]. Duplicates were marked using Picard MarkDuplicates [34].

Variant calling was performed using the GATK Best Practices pipeline [35] by applying GATK to the BAM files using HaplotypeCaller and GenotypeGVCF to obtain the raw VCF files for the high and low \({\text{V}}_{\text{E}}\) of LS lines. Variants were filtered using SelectVariant from GATK. Single nucleotide variants (SNVs) that were filtered out were labelled using VariantFiltration with the following filter expressions: QD < 2, FS > 60, MQ < 40, MQRankSum < −12.5, ReadPosRankSum < −8. INDELs were filtered out according to QD < 2, FS > 200, and ReadPosRankSum < −20. Finally, variants were annotated using the snpEFF software [36].

Identification of candidate genes and functional mutations

The Ensembl release 97 database [37] was used to investigate candidate genes in the genomic regions associated with \({\text{V}}_{\text{E}}\) of LS, using OryCun 2.0.97 as the reference genome. SNVs and INDELs that were present in the genomic regions associated with \({\text{V}}_{\text{E}}\) of LS were also detected. Variants that segregated differently between the two lines and that had a greater impact on gene function were proposed as functional mutations for \({\text{V}}_{\text{E}}\) of LS. We considered that a variant had a greater impact if it affected: (a) the amino acid sequence of the protein (missense or frameshift mutations), (b) the UTR regions of the mRNAs, or (c) the splicing pattern of the transcripts. Genes that contained such possible functional mutations were identified as candidate genes for \({\text{V}}_{\text{E}}\) of LS were. The biological functions and the gene ontology of the candidate genes were reviewed in GeneCards [38].

Results

Genomic regions associated with \({\mathbf{V}}_{{\mathbf{E}}}\) of LS

GWAS identified SNPs associated with \({\text{V}}_{\text{E}}\) of LS using two approaches, SMR and BMMR. SMR identified 12 SNPs with a p-value less than 0.0001 on Oryctolagus cuniculus chromosomes (OCU) 3, 7, 10, and 14 (Fig. 1a). The same results were obtained with the weighted SMR analysis of \({\text{V}}_{\text{E}}\) to take differences in number of parities between does into account (data not shown). With BMMR, we identified 60 SNPs on several chromosomes that had a Bayes factor (\(BF\)) higher than 10 (Fig. 1b), including all the SNPs that were identified by SMR (Table 1). These latter SNPs were in genomic windows on OCU3 (50–52 Mb), OCU7 (141–142 Mb), OCU10 (4–5.7 Mb), and OCU14 (163–164 Mb), which explained 4.0, 0.2, 3.2 and 0.5% of the total genomic variance, respectively. The three most significant SNPs on OCU9 were also considered because they reached a p-value close to the threshold of 0.0001 (0.00018) and a \(BF\) greater than 10 (Fig. 1). The genomic window that contained these SNPs on OCU9 (4–6 Mb) explained 0.9% of the total genomic variance. In summary, 15 SNPs were identified to be associated with \({\text{V}}_{\text{E}}\) of LS by both methods.

Fig. 1
figure 1

Manhattan plots for genome-wide association analyses for environmental variance of litter size. a −log10(p-value) for association of SNPs using the single-marker regression approach. b Bayes factor (BF) for association of SNPs using the Bayesian multiple-marker regression approach. The dashed lines represent significance thresholds a p-value of 0.0001 (a) and \(BF\) of 10 (b). The red triangles highlight the SNPs that pass the threshold

Table 1 Genomic regions associated with environmental variance of litter size in rabbits

These 15 SNPs were further evaluated by comparing their within-population allele substitution effects and by a permutation test. The allele substitution effect estimates for the 15 SNPs did not differ significantly between lines. However, five of these SNPs, located on OCU14 and 9, did not pass the permutation test because of their high empirical p-value (EMP1). The 10 SNPs that passed the additional tests were used to perform an LD analysis and determine the \({\text{V}}_{\text{E}}\)-associated genomic regions (vQTL), as described in methods (Table 1), resulting in associated LD blocks of 1.2, 1.8 and 2.4 Mb on OCU14, 10 and 3, respectively. On OCU7, no associated LD block was detected (see Additional files 2, 3, 4 and 5). Hence, vQTL were identified on OCU3 at 50.4–52.8 Mb, on OCU10 at 3.9–5.7 Mb, on OCU14 at 162–163.2, and on OCU7, close to 141,236,037 bp (Table 1).

Candidate genes for \({\mathbf{V}}_{{\mathbf{E}}}\) of litter size

In total, 38 genes were located in the genomic regions that were associated with \({\text{V}}_{\text{E}}\) of LS (Table 1). We used WGS data of each line to identify 18,729 variants (SNVs + INDELs) in these regions (Table 2). From these, 129 were relevant (112 SNVs and 17 INDELs) based on their location in the transcription unit and/or splicing sites, which were located in 16 of the 38 genes identified in the GWAS (see Additional file 6). These 16 genes (proposed as candidate genes) are involved in biological processes related to inflammatory response, development of sensory structures, and regulation of gene expression (see Additional file 7).

Table 2 Classification and total number of variants (SNVs + INDELs) in each vQTL region

All 129 relevant variants segregated in one of the two lines and were absent (91) or fixed (37) in the other line, except for one INDEL in the 3′UTR of the HUNK gene (see Additional file 6). This latter was a 1-bp deletion that was fixed in the line with a low \({\text{V}}_{\text{E}}\) of LS and absent in the line with a high \({\text{V}}_{\text{E}}\) of LS. The other variants that were fixed for the alternative allele were identified in the line with a high \({\text{V}}_{\text{E}}\) of LS in the ITGB8, MIS18A, ENSOCUG00000021276, and URB1 genes, and in the line with a high \({\text{V}}_{\text{E}}\) of LS for the HDAC9 gene (see Additional file 6). These variants could affect biological processes that are related to immune (HDAC9, ITGB8, and HUNK) and stress (ENSOCUG00000021276) responses, to regulation of gene expression (HDAC9, MIS18A, and URB1), and to phosphorylation of proteins (HUNK).

Discussion

Our aim was to identify candidate genes and functional mutations associated with \({\text{V}}_{\text{E}}\) of litter size in rabbits. In GWAS, estimates of the effect of genomic variants on the phenotype depends on the model used [39]. In our study, we identified associated genomic regions using SMR and BMMR analyses. The SMR analysis does not consider the dependencies between SNPs, so the effects were overestimated. In addition, the number of false negatives increases when a correction such as Bonferroni is applied and variants with small effects cannot be detected. In the BMMR analysis, the shrinkage parameter of the model (π = 0.999) increases the power to detect variants with small effects but also increases the number of false positives [39]. Thus, in our study, only genomic regions that were identified by both methods were considered as candidate regions for identifying relevant genes.

Several genomic regions were associated with \({\text{V}}_{\text{E}}\) of LS (Table 1). The highlighted SNPs in these regions were further evaluated using within-population GWAS and a permutation test. However, both these tests have some limitations. The within-population GWAS, accurate estimation of the allele substitution effect was limited by the small number of individuals per population (base = 91; low = 134; high = 142), which did not represent the allele and genotype frequencies in each population. For the permutation test, the highest EMP1 of the SNPs retained in the analysis was 0.00097 (OCU14). This means that in 97 of the 100,000 permutation tests performed, the SNP was associated with \({\text{V}}_{\text{E}}\) of LS by chance under the null hypothesis. This could be due to the high level of relationship between animals in each population, which hinders elimination of true associations between genotype and phenotype.

The GWAS results were combined with WGS to identify candidate genes and functional mutations associated with \({\text{V}}_{\text{E}}\) of LS. We screened for SNVs and INDELs between the two divergent rabbit lines in the vQTL that were detected by GWAS (Table 2). A variant was considered as a potential functional mutation when it caused a missense or frameshift mutation or affected the UTR regions in the mRNAs or the splicing pattern of the transcripts. Such variants are expected to have a critical effect on the function of a gene because of a change in mRNA stability or in the amino acid sequence of the protein it encodes. Sixteen of the 38 genes identified in the GWAS contained at least one of these variants (see Additional file 6). Most of these variants segregated in one of the rabbit lines and were absent (91) or fixed (37) in the other line (see Additional file 6). The use of DNA pools for WGS allowed us to have more coverage to identify different variants between the lines with high and low \({\text{V}}_{\text{E}}\) of LS. However, the use of pools does not allow estimation of the frequency of a variant in a line, or the genotype of each animal used in the pool. For this reason, although we classified 129 variants as functional mutations, we focused on the variants that were fixed in one line and not in the other (see Additional file 6).

The 16 candidate genes identified in this study are involved in functions that are related to immune (DOCK2, HDAC9, ITGB8, and HUNK) and stress (ENSOCUG00000021276) responses, development of sensory structures (FOXI1, FGF18, and EVA1C), regulation of gene expression (PAXBP1, FERD3L, HDAC9, and FOXI1), and phosphorylation of proteins (HUNK), among others [see Additional file 7]. A recent study by Argente et al. [20] found differences in levels of plasma leukocytes and cortisol between the divergent rabbit lines used here but from generation 8 and showed that the line with a low \({\text{V}}_{\text{E}}\) of LS was less sensitive to infection and stress than the line with a high \({\text{V}}_{\text{E}}\) of LS. Our results confirm the importance of immune and stress responses for \({\text{V}}_{\text{E}}\) of LS through the DOCK2, ITGB8, HDAC9, and ENSOCUG00000021276 genes. For instance, DOCK2 is involved in the extravasation of monocytes (entry into the affected tissue) by promoting polarization of the cell membrane and remodelling of the actin cytoskeleton needed for this function. In addition, DOCK2 controls the monocyte inflammatory response via FcyR receptors [40], such as ITGB8, through TGF-β activation [41]. The HDAC9 gene may play a role in hematopoiesis and self-tolerance through the control of Treg cells [42]. The ENSOCUG00000021276 gene, which is orthologous to the human MRAP (melanocortin 2 receptor accessory protein) gene, could modulate stress response though the production of glucocorticoids in the adrenal gland but experimental analyses are needed to verify this inferred function [37].

Previous GWAS for \({\text{V}}_{\text{E}}\) in pigs and cows also identified genes that are involved in the immune response, more specifically in the inflammatory response [13,14,15]. Sell-Kubiak et al. [13] and Morgante et al. [14] identified genes of the HSP (heat shock protein) family to be associated with \({\text{V}}_{\text{E}}\), which regulate activation of leukocytes and protect cells against reactive-oxygen species (ROS) [43]. In mice, the candidate gene HDAC9 regulates expression of a gene of the HSP family (HSP70) [42]. Wijga et al. [15] also found genes involved in the phagocytosis process to be associated with the standard deviation of milk somatic cell count in cattle, which is in line with functions related to the DOCK2 gene [40]. Thus, there are several lines of evidence that support the importance of the immune system in the control of \({\text{V}}_{\text{E}}\).

For the other genes identified here (FOXI1, EVAC1, FGF18, and HUNK), we found no evidence in the literature that links them to a biological function associated with \({\text{V}}_{\text{E}}\). A recent study proposed to use \({\text{V}}_{\text{E}}\) as a measure of animal resilience [10], which is supported by the results of Argente et al. [20], who suggested that the line with a low \({\text{V}}_{\text{E}}\) of LS is more resilient to general stressors than the line with a high \({\text{V}}_{\text{E}}\) of LS. According to Colditz and Hine [12], animals can better maintain performance (be more resilient) when they can properly discriminate environmental stimuli from the background. In this context, the nervous system, cell receptors, and the immune system act as sensors of environmental disturbances. Thus, the immune system is required to perceive and properly respond to environmental stimuli that occur on farms, as well as correct development of the sensory organs and the neuron system [12]. Along the same line, candidate genes such as FOXI1, EVAC1, and FGF18 would be important to develop sensory structures and parts of the nervous system. The FOXI1 gene encodes an important transcriptional factor, which is necessary for normal development of the inner ear, with mice that lack this gene developing deafness [44]. The EVA1C gene is involved in the correct development of olfactory and optic sensory axons and other neural structures [45]. The growth factor FGF18 regulates development of the neural system, specifically the midbrain structure [46]. Finally, HUNK is a serine/threonine kinase, which was recently shown to be associated with control of expression of E-cadherin [47], a molecule that can act as a receptor for pathogens [48]. We identified a 1-bp deletion in the 3′UTR region of the HUNK gene, which was fixed in the line with a low \({\text{V}}_{\text{E}}\) of LS and absent in the line with a high \({\text{V}}_{\text{E}}\) of LS. Mutations in the 3′UTR region can affect expression of the gene and/or translation rate of the mRNA. This suggests that different levels of expression of HUNK between the two lines could influence \({\text{V}}_{\text{E}}\) of LS. The role of the identified candidate genes on modulation of \({\text{V}}_{\text{E}}\) of LS and, therefore, on resilient responses, requires further study to complement current evidence on the relevance of the immune system on \({\text{V}}_{\text{E}}\) [13,14,15, 20].

Conclusions

A combined GWAS and WGS analysis allowed us to identify 16 new candidate genes that carry 129 putative functional mutations that are associated with \({\text{V}}_{\text{E}}\) of LS in rabbits. These findings provide support for the control of \({\text{V}}_{\text{E}}\) of LS through regulation of the immune system and suggest that development of the nervous system and sensory structures may also be important to modulate animal resilience. This study advances our understanding of the genetic background of \({\text{V}}_{\text{E}}\). However, further studies are needed to validate the true effect of the putative functional mutations in these genes on \({\text{V}}_{\text{E}}\) of LS, as well as the relationship of \({\text{V}}_{\text{E}}\) with animal resilience.

Availability of data and materials

Data are available upon request to the corresponding author.

References

  1. Ibáñez-Escriche N, Varona L, Sorensen D, Noguera JL. A study of heterogeneity of environmental variance for slaughter weight in pigs. Animal. 2008;2:19–26.

    Article  PubMed  Google Scholar 

  2. Ibáñez-Escriche N, Moreno A, Nieto B, Piqueras P, Salgado C, Gutiérrez JP. Genetic parameters related to environmental variability of weight traits in a selection experiment for weight gain in mice; signs of correlated canalised response. Genet Sel Evol. 2008;40:279–93.

    PubMed  PubMed Central  Google Scholar 

  3. Mulder H, Hill W, Vereijken A, Veerkamp R. Estimation of genetic variation in residual variance in female and male broiler chickens. Animal. 2009;3:1673–80.

    Article  CAS  PubMed  Google Scholar 

  4. Ros M, Sorensen D, Waagepetersen R, Dupont-Nivet M, Sancristobal M, Bonnet J, et al. Evidence for genetic control of adult weight plasticity in the snail Helix aspersa. Genetics. 2004;168:2089–97.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Rönnegård L, Felleki M, Fikse WF, Mulder HA, Strandberg E. Variance component and breeding value estimation for genetic heterogeneity of residual variance in Swedish Holstein dairy cattle. J Dairy Sci. 2013;96:2627–36.

    Article  PubMed  CAS  Google Scholar 

  6. Falconer DS, Mackay TFC. Introduction to quantitative genetics. 4th ed. Harlow: Prentice Hall; 1996.

    Google Scholar 

  7. Garreau H, Bolet G, Larzul C, Robert-Granié C, Saleil G, SanCristobal M, et al. Results of four generations of a canalising selection for rabbit birth weight. Livest Sci. 2008;119:55–62.

    Article  Google Scholar 

  8. Formoso-Rafferty N, Cervantes I, Ibáñez-Escriche N, Gutiérrez JP. Genetic control of the environmental variance for birth weight in seven generations of a divergent selection experiment in mice. J Anim Breed Genet. 2016;133:227–37.

    Article  CAS  PubMed  Google Scholar 

  9. Blasco A, Martínez-Álvaro M, García ML, Ibáñez-Escriche N, Argente MJ. Selection for environmental variance of litter size in rabbit. Genet Sel Evol. 2017;49:48.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Berghof TVL, Poppe M, Mulder HA. Opportunities to improve resilience in animal breeding programs. Front Genet. 2019;9:692.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Mulder HA. Genomic selection improves response to selection in resilience by exploiting genotype by environment interactions. Front Genet. 2016;7:178.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. Colditz IG, Hine BC. Resilience in farm animals: biology, management, breeding and implications for animal welfare. Anim Prod Sci. 2016;56:1961–83.

    Article  Google Scholar 

  13. Sell-Kubiak E, Duijvesteijn N, Lopes MS, Janss LLG, Knol EF, Bijma P, et al. Genome-wide association study reveals novel loci for litter size and its variability in a Large White pig population. BMC Genomics. 2015;16:1049.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Morgante F, Sørensen P, Sorensen DA, Maltecca C, Mackay TFC. Genetic architecture of micro-environmental plasticity in Drosophila melanogaster. Sci Rep. 2015;5:9785.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Wijga S, Bastiaansen JWM, Wall E, Strandberg E, de Haas Y, Giblin L, et al. Genomic associations with somatic cell score in first-lactation Holstein cows. J Dairy Sci. 2012;95:899–908.

    Article  CAS  PubMed  Google Scholar 

  16. Owen JA, Punt J, Stranford SA, Jones PP, Kuby J. Immunology. 7th ed. New York: Freeman; 2013.

    Google Scholar 

  17. Yang J, Loos RJF, Powel JE, Medland SE, Elizabeth K, Chasman DI, et al. FTO genotype is associated with phenotypic variability of body mass index. Nature. 2012;490:267–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Feng Y, Wang F, Pan H, Qiu S, Lu J, Wu L, et al. Obesity-associated gene FTO rs9939609 polymorphism in relation to the risk of tuberculosis. BMC Infect Dis. 2014;14:592.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. Hermesch S, Dominik S, editors. Breeding focus 2014—improving resilience. Armidale: University of New England; 2014.

    Google Scholar 

  20. Argente MJ, García ML, Zbyňovská K, Petruška P, Capcarová M, Blasco A. Correlated response to selection for litter size environmental variability in rabbits’ resilience. Animal. 2019;13:2348–55.

    Article  CAS  PubMed  Google Scholar 

  21. Piles M, Garcia ML, Rafel O, Ramon J, Baselga M. Genetics of litter size in three maternal lines of rabbits: repeatability versus multiple-trait models. J Anim Sci. 2006;84:2309–15.

    Article  CAS  PubMed  Google Scholar 

  22. Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. Browning BL, Browning SR. Genotype imputation with millions of reference samples. Am J Hum Genet. 2016;98:116–26.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Yang J, Weedon MN, Purcell S, Lettre G, Estrada K, Willer CJ, et al. Genomic inflation factors under polygenic inheritance. Eur J Hum Genet. 2011;19:807–12.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Lander E, Kruglyak L. Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet. 1995;11:241–7.

    Article  CAS  PubMed  Google Scholar 

  27. Garrick DJ, Fernando RL. Genome-wide association studies and genomic prediction. Methods Mol Biol. 2013;1019:275–98.

    Article  CAS  PubMed  Google Scholar 

  28. Kass RE, Raftery AE. Bayes factors. J Am Stat Assoc. 1995;90:773–95.

    Article  Google Scholar 

  29. VanLiere JM, Rosenberg NA. Mathematical properties of the r2 measure of linkage disequilibrium. Theor Pop Biol. 2008;74:130–7.

    Article  Google Scholar 

  30. Elston RC. Preprocessing and quality control for whole-genome sequences from the Illumina HiSeq X platform. In: Wright MN, Gola D, Ziegler A, editors. Statistical human genetics methods in molecular biology, vol. 1666. New York: Humana Press; 2017. p. 629–47.

    Google Scholar 

  31. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics. 2009;25:1754–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. 1000 Genome project data processing subgroup. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  34. Broad Institute. Picard tools. version 2.17.8; Broad Institute, GitHub repository. http://broadinstitute.github.io/picard/. Accessed 21 Feb 2018.

  35. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Cingolani P, Platts A, le Wang L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6:80–92.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018. Nucleic Acids Res. 2018;46:D754–61.

    Article  CAS  PubMed  Google Scholar 

  38. Stelzer G, Rosen R, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, et al. The GeneCards suite: from gene data mining to disease genome sequence analysis. Curr Protoc Bioinformatics. 2016;54:1–30.

    Article  PubMed  Google Scholar 

  39. López de Maturana E, Ibáñez-Escriche N, González-Recio Ó, Marenne G, Mehrban H, Chanock SJ, et al. Next generation modelling in GWAS: comparing different genetic architectures. Hum Genet. 2014;133:1235–53.

    Article  PubMed  Google Scholar 

  40. Chen Y, Meng F, Wang B, He L, Liu Y, Liu Z. Dock2 in the development of inflammation and cancer. Eur J Immunol. 2018;48:915–22.

    Article  CAS  PubMed  Google Scholar 

  41. Kelly A, Gunaltay S, McEntee CP, Shuttleworth EE, Smedley C, Houston SA, et al. Human monocytes and macrophages regulate immune tolerance via integrin αvβ8–mediated TGFβ activation. J Exp Med. 2018;215:2725–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. de Zoeten EF, Wang L, Sai H, Dillmann WH, Hancock WW. Inhibition of HDAC9 increases T regulatory cell function and prevents colitis in mice. Gastroenterology. 2010;138:583–94.

    Article  PubMed  CAS  Google Scholar 

  43. Zhang X, Mosser D. Macrophage activation by endogenous danger signals. J Pathol. 2009;214:161–78.

    Article  Google Scholar 

  44. Enerbäck S, Nilsson D, Edwards N, Heglind M, Alkanderi S, Ashton E, et al. Acidosis and deafness in patients with recessive mutations in FOXI1. J Am Soc Nephrol. 2018;29:1041–8.

    PubMed  Google Scholar 

  45. James G, Foster SR, Key B, Beverdam A. The expression pattern of EVA1C, a novel slit receptor, is consistent with an axon guidance role in the mouse nervous system. PLoS One. 2013;28:e74115.

    Article  CAS  Google Scholar 

  46. Liu A, Li JYH, Bromleigh C, Lao Z, Niswander LA, Joyner AL. FGF17b and FGF18 have different midbrain regulatory properties from FGF8b or activated FGF receptors. Development. 2003;130:6175–85.

    Article  CAS  PubMed  Google Scholar 

  47. Williams CB, Phelps-Polirer K, Dingle IP, Williams CJ, Rhett MJ, Eblen ST, et al. HUNK phosphorylates EGFR to regulate breast cancer metastasis. Oncogene. 2020;39:1112–24.

    Article  CAS  PubMed  Google Scholar 

  48. Costa AM, Leite M, Seruca R, Figueiredo C. Adherens junctions as targets of microorganisms: a focus on Helicobacter pylori. FEBS Lett. 2013;587:259–65.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We acknowledge the contributions of Roger Ros-Freixedes and Francisco Rosich of ASIC to the whole-genome sequencing pipeline and the useful comments of Agustin Blasco and Jack Dekkers. Cristina Casto-Rebollo acknowledges the FPU17/01196 scholarship of the Spanish Ministry of Science, Innovation and Universities.

Funding

This study was funded by the Spanish Ministry of Economy and Competitiveness (MINECO) with the Projects AGL2014-55921, C2-1-P and C2-2-P, and AGL2017-86083, C2-1-P and C2-2-P and the Grant RYC-2016-19764.

Author information

Authors and Affiliations

Authors

Contributions

CCR evaluated the experiments, analyzed the data and wrote the manuscript. MJA conceived and designed the study and contributed to the discussion of the results. MLG contributed to the study design and the discussion of the results. RP analyzed the lab data, contributed to the discussion of the results and edited the manuscript. NIE conceived the study, evaluated the experiments, contributed to the discussion of the results and edited the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Noelia Ibáñez-Escriche.

Ethics declarations

Ethics approval and consent to participate

All experimental procedures were approved by the Committee of Ethics and Animal Welfare of the Miguel Hernández University, according to Council Directives 98/58/EC and 2010/63/EU.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Principal component analysis applied to the genotype data. Representation of the first (PC1) and second (PC2) components for the genotypes in the base population (red) and in the high (blue) and low (green) selection lines for environmental variance (\({\text{V}}_{\text{E}}\)) of litter size (LS).

Additional file 2.

Linkage disequilibrium of SNPs on OCU3 at 49–53 Mb. Representation of the linkage disequilibrium (LD) in the associated genomic region on OCU3. SNPs in this region were plotted according to their Bayes factor (\(BF\)). The colours of the SNPs indicate their LD with the SNP with the highest \(BF\) in this region (highlighted with a black triangle). Colours between red and green indicate an r2 between 1 and 0.5. Colours between green and blue indicate an r2 between 0.5 and 0. Genes in this region are plotted at the bottom of the graph according to their position on the genome.

Additional file 3.

Linkage disequilibrium of SNPs on OCU7 at 140–142 Mb. Representation of the linkage disequilibrium (LD) in the associated genomic region on OCU7. SNPs in this region were plotted according to their Bayes factor (\(BF\)). The colours of the SNPs indicate their LD with the SNP with the highest \(BF\) in this region (highlighted with a black triangle). Colours between red and green indicate an r2 between 1 and 0.5. Colours between green and blue indicate an r2 between 0.5 and 0. Genes in this region are plotted at the bottom of the graph according to their position on the genome.

Additional file 4.

Linkage disequilibrium of SNPs on OCU10 at 3–7 Mb. Representation of the linkage disequilibrium (LD) in the associated genomic region on OCU10. SNPs in this region were plotted according to their Bayes factor (\(BF\)). The colours of the SNPs indicate their LD with the SNP with the highest \(BF\) in this region (highlighted with a black triangle). Colours between red and green indicate an r2 between 1 and 0.5. Colours between green and blue indicate an r2 between 0.5 and 0. Genes in this region is plotted at the bottom of the graph according to their position on the genome.

Additional file 5.

Linkage disequilibrium of SNPs on OCU14 at 161–164 Mb. Representation of the linkage disequilibrium (LD) in the associated genomic region in OCU14. SNPs in the region were plotted according to their Bayes factor (\(BF\)). The colours of the SNPs indicate their LD with the SNP with the higher \(BF\) in this region (highlighted with a black triangle). Colours between red and green indicate r2 between 1 and 0.5. Colours between green and blue indicate r2 between 0.5 and 0. Genes in this region was plotted at the bottom of the graphic according their position in the genome.

Additional file 6.

Candidate genes and their biological function based on the GeneCards and Ensembl databases.

Additional file 7.

Total number of relevant variants identified in the associated genomic regions for the environmental variance of litter size. INDELs and SNVs identified by WGS analysis in a UTR or splicing region or with a missense effect. For each variant, the position is indicated according to the chromosome (OCU) and base pair (bp) location. REF show the allele in the reference genome (Oryctolagus cuniculus v2.0.96) and ALT the alternative variant identified in the rabbit lines. Low and High show the allelic distribution in each line where 0 indicates the reference allele and 1 the alternative allele.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Casto-Rebollo, C., Argente, M.J., García, M.L. et al. Identification of functional mutations associated with environmental variance of litter size in rabbits. Genet Sel Evol 52, 22 (2020). https://doi.org/10.1186/s12711-020-00542-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12711-020-00542-w