Multi-omic data integration and analysis using systems genomics approaches: methods and applications in animal production, health and welfare
© Suravajhala et al. 2016
Received: 7 October 2015
Accepted: 16 April 2016
Published: 29 April 2016
In the past years, there has been a remarkable development of high-throughput omics (HTO) technologies such as genomics, epigenomics, transcriptomics, proteomics and metabolomics across all facets of biology. This has spearheaded the progress of the systems biology era, including applications on animal production and health traits. However, notwithstanding these new HTO technologies, there remains an emerging challenge in data analysis. On the one hand, different HTO technologies judged on their own merit are appropriate for the identification of disease-causing genes, biomarkers for prevention and drug targets for the treatment of diseases and for individualized genomic predictions of performance or disease risks. On the other hand, integration of multi-omic data and joint modelling and analyses are very powerful and accurate to understand the systems biology of healthy and sustainable production of animals. We present an overview of current and emerging HTO technologies each with a focus on their applications in animal and veterinary sciences before introducing an integrative systems genomics framework for analysing and integrating multi-omic data towards improved animal production, health and welfare. We conclude that there are big challenges in multi-omic data integration, modelling and systems-level analyses, particularly with the fast emerging HTO technologies. We highlight existing and emerging systems genomics approaches and discuss how they contribute to our understanding of the biology of complex traits or diseases and holistic improvement of production performance, disease resistance and welfare.
Overview of the different ‘omic’ levels used in systems genomic analyses
Complete collection of DNA, containing all the genetic information of an organism
Complete collection of changes to the DNA and histone proteins
Complete collection of RNA molecules in a cell or collection of cells
Complete collection of proteins in e.g. a cell, tissue, or organism
Complete collection of small-molecule chemicals (e.g. hormones) in e.g. a cell, tissue or organism
Complete collection of (genes of) microbes in the organism
Complete collection of genetic material contained in an environmental sample
Complete collection of phenotypic traits, affected by genomic and/or environmental factors in an organism
Complete collection of functions described by all the complementary members in living organisms
Recently, a few articles reviewed systems biology and systems genetics in an animal context [18–20]. From the animal organizational level to the individual components of the systems within animals, these papers have given an overview of ‘omics’-enabled components that are characterized by vast amounts of data. Furthermore, a collection of papers presented at the symposium on systems biology in animals have attempted to bridge the gap and demonstrated that complex regulatory relationships exist among genotypes and phenotypes with an emphasis on the applicability of these integrative systems biology methods . Integrating data in the multi-omic space is difficult and tedious because of the extremely large volumes of data produced across several HTO platforms, primarily from NGS machines. Secondary datasets that are generated after quality control of the raw datasets are further analysed by bioinformatics and statistical methods to create tertiary datasets (quality-controlled final datasets) that in turn form the basis of input data to systems genomics analyses. There are a number of comprehensive repositories of data obtained from genomic, transcriptomic and phenomic resources that are specific to complex traits and may explain the effect of variants (single nucleotide polymorphisms (SNPs), insertions, deletions, copy number variations (CNV)) on these traits . Studies have validated several developments that aim at combining biological findings and data obtained from various resources. Recently, a complete model that estimates effects of candidate genes responsible for diseases and of those that affect interactions was conceptualized in the form of a ‘Genome <==> Phenome Superhighway’ (GPS) . Genetic mapping (either linkage or association) of internal or endo-phenotypes (e.g. based on gene expression or metabolite or protein levels or quantitative trait loci (QTL) mapping [3, 4]) has generated a wealth of data and created a need for classifying, annotating, storing and analysing these data to understand their role in the genetic variation of these endo-phenotypes. More precisely, the ‘omic’ space that contains data on a trait will advance our understanding of the functions that are associated with biochemical pathways and the interactions between macro- and micromolecules . However, it is necessary to carefully quality control a wide array of HTO methods that generate multi-omics datasets to remove redundant and false-positive data. With redundancy of data representing a huge threat in terms of errors in data usage, sharing data in a common information space would reduce the amount of redundant data and the potential for error. It would also contribute to harmonize to a greater degree the standardization activities across different ‘omics’ data, a critical issue in view of the integration of data from these different sources . Assuming that each of these HTO datasets can be quality controlled, the next logical step would be to assess how these diverse HTO datasets that represent different biological levels could be integrated and jointly analysed to exploit their full potential for the improvement of animal production, health and welfare. With the foregoing introduction to multi-omic data integration and analysis methods, our main objectives in this mini-review are to outline current and emerging methods to generate main ‘omic’ data types and highlight some applications in animal and veterinary sciences. This paper focuses only on the main HTO data types in systems genomics (genomics, transcriptomics, epigenomics, metabolomics and proteomics). The last section of this paper re-introduces ‘systems genetics’ or ‘systems genomics’ in an emerging multi-omics context and provides perspectives on how systems genomics can be used towards improving animal production, health and welfare.
For both humans and various animal species, the study of the “whole genome” goes back to the 1990s when the focus was on the identification of genetic variants using the GWAS approach, which is based on microarrays or chips with tens of thousands of SNPs. In a GWAS, each SNP is statistically tested for significance of association with the trait/phenotype of interest. In comparison with the wide variety of available human SNP chips, the number of SNPs on an animal SNP chip is much smaller, e.g. 60 K for pig and chicken, 50 K for sheep and 777 K for cattle. In livestock genomics, many GWAS focussed on production and health traits. In 2015, Sharma et al.  reported an extensive review on GWAS performed in cattle, pigs, and chicken. For example, GWAS on female reproduction traits in tropically adapted beef cattle , feed efficiency traits in pigs [26, 27], body weight in broilers  and obesity and metabolic diseases using the pig as a model  have been conducted. The advantage of performing GWAS in livestock species over humans is the availability of related animals and subsequent knowledge about the pedigree, which greatly reduces the number of individuals needed to reach sufficient power to detect genetic variants associated with the trait of interest. Furthermore, linkage disequilibrium (LD), which is much more extensive in the genome of animals than in the human genome and depends on relatedness between animals, has a positive influence on the required sample size. Animal breeding deals with such issues by using a mixed model that accounts for the population structure/pedigree . An important bioinformatics task is the annotation of the GWAS variants that explain a certain proportion of phenotypic variation and the prediction of functional properties, which serve to build ontology-based functional networks based on many databases. Many web services have been created to meet the challenge, but most of them only work within the framework of human medical research. Recently, the Functional Annotation of ANimal Genomes (FAANG) International Project Consortium was launched (http://www.faang.org/), to bring together animal scientists and sustain a steady focus on collaborations among this community. Three of four committees in this FAANG consortium address key issues on the functional annotation of animal genomes based on contributions from researchers worldwide, i.e. the Animals, Samples and Assays (ASA), Bioinformatics and Data Analysis (B&DA) and Metadata and Data Sharing (M&DS) committees. In addition, the 1000 Bull Genomes Project has provided the bovine research community with a huge volume of data on bovine variants that will be useful for GWAS and the identification of causal mutations (http://1000bullgenomes.com). These initiatives pave the way for a systematic incorporation of the findings of systems biology and systems genetics and for making them available online.
Another very important revolution in animal breeding is genomic selection (GS), a form of marker-assisted selection in which the whole genome (SNPs) is used in combination with the pedigree to predict breeding values of animals in a certain population (reviewed in ). GS or genomic prediction in general consists of two stages. First, the effects of SNPs are estimated using a training population for which both phenotypic and genomic data are available. Second, the effects of known SNPs are used to predict breeding values for a population for which only genomic data are available. The genomic best linear unbiased prediction (GBLUP) method uses a genomic relationship matrix (GRM) that describes the relationship between genotyped individuals using whole-genome genotyping data and after standardizing the matrix with respect to allele frequencies, it behaves like the numerator relationship matrix A in a regular BLUP. GBLUP provides genomic estimated breeding values (GEBV) of animals. This GBLUP version has evolved into single-step methods (ssBLUP) also known as the HBLUP method, as for example in the studies of Legarra et al. , Christensen et al. , Meuwissen et al. , and Koivula et al.  that calculate GEBV by using both the GRM and regular A matrix for both genotyped and non-genotyped animals. There are several GS methods, but the most widely used are those based on GBLUP and ssBLUP due to the simplicity of their assumptions and ease of computation. Bayesian methods that assume different distributional properties of SNP effects and a finite proportion of SNPs with non-zero effects are popular to identify QTL/candidate genes in a mixed model set up. GS has huge advantages for livestock genomics, since it significantly reduces the generation interval and thereby increases the response to selection. Furthermore, traits that are difficult or even impossible to measure (e.g. milk production on bulls or carcass traits on live animals) can now be genetically predicted and used to improve breeding strategies. GS is currently extensively used, for example in the pig  and cattle industries . Recently, the expected potential of sequence-assisted selection was shown to be unrealistic, since the increase in accuracy reached with sequence-based genotyping was small compared to high-density genotyping . However, increased awareness about including quantitative trait nucleotide (QTN) or loci (QTL) has raised interest, because current GS methods still ignore the functional or biological relevance of genes and their associations with QTN and QTL, and only use genomic data to build genomic relationship matrices (GRM) between genotyped and non-genotyped relatives. Attempts have been made to evaluate the relative contribution of groups of SNPs to the total genetic variance of traits, such as, intronic, exonic, intergenic, synonymous or non-synonymous variant classes, and overall, it seems that there are no significant differences between different groups of SNPs, as shown by Do et al. . Bayesian methods such as Bayes Cpi and Bayes R tend to provide more accurate predictions than GBLUP-based methods when a trait is clearly affected by major QTL. The concept of the systems genomic BLUP (sgBLUP) method  is based on a mixture model for whole-genome-based prediction and selection. sgBLUP uses two types of SNPs, i.e. SNPs that are functionally annotated to be relevant for the specific trait in question (e.g. by GWAS and post-GWAS SNP annotation software) and SNPs called “residual SNPs” that are commonly used to build the GRM and can be used across all traits.
Emerging technologies in genomics and epigenomics
Technology in the genomic and epigenomic fields is developing fast and provides opportunities for new ways of investigating the genome or epigenome and further implementation in animal breeding methods. With the growing power and speed of NGS, genome-wide genetic variation is now captured at the DNA sequence level at tens of millions of genomic locations. Epigenetic variation also contributes to phenotypic variation through histone modifications and DNA methylation at the gene level, which can lead to changes in or absence of the expression of genes that underlie a phenotype or a disease. The technical features of NGS are rapidly evolving, i.e. for example, restriction-site-associated DNA sequencing (RAD-Seq), which was recently applied in chickens and showed the effectiveness of NGS for animal breeding issues . Such data generated by these newest technologies will contribute to the identification of (novel) causal or regulatory variants, precise genome-wide LD patterns, and insertion-deletion (InDel) markers that will be useful in molecular-based animal breeding programs. Genotype-by-sequencing (GBS) is another novel NGS technique that was developed for plant breeding, but has potential for animal breeding due to its cost-effectiveness [40, 41]. However, its potential to reach reasonable prediction accuracy and minimal bias depends on the sequencing depth and the number of individuals that are sequenced with GBS . Similarly, CNV analysis using NGS has a significant impact on the study of phenotypic variation. A paradigm shift has occurred in such CNV studies with the initial global characterization being extended to in-depth studies leading to an integrated map. Although such enrichment analyses rely on good simulation and bioinformatics analyses, the results will produce a plethora of animal data that will contribute to the study of human diseases . To ensure appropriate gene expression in animals, it is necessary to determine the frequency with which semi-methylated CpG islands (CGI) or sites exist in various animal tissues, which is now possible with genome-wide DNA methylation analyses. By combining different technical approaches used to produce NGS data, it is possible to analyse genome-wide methylation patterns and profiles that match at the single-nucleotide resolution . One such method is reduced representation bisulfite sequencing (RRBS) , which is cost-effective since it allows to sequence only about 1 % of the genome by combining restriction enzymes and bisulfite sequencing. Chromatin immunoprecipitation (ChIP) coupled with high-throughput sequencing is used to integrate and efficiently identify protein-DNA binding sites in vivo. Combining ChIP-Seq data with expression data will allow us to unravel genome-wide patterns and to capture regulons and regulatory networks. Furthermore, recently it has become possible to perform ChIP-Seq with small amounts of fixed animal tissues instead of cultured cells, which will allow systematic analyses of protein–protein interactions (PPI) networks . In addition, dual luminescence-based co-immunoprecipitation (DULIP) makes it possible to detect PPI with high specificity and sensitivity. The co-immunoprecipitated luciferase tags are obtained either from ChIP-Seq methods or from other assays such as DULIP and further used to support and comprehend protein function and complex biological processes . With such advancements, systems-wide analyses that integrate mutation-dependent binding patterns (protein or DNA) will become feasible. It will be interesting to see how these methods improve the identification of candidate genes, drug targets and biomarkers as well as the capture of the complete genetic and epigenetic variation to accurately predict phenotypes. Currently, these high-throughput technologies offer many opportunities to better understand the complex quantitative traits and underlying (systems) biology. The remaining challenge is to overcome the difficulties in the discovery of causal genes and variants, drug targets, vaccines and biomarkers for highly complex diseases and traits of agricultural interest.
Transcriptomic research investigates the expression levels of all gene transcripts in a particular cell, at a particular time, and in a particular state. Up-regulation and down-regulation of genes result in different levels of proteins and metabolites that induce phenotypic changes in the animal. Thus, a better understanding of the regulation of genes should provide insight into the biological functioning and detection of genes that are important in diseases or production traits. The most common approach to analyze expression data is to compare expression levels between two states, e.g. healthy versus diseased or high-productive versus low-productive animals, also called differential expression analysis . Several studies have focussed on the detection of differentially expressed (DE) genes for various production and health traits in different species, for example adiposity in broilers , muscle development in cattle , skeletal muscle development in pigs  and intestinal parasite resistance in sheep . Experiments on animals are done under controlled conditions and allow their dissection for the collection of various tissue samples, such as brain tissues, which is rarely possible in human studies. For example, Band et al.  reported a bovine genomics project that studied expression data from spleen, placenta and brain tissue for a large group of animals. Expression studies on different tissues may lead to a better understanding of the pathophysiology of health and production traits in livestock.
Besides DE analyses, transcriptomic studies analyze gene–gene interactions by using a network approach that focuses on the detection of clusters of co-regulated genes. A popular method is the weighted gene co-expression network analysis (WGCNA)  which is implemented in an R-package. It detects the co-expression of genes using Pearson’s correlation and calculates the topological overlap measure (TOM) that represents the number of shared neighbouring genes across gene pairs. Based on this measure, genes are clustered and the clusters are further linked to phenotypic data to reveal the important pathways that are involved in the biological background of the trait under study. Several livestock transcriptomic studies have used this approach to elucidate the genetic and biological background of health and production traits. In sheep, numerous pathways were detected in relation to muscling  and intestinal parasite resistance . In pigs, pathways and genes that affect muscle and meat quality were identified  and in Hanwoo (Korean) cattle, genes related to intramuscular fat (marbling) were detected .
Emerging technologies in transcriptomics
Following the trend in human research, transcriptomic studies that are carried out on livestock species are making a shift from microarray expression data to RNA-sequencing (RNA-Seq) data and providing new opportunities to detect novel transcripts and genetic variants. With RNA-Seq, we now have the possibility of identifying and quantifying: isoforms, exon-specific expression, allele-specific expression and haplotype-specific expression. A comparison of RNA-Seq data with microarray data and its advantages are discussed in detail by Malone and Oliver . Nookaew et al.  performed such a comparison using real expression data in Saccharomyces cerevisiae from both types of platforms and showed that findings based on microarray and RNA-Seq technologies were consistent. As in microarray studies, the most commonly applied method is the detection of DE genes, as was done in cattle [59–61], horses  and pigs [63–65]. Results of DE analyses can be used in a systems biology approach, as reported by Lee et al.  who integrated DE results across tissues. Several other studies have taken a step forward by using a gene co-expression network (GCN) approach using RNA-Seq. In Nellore cattle, a first study detected eight DE genes for feed efficiency , while a second study applied WGCNA to unravel the genetic architecture of feed efficiency  and showed that co-expressed (CE) genes were mainly related to insulin responses and lipid metabolism. These findings combined with data from histopathological analyses of the liver revealed that low feed efficient animals had a larger number of liver lesions than high feed efficient animals. A similar project is being conducted using different cattle breeds (Holstein and Jersey cows) with extreme feed efficiency phenotypes . A gene co-expression network (GCN) approach was applied to pigs to detect co-expressed clusters of genes related to, for example, backfat androstenone phenotype , Salmonella shedding  and obesity-related genes .
Animal reproduction through assisted reproductive technologies, such as in vitro production of embryos (IVEP) combined with genomic selection, can result in rapid genetic improvement . Transcriptomic and systems biology investigations on oocytes and embryo traits related to IVP, embryo transfer and subsequent pregnancy rates have detected biomarkers for successful IVP, embryo transfer and pregnancy rates [74–76].
In pig production, one of the challenges is to reduce boar taint i.e. an offensive taste or odour of the pork emitted during cooking, which makes it unpleasant for consumers. It is caused primarily by two compounds i.e. androstenone produced in the testicles and skatole produced in the hindgut. Both compounds accumulate mainly in the back fat of intact males and the only way to reduce boar taint is by surgical castration. However, surgical castration raises serious pig welfare issues that contributed to a voluntary ban on surgical castration in Europe to avoid boar taint (http://boars2018.com/). Heritabilities of androstenone and skatole levels are moderate to high , which indicates that breeding for low boar taint in males has good potential and may resolve welfare issues in the long-run, since intact males could then be used in the food chain. GWAS has detected several genomic regions associated with androstenone and skatole levels [77–79]. It was shown that boar taint is not significantly correlated with growth traits and litter size [80, 81] and is even favourably correlated with male fertility . Recent studies on RNA-Seq transcriptome profiling of pigs with high or low boar taint showed that there are key differences in the expression profiles of some genes [70, 82]. Following this, our studies now focus on RNA-Seq transcriptomics and systems biology of boar taint in Danish pigs, to identify DE and CE genes and build GCN to improve pig meat production in Denmark . To gain more insight into the regulatory architecture of a particular trait or disease, several approaches can be combined or integrated to elucidate gene–gene interactions.
Another relatively novel and promising method is single-cell transcriptome analysis, giving a deep-sequencing insight into the cell’s gene transcription . This deeper insight can lead to a better understanding of the link between genome and phenome studies, and has potential mainly in cell development research (e.g. stem cell research) and in situations where the collection of biopsy samples in sufficient numbers for RNA-Seq transcriptomics is difficult (e.g. in the case of embryo biopsies before embryo transfer to donor cattle). Although a wide range of candidate genes and transcripts using RNA-Seq analyses can be exploited, the major challenge is to identify true positives. Methods based on mapping and genome assemblies might miss some candidate genes, if appropriate filtering techniques are not used. Another challenge is to quantify G/C blocks, paralogons, isochores, 5′UTR regions, expression specific to splice variants, exon-specific and allele-specific expression. Transcriptomics studies based on RNA-Seq enable the study of non-coding RNAs but how such studies on non-coding RNAs can be used for systems genomics approaches remains to be explored. In the last few years, there have been increasing efforts to sequence small RNAs including miRNAs. Since the initial application of RNA-Seq, quantifying small RNAs, ascertaining alternate splicing events, transcription start sites (TSS) and mapping strand-specific genes  have benefited from such techniques. Many studies are underway to provide rigorous strategies for miRNA-Seq and other small RNA measurements, but this is beyond the scope of this review.
Animal metabolomics and proteomics
Proteomics aims at describing the complete repertoire of proteins in an organism , while metabolomics (or metabonomics) is the study of global metabolite profiles in living systems . Although the use of both terms metabolomics and metabonomics is still debated, analysis of the metabolome is a challenging task since it considers all the metabolites, regardless of their chemical nature, i.e. amino acids, antibodies, aptamers, small biomolecules, etc. and provides coherent gene expression data in an integrated manner. Metabolomics serves not only as a source of qualitative but also quantitative data on intracellular metabolites that are essential for the model-based description of the metabolic network operating under in vivo conditions. In recent years, several studies in livestock have investigated the metabolome, and metabolite profiling studies are now a rapidly expanding area in animal and veterinary genomics. Metabolomics tools aim at filling the gap between genotype and phenotype by permitting the simultaneous monitoring of molecules in a living system. Such metabolic information has applications in clinical practice, in the discovery of biomarkers that are linked to cellular integrity, cell and tissue homeostasis resulting from cell damage or death , and in metabolic engineering to optimize microorganisms for biotechnology.
In dairy cattle, numerous potential biomarkers were detected for milk production and quality by studying the metabolome of different body fluids . Likewise, in chickens, several potential biomarkers were identified for the ascites syndrome by investigating the liver metabolome . Another potential of metabolomics is the prediction of phenotypes that are of economic interest, as was reported for pigs .
Emerging technologies in proteomics and metabolomics
Proteomic and metabolomic datasets provide vast amounts of multi-dimensional data points that need to be carefully quality-controlled, analysed and interpreted. As in genomics and transcriptomics, there are a wide variety of publicly available databases and tools for storing, querying, browsing, analysing and visualizing metabolomic networks. For example, the PathCase Metabolomics Analysis Workbench (PathCaseMAW: http://nashua.case.edu/PathwaysMAW/Web/) runs on a manually created generic mammalian metabolic network. The mapping of protein–protein interactions (PPI) networks to phenotype and disease pathways is a key to understanding various biological and patho-physiological processes. Such interaction studies can be combined with studies on the conservation of non-coding RNAs across large evolutionary distances and on their potential functions in mammalian genomes i.e. . Overall, ecosystems that are revealed by such association networks, assemblies and interaction studies are challenged by various environmental conditions. In this process, a new field has emerged termed “synecology” that deals with the interactions of groups of organisms with their abiotic and biotic environments and is driven by the advances of the meta-omics methods using bioinformatics-centred approaches . This could lead to a new “omics” term i.e. synecomics defined as molecular systems synecology, which will contribute to understand not only the mammalian or animal dynamics, but also the microbial processes that rely on systems-level responses.
Recently, many structural proteomics initiatives were launched to ascertain biochemical and cellular functions and have allowed the design of drugs at the molecular level . The methods used include advances in hardware design, data acquisition methods, sample preparation and further automation of data analysis. With 40 to 50 % of the identified genes corresponding to proteins of unknown function, a functional annotation screening technology using nuclear magnetic resonance (NMR) (FAST-NMR) was developed to assign a biological function. These methods assume that a biological function can be described based on the similarities between binding regions among proteins and that a given ligand interacts with a targeted sequence. The resulting structural and functional assignment to a protein can provide a starting point for the discovery of drugs as well as functional clues for regions that are regulatory or non-regulatory . Nevertheless, functional proteomics/metabolomics has evolved as the necessary next step for which NMR spectroscopy is used to study the functions of a large repertoire of sequences that cannot be inferred based on the current methods for the detection of sequence homologies alone. Moreover, three-dimensional structures of proteins/metabolites contribute greatly to inferring molecular functions (physical and chemical function). We can foresee that systems genomics in the future will embrace large-scale proteomics/metabolomics as an additional layer to provide connections with phenotypic variations.
Functional annotation and pathway analyses
In the preceding sections, we discuss individual ‘omics’ platforms and datasets with regards to their current status and emerging trends. Regardless of which ‘omics’ platforms are used, the Gene Ontology (GO) annotation is the most important and valuable means of assigning functional information using standardized vocabulary. Several computational methods and tools are available for functional annotation across all species. Gene-based annotation can identify whether SNPs or CNV cause protein coding changes. For this purpose, gene definition systems such as RefSeq genes, UCSC genes, ENSEMBL genes, GENCODE genes are used. Genomic region-based annotation identifies variants in specific genomic regions, for example, conserved regions, NGS-based DE/CE regions, transcription factor binding sites, GWAS regions, etc. Although similarity-based GO annotation is widely applied, it primarily encompasses sequence data with reciprocal best hits to predict candidates from a huge repertoire of multi-omics data. However, some of the orthologues of these sequences do not remain associated to GO terms and can be cross-validated with conserved domains, manually reviewed data or determined by wet lab experiments, thus allowing the biological appropriateness of the functional assignments. The unannotated regions in the form of hypothetical proteins or “known unknowns” i.e. their existence is predicted but their function is not known, represent a huge problem, since they remain assigned to the three root terms as in the case of AMIGO (http://amigo.geneontology.org/amigo). A few methods have been designed to integrate different structural and functional results with data corresponding to GO relationships of organisms . In addition, the genome assemblies of many species are regularly refined and updated when new information is available. There has been an increase in the development of integrated analyses that provide comprehensive and robust GO annotations of genome assemblies, providing a solid foundation for functional interrogation of other genomes (http://www.ebi.ac.uk/GOA). Development of pathway maps and identification of unique and novel signals have transformed pathway association studies in cattle . Furthermore, Medical Subject Headings (MeSH, http://www.nlm.nih.gov/mesh) provides a comprehensive life science vocabulary for human and model organisms’ research. Multi-faceted ‘omics’ is aided by the choice of annotation and enrichment analyses for interpreting GO-aided MeSH functional terms. In summary, such GO annotations correspond to specific biological conditions or complex traits in specific species.
Pathway analysis can be described as “a group of statistical methods that exploit a priori knowledge of pathways” . It forms the link between ‘omics’ results and the phenotype/disease under study and provides a biological meaning to the genes and variants detected (interpretation of results). Furthermore, it reduces the multiple-testing burden and, thus, offers a huge analysis potential. Aslibekyan et al.  showed that in spite of the great potential of pathway analysis, there are still many obstacles to overcome (for example, due to the lack of a golden analysis standard).
Emerging technologies in pathway profiling and genetic networks
In spite of the multidimensional HTO efforts to understand phenotypic variation, there remains a major scientific bottleneck regarding the inference of contextual pathways that underlie the translation from variation in biological systems to phenotypic variation. Recently, some integration models that address the characterization of the interaction between functional modules have been reported . For example, the PAthway Network Analysis approach (PANA) integrates high-throughput data and their functional annotation using machine-learning methods . The end-user can detect the functional modules that are associated within the molecular system and the transcriptional connections in a disease or a phenotype. Molecular systems biology integrates networks in the form of pathways, interactions and/or associations. Associations are inferred only as links within the relationships, whereas physical relationships in the form of pull-down assays or biochemical experiments are inferred as interactions. Nevertheless, the paradigm that all interactions are associations but not all associations are interactions can be widely applied across all functional modules. As discussed previously, keeping in view the various biochemical pathways and reactions, the models aim at analysing and measuring the quantity of molecules that are present within a cell . However, the models used depend on the type of assay, on how epigenetic modifications are deduced from the transcriptomic data, on whether or not the disease risk is considered, and on the type of genetic heterogeneity investigated in relation to the phenotypic trait [97, 98].
Many studies have shown the importance of genetic interactions, especially in the determination of complex polygenic traits [99–101], which support the great potential for network genetics. In general, a GWAS investigates the genome of different individuals to detect variants that are associated with a trait, but it does not take the interactions between loci into account. However, studies on genome-wide interactions among SNPs for specific traits are being carried out, see for example, a paper on carcass-related traits in Brahman cattle . One further step is to include epistatic interactions in a network approach, for example using the weighted interaction SNP hub (WISH) network method , which has been successfully applied to a pig resource population to detect genes and pathways related to human obesity . This method pre-selects SNPs based on their genome-wide significance by setting a much lower significance level than in standard GWAS, and subsequent calculation of the epistatic interaction effects between all SNPs is used in a clustering approach. Another promising method is the association weight matrix (AWM) approach, which combines data from several GWAS by looking at interactions between SNPs based on the sizes of their estimated additive effect . This method has successfully identified genes and pathways for growth in cattle  and puberty in tropical cattle breeds . The latest reports show how lncRNAs contribute to regulatory interactions with their non-coding peers such as miRNAs . Whether lncRNA-protein networks restrain interactions is not clearly known. How such regulatory interactions between classes of lncRNAs and proteins can have a significant influence on an organism is a focus of interest. Recently, our group reported the detection of one such lncRNA-protein association that was consistent with interaction networks built from RNA-Seq data . These studies will allow us to understand how such association networks contribute to transcriptional regulation in various organelles. In addition, applying network methods and pathway analyses to genes that are related to a wide range of diseases and phenotypes will allow researchers to gain deeper insight into pathophysiological and biological processes.
Multi-omic data from genome to phenome: integration in systems genomics
The term ‘systems genetics’ or ‘systems genomics’ in an animal breeding context was originally proposed by Kadarmideen et al. , but there are recent reviews on this topic with applications in humans  as well as in animals . As thoroughly discussed in these articles, systems genetics/genomics focuses on the integration of different ‘omics’ levels. This includes a wide range of approaches, from relating the individual’s ‘omics’ levels with functional annotation, both on a single gene level and pathway analysis level, to integrating all different multi-omic levels to phenotypes. A typical data integration process goes from genome → epigenome → transcriptome → metabolome → proteome → phenotype or disease variome.
The integration of data related to protein abundance/mRNA expression using regulatory networks has been investigated with respect to gene expression involved in bovine puberty . Integration of genomic and transcriptomic data, for example using the expression quantitative trait loci (eQTL) approach, detects regions in the genome that are associated with transcript levels . These eQTL can be cis- or trans-acting: a cis-acting eQTL is located near the gene that encodes the transcript, while a trans-acting eQTL is located at quite a distance or even on another chromosome. Several studies in pig have incorporated this approach to detect candidate genes, for example for muscle characteristics [112–114] and obesity phenotypes . A recent multi-parental population study on heterogeneous stocks (HS) in rats, mice and humans identified targeted candidate genes and mapped them to disease phenotypes . In this work, the authors applied differential expression analysis followed by eQTL analysis, and then using a mixed-model analysis based on the sequence of the founder animal, they identified variants within the detected region. Although such studies can detect variants within disease-causing genes, whether or not these causal genes alone may play a role in these complex phenotypes remains a challenge. Questions remain such as: (1) how important these eQTL are for the study of genetic networks that underlie phenotypic variation? (2) Can these eQTL data generated from transcriptomics analyses be linked to the proteomics level? Although transcriptomic and phenomic data may appear uncorrelated, mapping genetic determinants of gene expression (eQTL) can provide a remarkable framework for understanding large phenotypic effects and linking genetic variants to disease .
Clustered regularly interspaced short palindromic repeats (abbreviated as CRISPR) in combination with Cas9 protein (CRISPR/Cas) systems guide RNAs into a cell’s genome (the nuclease) and cut the genome at desired locations (this technique is often referred to as genome-editing). Since it was first introduced by Cong et al. , many site-directed mutagenesis experiments have been carried out across various tissues, large animal models and populations [119, 120]. This site-directed genome editing technology has extensively improved the precision at which genome modifications can be obtained compared to earlier transcription activator-like effector nucleases and zinc-finger nucleases systems. To date, it has not been used extensively in animal and veterinary sciences, but it is clearly foreseen that animal genome modification using CRISPR/Cas systems will play a key role in improving disease resistance or trait performances in animals or to create “designer animals” such as transgenic animals. One other application is that once a causal gene/QTL is validated, it can be specifically edited by applying the CRISPR-Cas system. Such modifications of the genetic architecture of an organism are only just beginning and have not been adequately studied in animals. A recent paper on human albumin produced in pigs through CRISPR/Cas9-mediated knockin of human cDNA into the swine albumin locus at the zygotic level illustrates the potential of this technology for animal research . The use of RNA-Seq transcriptomic studies to infer CRISPR-mediated systems should play a crucial role in advancing this technology even further.
Animal systems genomics is still far from fully exploiting the power of deep NGS. For instance, RNAseq can provide not only accurate measures of gene expression levels but also data on isoforms, exon-specific expression, allele-specific expression and haplotype-specific expression. The benefits of RNA-Seq in systems genomics studies are yet to be fully exploited, for instance, in eQTL studies. Wang et al.  reported the use of the Lyon hypertensive (LH) rat bred for high blood pressure in eQTL mapping by taking advantage of RNA-Seq data. To what extent these additional benefits of RNA-Seq can be integrated in eQTL studies are still unknown but we can foresee the emergence of eQTL studies that will use this information in integrative systems genomics studies.
During the last decade, strategies to detect metabolic QTL have emerged and are based on the characterization of metabolites and small molecules using large-scale analytical methods. Such methods allow researchers to better understand the biochemical pathways that span a metabolic network. Whether or not the environment has an impact on the metabolism can be analyzed by using a phenotypic state of the metabolism called a “metabotype”. This metabolomic/metabotype quantitative trait locus (mQTL) mapping and metabolomic genome-wide association studies (mGWAS) have been widely applied to derive information from genetic polymorphism studies, see for example, a human study of cardiovascular diseases . However, a comprehensive framework to understand this multi-omic convergence of high-resolution metabolomics is lacking such as that developed in the R-package mQTL.NMR for an integrative analytical framework for genomic and metabolomic profiles to characterise mixed systems . The principle of mQTL or mGWAS can be also applied to a list of known metabolites (e.g. low-throughput metabolite profiles consisting of up to 200 compounds) that are assumed to affect a given disease phenotype in animals. For instance, Pant et al.  identified several QTL that influence a large range of metabolites affecting obesity and obesity-related phenotypes via combined linkage disequilibrium linkage analysis (LDLA) in F2 crossbred pigs, and subsequently, investigated the human chromosomal regions that were syntenic to these identified mQTL. Further discussion on this work is beyond the scope of this review, but clearly quality control and analyses of large-scale metabolic/metabolomic phenotype data represent a big challenge for animal genetic studies [126, 127]. Another novel area of research is the mapping of genetic variants that affect protein abundance (pQTL), which to date has been successfully applied on an F2 mouse population  and for the analysis of cellular responses to chemotherapy . Such developments provide great opportunities to identify biomarkers for animal disease and production traits. Regardless of what type of QTL or SNP is detected for animal traits (eQTL, mQTL, pQTL), they can be incorporated into models that aim at understanding/detecting causal and regulatory loci in the genome, as discussed in  and .
In this review, we discuss the current and emerging technologies within the fields of genomics/epigenomics, transcriptomics, metabolomics and proteomics, and provide some examples from livestock species. We re-introduce systems genetics/genomics in a “sequence-space” and multi-omic context with a focus on animal and veterinary biosciences. Due to the enormous progress in (e.g. sequencing) technologies, data generation is becoming cheaper and easier, resulting in huge amounts of data at different ‘omics’ levels. In the last few decades, data were generated to elucidate the biological mechanisms that underlie animal production, health and welfare traits. This has led to great insight into mechanisms and detection of (potential) biomarkers and vaccines, and improved animal breeding strategies. We briefly mention different existing and emerging ‘omic’ technologies and their implementation in livestock species (including genomics, epigenomics, transcriptomics, metabolomics and proteomics). The challenge that remains is to use all these ‘omics’-level data sets efficiently by removing errors/noise via good quality control methods for each layer of dataset, appropriate data integration as per the defined systems genomics hypothesis and statistical models, application of advanced statistical-bioinformatic algorithms and meaningful interpretation of results. The clear advantage of these integrative methods is to increase the power of detecting true causal genes, regulatory networks and pathways leading to improved animal health, welfare and/or production. Outcomes such as causal genes or variants (QTN or QTL), regulator genes, biomarkers and gene networks should be incorporated into genomic selection and breeding programs for larger impact. These prospects are becoming more feasible, as genomic selection methods tend more and more to include various types of QTL information in genomic prediction models. Through such extended, biologically and functionally meaningful and accurate genomic selection methods, improvement of animal production, health and welfare will be even faster and more sustainable.
The article is based on the invited lecture given by HNK at ISAFG2015. HNK conceived the overall framework of this review article and assisted in the writing of the initial draft. PS and LK wrote the initial draft. HNK made significant contributions with the figures. All three authors mutually shared discussions and worked equally. All authors read and approved the final manuscript.
PS was funded by a Grant from the EU-FP7 Marie Curie Actions Grant (CIG-293511) and LJAK was funded by a Grant from the Danish Innovation Fund for the BioChild Project (Grant Number 0603-00457B and Project website: www.biochild.ku.dk). HNK, as a project leader and Grant holder, thanks EU-FP7 Marie Curie Actions Grant and Danish Innovation Fund. This paper is part of the collection ‘ISAFG2015’ (6th International Symposium on Animal Functional Genomics, 27–29 July 2015, Piacenza, Italy). The publication of the papers in this collection was partly sponsored by OECD Co-operative Research Programme: Biological Resource Management for Sustainable Agricultural Systems (CRP). HNK’s participation in ISAFG2015 was financed by OECD Co-operative Research Programme. The opinions expressed and arguments employed in this paper are the sole responsibility of the authors and do not necessarily reflect those of the OECD or of the governments of its Member countries.
The authors declare that they have no competing interests.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- van der Sijde MR, Ng A, Fu J. Systems genetics: from GWAS to disease pathways. Biochim Biophys Acta. 2014;1842:1903–9.PubMedView ArticleGoogle Scholar
- Morrison N, Cochrane G, Faruque N, Tatusova T, Tateno Y, Hancock D, et al. Concept of sample in OMICS technology. OMICS. 2006;10:127–37.PubMedView ArticleGoogle Scholar
- Kadarmideen HN, von Rohr P, Janss LL. From genetical genomics to systems genetics: potential applications in quantitative genomics and animal breeding. Mamm Genome. 2006;17:548–64.PubMedPubMed CentralView ArticleGoogle Scholar
- Kadarmideen HN. Genomics to systems biology in animal and veterinary sciences: progress, lessons and opportunities. Livest Sci. 2014;166:232–48.View ArticleGoogle Scholar
- Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, et al. The Ensembl genome database project. Nucleic Acids Res. 2002;30:38–41.PubMedPubMed CentralView ArticleGoogle Scholar
- Bernstein BE, Meissner A, Lander ES. The mammalian epigenome. Cell. 2007;128:669–81.PubMedView ArticleGoogle Scholar
- Wang Z, Gerstein M, Snyder M. RNA-seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10:57–63.PubMedPubMed CentralView ArticleGoogle Scholar
- Haoudi A, Bensmail H. Bioinformatics and data mining in proteomics. Expert Rev Proteomics. 2006;3:333–43.PubMedView ArticleGoogle Scholar
- Dunn WB, Bailey NJC, Johnson HE. Measuring the metabolome: current analytical technologies. Analyst. 2005;130:606–25.PubMedView ArticleGoogle Scholar
- Wishart DS, Knox C, Guo AC, Eisner R, Young N, Gautam B, et al. HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 2009;37:D603–10.PubMedPubMed CentralView ArticleGoogle Scholar
- Rochfort S. Metabolomics reviewed: a new “omics” platform technology for systems biology and implications for natural products research. J Nat Prod. 2005;68:1813–20.PubMedView ArticleGoogle Scholar
- Mueller UG, Sachs JL. Engineering microbiomes to improve plant and animal health. Trends Microbiol. 2015;23:606–17.PubMedView ArticleGoogle Scholar
- Thomas T, Gilbert J, Meyer F. Metagenomics—a guide from sampling to data analysis. Microb Inform Exp. 2012;2:3.PubMedPubMed CentralView ArticleGoogle Scholar
- Freimer N, Sabatti C. The human phenome project. Nat Genet. 2003;34:15–21.PubMedView ArticleGoogle Scholar
- Paigen K, Eppig JT. A mouse phenome project. Mamm Genome. 2000;11:715–7.PubMedView ArticleGoogle Scholar
- Greenbaum D, Luscombe NM, Jansen R, Qian J, Gerstein M. Interrelating different types of genomic data, from proteome to secretome: ‘oming in on function. Genome Res. 2001;11:1463–8.PubMedView ArticleGoogle Scholar
- Stenuit B, Agathos SN. Deciphering microbial community robustness through synthetic ecology and molecular systems synecology. Curr Opin Biotechnol. 2015;33:305–17.PubMedView ArticleGoogle Scholar
- Li H. Systems genetics in “-omics” era: current and future development. Theory Biosci. 2013;132:1–16.PubMedView ArticleGoogle Scholar
- Woelders H, Te Pas MF, Bannink A, Veerkamp RF, Smits MA. Systems biology in animal sciences. Animal. 2011;5:1036–47.PubMedView ArticleGoogle Scholar
- Zhu M, Yu M, Zhao S. Understanding quantitative genetics in the systems biology era. Int J Biol Sci. 2009;5:161–70.PubMedPubMed CentralView ArticleGoogle Scholar
- Cole JB, Lewis RM, Maltecca C, Newman S, Olson KM, Tait RG Jr. Systems biology in animal breeding: identifying relationships among markers, genes, and phenotypes. J Anim Sci. 2013;91:521–2.PubMedView ArticleGoogle Scholar
- McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics. 2010;26:2069–70.PubMedPubMed CentralView ArticleGoogle Scholar
- Toyoda T, Wada A. Omic space: coordinate-based integration and analysis of genomic phenomic interactions. Bioinformatics. 2004;20:1759–65.PubMedView ArticleGoogle Scholar
- Sharma A, Lee JS, Dang CG, Sudrajad P, Kim HC, Yeon SH, et al. Stories and challenges of genome-wide association studies in livestock—a review. Asian-Australas J Anim Sci. 2015;28:1371–9.PubMedPubMed CentralView ArticleGoogle Scholar
- Hawken RJ, Zhang YD, Fortes MRS, Collis E, Barris WC, Corbet NJ, et al. Genome-wide association studies of female reproduction in tropically adapted beef cattle. J Anim Sci. 2012;90:1398–410.PubMedView ArticleGoogle Scholar
- Do DN, Strathe AB, Ostersen T, Jensen J, Mark T, Kadarmideen HN. Genome-wide association study reveals genetic architecture of eating behavior in pigs and its implications for humans obesity by comparative mapping. PLoS One. 2013;8:e71509.PubMedPubMed CentralView ArticleGoogle Scholar
- Do DN, Strathe AB, Ostersen T, Pant SD, Kadarmideen HN. Genome-wide association and pathway analysis of feed efficiency in pigs reveal candidate genes and pathways for residual feed intake. Front Genet. 2014;5:307.PubMedPubMed CentralView ArticleGoogle Scholar
- Wang H, Misztal I, Aguilar I, Legarra A, Fernando RL, Vitezica Z, et al. Genome-wide association mapping including phenotypes from relatives without genotypes in a single-step (ssGWAS) for 6-week body weight in broiler chickens. Front Genet. 2014;5:134.PubMedPubMed CentralGoogle Scholar
- Kogelman LJA, Pant SD, Fredholm M, Kadarmideen HN. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses. Front Genet. 2014;5:214.PubMedPubMed CentralView ArticleGoogle Scholar
- Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9:29.PubMedPubMed CentralView ArticleGoogle Scholar
- Meuwissen T, Hayes B, Goddard M. Accelerating improvement of livestock with genomic selection. Annu Rev Anim Biosci. 2013;1:221–37.PubMedView ArticleGoogle Scholar
- Legarra A, Aguilar I, Misztal I. A relationship matrix including full pedigree and genomic information. J Dairy Sci. 2009;92:4656–63.PubMedView ArticleGoogle Scholar
- Christensen OF, Madsen P, Nielsen B, Ostersen T, Su G. Single-step methods for genomic evaluation in pigs. Animal. 2012;6:1565–71.PubMedView ArticleGoogle Scholar
- Koivula M, Stranden I, Su G, Mantysaari EA. Different methods to calculate genomic predictions–comparisons of BLUP at the single nucleotide polymorphism level (SNP-BLUP), BLUP at the individual level (G-BLUP), and the one-step approach (H-BLUP). J Dairy Sci. 2012;95:4065–73.PubMedView ArticleGoogle Scholar
- Tribout T, Larzul C, Phocas F. Efficiency of genomic selection in a purebred pig male line. J Anim Sci. 2012;90:4164–76.PubMedView ArticleGoogle Scholar
- Hayes BJ, Bowman PJ, Chamberlain AJ, Goddard ME. Invited review: genomic selection in dairy cattle: progress and challenges. J Dairy Sci. 2009;92:433–43.PubMedView ArticleGoogle Scholar
- Perez-Enciso M, Rincon J, Legarra A. Sequence-vs. chip-assisted genomic selection: accurate biological information is advised. Genet Sel Evol. 2015;47:43.PubMedPubMed CentralView ArticleGoogle Scholar
- Do DN, Janss LL, Jensen J, Kadarmideen HN. SNP annotation-based whole genomic prediction and selection: an application to feed efficiency and its component traits in pigs. J Anim Sci. 2015;93:2056–63.PubMedView ArticleGoogle Scholar
- Zhai Z, Zhao W, He C, Yang K, Tang L, Liu S, et al. SNP discovery and genotyping using restriction-site-associated DNA sequencing in chickens. Anim Genet. 2015;46:216–9.PubMedView ArticleGoogle Scholar
- De Donato M, Peters SO, Mitchell SE, Hussain T, Imumorin IG. Genotyping-by-sequencing (GBS): a novel, efficient and cost-effective genotyping method for cattle using next-generation sequencing. PLoS One. 2013;8:e62137.PubMedPubMed CentralView ArticleGoogle Scholar
- Gorjanc G, Cleveland M, Houston R, Hickey J. Potential of genotyping-by-sequencing for genomic selection in livestock populations. Genet Sel Evol. 2015;47:12.PubMedPubMed CentralView ArticleGoogle Scholar
- Jiang J, Wang J, Wang H, Zhang Y, Kang H, Feng X, et al. Global copy number analyses by next generation sequencing provide insight into pig genome variation. BMC Genomics. 2014;15:593.PubMedPubMed CentralView ArticleGoogle Scholar
- Couldrey C, Cave V. Assessing DNA methylation levels in animals: choosing the right tool for the job. Anim Genet. 2014;45(Suppl 1):15–24.PubMedView ArticleGoogle Scholar
- Meissner A, Gnirke A, Bell GW, Ramsahoye B, Lander ES, Jaenisch R. Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis. Nucleic Acids Res. 2005;33:5868–77.PubMedPubMed CentralView ArticleGoogle Scholar
- Cotney JL, Noonan JP. Chromatin immunoprecipitation with fixed animal tissues and preparation for high-throughput sequencing. Cold Spring Harb Protoc. 2015;2015: pdb.prot084848.Google Scholar
- Trepte P, Buntru A, Klockmeier K, Willmore L, Arumughan A, Secker C, et al. DULIP: a dual luminescence-based co-immunoprecipitation assay for interactome mapping in mammalian cells. J Mol Biol. 2015;427:3375–88.PubMedView ArticleGoogle Scholar
- Kadarmideen HN, Reverter A. Combined genetic, genomic and transcriptomic methods in the analysis of animal traits. CAB Rev perspect Agric Veterinary Sci Nutr Nat Resour 2007;2:16. doi:https://doi.org/10.1079/PAVSNNR20072042.
- Bourneuf E, Hérault F, Chicault C, Carré W, Assaf S, Monnier A, et al. Microarray analysis of differential gene expression in the liver of lean and fat chickens. Gene. 2006;372:162–70.PubMedView ArticleGoogle Scholar
- Lehnert SA, Reverter A, Byrne KA, Wang Y, Nattrass GS, Hudson NJ, et al. Gene expression studies of developing bovine longissimus muscle from two different beef cattle breeds. BMC Dev Biol. 2007;7:95.PubMedPubMed CentralView ArticleGoogle Scholar
- Huang TH, Zhu MJ, Li XY, Zhao SH. Discovery of porcine microRNAs and profiling from skeletal muscle tissues during development. PLoS One. 2008;3:e3225.PubMedPubMed CentralView ArticleGoogle Scholar
- Kadarmideen HN, Watson-Haigh NS, Andronicos NM. Systems biology of ovine intestinal parasite resistance: disease gene modules and biomarkers. Mol BioSyst. 2011;7:235–46.PubMedView ArticleGoogle Scholar
- Band MR, Olmstead C, Everts RE, Liu ZL, Lewin HA. A 3800 gene microarray for cattle functional genomics: comparison of gene expression in spleen, placenta, and brain. Anim Biotechnol. 2002;13:163–72.PubMedView ArticleGoogle Scholar
- Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinf. 2008;9:559.View ArticleGoogle Scholar
- Kogelman LJA, Byrne K, Vuocolo T, Watson-Haigh N, Kadarmideen HN, Kijas J, et al. Genetic architecture of gene expression in ovine skeletal muscle. BMC Genomics. 2011;12:607.PubMedPubMed CentralView ArticleGoogle Scholar
- Ponsuksili S, Du Y, Hadlich F, Siengdee P, Murani E, Schwerin M, et al. Correlated mRNAs and miRNAs from co-expression and regulatory networks affect porcine muscle and finally meat properties. BMC Genomics. 2013;14:533.PubMedPubMed CentralView ArticleGoogle Scholar
- Lim D, Lee SH, Kim NK, Cho YM, Chai HH, Seong HH, et al. Gene co-expression analysis to characterize genes related to marbling trait in Hanwoo (Korean) cattle. Asian-Australas J Anim Sci. 2013;26:19–29.PubMedPubMed CentralView ArticleGoogle Scholar
- Malone JH, Oliver B. Microarrays, deep sequencing and the true measure of the transcriptome. BMC Biol. 2011;9:34.PubMedPubMed CentralView ArticleGoogle Scholar
- Nookaew I, Papini M, Pornputtapong N, Scalcinati G, Fagerberg L, Uhlen M, et al. A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae. Nucleic Acids Res. 2012;40:10084–97.PubMedPubMed CentralView ArticleGoogle Scholar
- McCabe M, Waters S, Morris D, Kenny D, Lynn D, Creevey C. RNA-seq analysis of differential gene expression in liver from lactating dairy cows divergent in negative energy balance. BMC Genomics. 2012;13:93.View ArticleGoogle Scholar
- Li Y, Carrillo JA, Ding Y, He Y, Zhao C, Liu J, et al. Transcriptomic profiling of spleen in grass-fed and grain-fed Angus cattle. PLoS One. 2015;10:e0135670.PubMedPubMed CentralView ArticleGoogle Scholar
- McLoughlin KE, Nalpas NC, Rue-Albrecht K, Browne JA, Magee DA, Killick KE, et al. RNA-seq transcriptional profiling of peripheral blood leukocytes from cattle infected with Mycobacterium bovis. Front Immunol. 2014;5:396.PubMedPubMed CentralView ArticleGoogle Scholar
- Park KD, Park J, Ko J, Kim BC, Kim HS, Ahn K, et al. Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq. BMC Genomics. 2012;13:473.PubMedPubMed CentralView ArticleGoogle Scholar
- Chen C, Ai H, Ren J, Li W, Li P, Qiao R, Ouyang J, et al. A global view of porcine transcriptome in three tissues from a full-sib pair with extreme phenotypes in growth and fat deposition by paired-end RNA sequencing. BMC Genomics. 2011;12:448.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang X, Huang L, Wu T, Feng Y, Ding Y, Ye P, et al. Transcriptomic analysis of ovaries from pigs with high and low litter size. PLoS One. 2015;10:e0139514.PubMedPubMed CentralView ArticleGoogle Scholar
- Jing L, Hou Y, Wu H, Miao Y, Li X, Cao J, et al. Transcriptome analysis of mRNA and miRNA in skeletal muscle indicates an important network for differential residual feed intake in pigs. Sci Rep. 2015;5:11953.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee HJ, Park HS, Kim W, Yoon D, Seo S. Comparison of metabolic network between muscle and intramuscular adipose tissues in Hanwoo Beef cattle using a systems biology approach. Int J Genomics. 2014;2014:679437.PubMedPubMed CentralGoogle Scholar
- Tizioto PC, Coutinho LL, Decker JE, Schnabel RD, Rosa KO, Oliveira PS, et al. Global liver gene expression differences in Nelore steers with divergent residual feed intake phenotypes. BMC Genomics. 2015;16:242.PubMedPubMed CentralView ArticleGoogle Scholar
- Alexandre PA, Kogelman LJ, Santana MH, Passarelli D, Pulz LH, Fantinato-Neto P, et al. Liver transcriptomic networks reveal main biological processes associated with feed efficiency in beef cattle. BMC Genomics. 2015;16:1073.PubMedPubMed CentralView ArticleGoogle Scholar
- Salleh MSB, Höglund J, Løvendahl P, Kadarmideen HN. Systems genetics and transcriptomics of feed efficiency in Nordic Dairy cattle. In: Proceedings of the 66th Annual Meeting of European Association for Animal Production: 31 August–4 September 2015; Warsaw.Google Scholar
- Sahadevan S, Tholen E, Grosse-Brinkhaus C, Schellander K, Tesfaye D, Hofmann-Apitius M, et al. Identification of gene co-expression clusters in liver tissues from multiple porcine populations with high and low backfat androstenone phenotype. BMC Genet. 2015;16:21.PubMedPubMed CentralView ArticleGoogle Scholar
- Kommadath A, Bao H, Arantes AS, Plastow GS, Tuggle CK, Bearson SM, et al. Gene co-expression network analysis identifies porcine genes associated with variation in Salmonella shedding. BMC Genomics. 2014;15:452.PubMedPubMed CentralView ArticleGoogle Scholar
- Kogelman LJA, Cirera S, Zhernakova DV, Fredholm M, Franke L, Kadarmideen HN. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model. BMC Med Genomics. 2014;7:57.PubMedPubMed CentralView ArticleGoogle Scholar
- Kadarmideen HN, Mazzoni G, Watanabe YF, Strøbech L, Baruselli PS, Meirelles F, et al. Genomic selection on in vitro produced and somatic cell nuclear transfer embryos for rapid genetic improvement in cattle production. Anim Reprod. 2015;12:389–96.Google Scholar
- Cao S, Han J, Wu J, Li Q, Liu S, Zhang W, Pei Y, et al. Specific gene-regulation networks during the pre-implantation development of the pig embryo as revealed by deep sequencing. BMC Genomics. 2014;15:4.PubMedPubMed CentralView ArticleGoogle Scholar
- Jiang Z, Sun J, Dong H, Luo O, Zheng X, Obergfell C, et al. Transcriptional profiles of bovine in vivo pre-implantation development. BMC Genomics. 2014;15:756.PubMedPubMed CentralView ArticleGoogle Scholar
- Strøbech L, Mazzoni G, Pedersen HS, Freude KK, Kadarmideen H, Callesen H, et al. In vitro production of bovine embryos—revisiting oocyte development and application of systems biology. Anim Reprod. 2015;12:465–72.Google Scholar
- Grindflek E, Meuwissen TH, Aasmundstad T, Hamland H, Hansen MH, Nome T, et al. Revealing genetic relationships between compounds affecting boar taint and reproduction in pigs. J Anim Sci. 2011;89:680–92.PubMedView ArticleGoogle Scholar
- Große-Brinkhaus C, Storck LC, Frieden L, Neuhoff C, Schellander K, Looft C, Tholen E. Genome-wide association analyses for boar taint components and testicular traits revealed regions having pleiotropic effects. BMC Genet. 2015;16:36.PubMedPubMed CentralView ArticleGoogle Scholar
- Rowe SJ, Karacaoren B, de Koning DJ, Lukic B, Hastings-Clark N, Velander I, et al. Analysis of the genetics of boar taint reveals both single SNPs and regional effects. BMC Genomics. 2014;15:424.PubMedPubMed CentralView ArticleGoogle Scholar
- Strathe AB, Velander IH, Mark T, Kadarmideen HN. Genetic parameters for androstenone and skatole as indicators of boar taint and their relationship to production and litter size traits in Danish Landrace. J Anim Sci. 2013;91:2587–95.PubMedView ArticleGoogle Scholar
- Strathe AB, Velander IH, Mark T, Ostersen T, Hansen C, Kadarmideen HN. Genetic parameters for male fertility and its relationship to skatole and androstenone in Danish Landrace boars. J Anim Sci. 2013;91:4659–68.PubMedView ArticleGoogle Scholar
- Gunawan A, Sahadevan S, Neuhoff C, Grosse-Brinkhaus C, Gad A, Frieden L, et al. RNA deep sequencing reveals novel candidate genes and polymorphisms in boar testis and liver tissues with divergent androstenone levels. PLoS One. 2013;8:e63259.PubMedPubMed CentralView ArticleGoogle Scholar
- Drag M, Kogelman LJA, Meinert L, Maribo H, Kadarmideen HN. Genomics and systems biology of boar taint and meat quality in pigs. In: Proceedings of the 6th international symposium on animal functional genomics: 27–29 July 2015; Piacenza; 2015.Google Scholar
- Tang F, Lao K, Surani MA. Development and applications of single-cell transcriptome analysis. Nat Methods. 2011;8(4 Suppl):S6–11.PubMedPubMed CentralGoogle Scholar
- Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet. 2011;12:87–98.PubMedPubMed CentralView ArticleGoogle Scholar
- Claudino WM, Quattrone A, Biganzoli L, Pestrin M, Bertini I, Di Leo A. Metabolomics: available results, current research projects in breast cancer, and future applications. J Clin Oncol. 2007;25:2840–6.PubMedView ArticleGoogle Scholar
- Sun HZ, Wang DM, Wang B, Wang JK, Liu HY, le Guan L, et al. Metabolomics of four biofluids from dairy cows: potential biomarkers for milk production and quality. J Proteome Res. 2015;14:1287–98.PubMedView ArticleGoogle Scholar
- Shen Y, Shi S, Tong H, Guo Y, Zou J. Metabolomics analysis reveals that bile acids and phospholipids contribute to variable responses to low-temperature-induced ascites syndrome. Mol BioSyst. 2014;10:1557–67.PubMedView ArticleGoogle Scholar
- Rohart F, Paris A, Laurent B, Canlet C, Molina J, Mercat MJ, et al. Phenotypic prediction based on metabolomic data for growing pigs from three main European breeds. J Anim Sci. 2012;90:4729–40.PubMedView ArticleGoogle Scholar
- Wenguang Z, Jianghong W, Jinquan L, Yashizawa M. A subset of skin-expressed microRNAs with possible roles in goat and sheep hair growth based on expression profiling of mammalian microRNAs. OMICS. 2007;11:385–96.PubMedView ArticleGoogle Scholar
- Powers R, Mercier KA, Copeland JC. The application of FAST-NMR for the identification of novel drug discovery targets. Drug Discov Today. 2008;13:172–9.PubMedPubMed CentralView ArticleGoogle Scholar
- Benso A, Di Carlo S, Ur Rehman H, Politano G, Savino A, Suravajhala P. A combined approach for genome wide protein function annotation/prediction. Proteome Sci. 2013;11(Suppl 1):S1.PubMedPubMed CentralView ArticleGoogle Scholar
- Gutierrez-Gil B, Arranz JJ, Wiener P. An interpretive review of selective sweep studies in Bos taurus cattle populations: identification of unique and shared selection signals across breeds. Front Genet. 2015;6:167.PubMedPubMed CentralGoogle Scholar
- Aslibekyan S, Almeida M, Tintle N. Pathway analysis approaches for rare and common variants: insights from Genetic Analysis Workshop 18. Genet Epidemiol. 2014;38(Suppl 1):S86–91.PubMedPubMed CentralView ArticleGoogle Scholar
- Ponzoni I, Nueda M, Tarazona S, Gotz S, Montaner D, Dussaut J, et al. Pathway network inference from gene expression data. BMC Syst Biol. 2014;8(Suppl 2):S7.PubMedPubMed CentralView ArticleGoogle Scholar
- Deans AR, Lewis SE, Huala E, Anzaldo SS, Ashburner M, Balhoff JP, et al. Finding our way through phenotypes. PLoS Biol. 2015;13:e1002033.PubMedPubMed CentralView ArticleGoogle Scholar
- Mason CE, Porter SG, Smith TM. Characterizing multi-omic data in systems biology. Adv Exp Med Biol. 2014;799:15–38.PubMedView ArticleGoogle Scholar
- Wray NR, Goddard ME. Multi-locus models of genetic risk of disease. Genome Med. 2010;2:10.PubMedPubMed CentralView ArticleGoogle Scholar
- Huang W, Richards S, Carbone MA, Zhu D, Anholt RR, Ayroles JF, et al. Epistasis dominates the genetic architecture of Drosophila quantitative traits. Proc Natl Acad Sci USA. 2012;109:15553–9.PubMedPubMed CentralView ArticleGoogle Scholar
- Jarvis JP, Cheverud JM. Mapping the epistatic network underlying murine reproductive fatpad variation. Genetics. 2011;187:597–610.PubMedPubMed CentralView ArticleGoogle Scholar
- Shao H, Burrage LC, Sinasac DS, Hill AE, Ernest SR, O’Brien W, et al. Genetic architecture of complex traits: large phenotypic effects and pervasive epistasis. Proc Natl Acad Sci USA. 2008;105:19910–4.PubMedPubMed CentralView ArticleGoogle Scholar
- Ali AA, Khatkar MS, Kadarmideen HN, Thomson PC. Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle. J Anim Breed Genet. 2015;132:187–97.PubMedView ArticleGoogle Scholar
- Kogelman LJA, Kadarmideen H. Weighted interaction SNP hub (WISH) network method for building genetic networks for complex diseases and traits using whole genome genotype data. BMC Syst Biol. 2014;8(Suppl 2):S5.PubMedPubMed CentralView ArticleGoogle Scholar
- Fortes MRS, Reverter A, Zhang Y, Collis E, Nagaraj SH, Jonsson NN, et al. Association weight matrix for the genetic dissection of puberty in beef cattle. Proc Natl Acad Sci USA. 2010;107:13642–7.PubMedPubMed CentralView ArticleGoogle Scholar
- Widmann P, Reverter A, Fortes MR, Weikard R, Suhre K, Hammon H, et al. A systems biology approach using metabolomic data reveals genes and pathways interacting to modulate divergent growth in cattle. BMC Genomics. 2013;14:798.PubMedPubMed CentralView ArticleGoogle Scholar
- Fortes MRS, Reverter A, Nagaraj SH, Zhang Y, Jonsson NN, Barris W, Lehnert S, et al. A single nucleotide polymorphism-derived regulatory gene network underlying puberty in 2 tropical breeds of beef cattle. J Anim Sci. 2011;89:1669–83.PubMedView ArticleGoogle Scholar
- Jalali S, Bhartiya D, Lalwani MK, Sivasubbu S, Scaria V. Systematic transcriptome wide analysis of lncRNA–miRNA interactions. PLoS One. 2013;8:e53823.PubMedPubMed CentralView ArticleGoogle Scholar
- Suravajhala P, Kogelman LJA, Mazzoni G, Kadarmideen HN. Potential role of lncRNA cyp2c91–protein interactions on diseases of the immune system. Front Genet. 2015;6:255.PubMedPubMed CentralGoogle Scholar
- Civelek M, Lusis AJ. Systems genetics approaches to understand complex traits. Nat Rev Genet. 2014;15:34–48.PubMedPubMed CentralView ArticleGoogle Scholar
- Canovas A, Reverter A, DeAtley KL, Ashley RL, Colgrave ML, Fortes MR, et al. Multi-tissue omics analyses reveal molecular regulatory networks for puberty in composite beef cattle. PLoS One. 2014;9:e102551.PubMedPubMed CentralView ArticleGoogle Scholar
- Westra HJ, Franke L. From genome to function by studying eQTLs. Biochim Biophys Acta. 2014;1842:1896–902.PubMedView ArticleGoogle Scholar
- Ponsuksili S, Murani E, Schwerin M, Schellander K, Wimmers K. Identification of expression QTL (eQTL) of genes expressed in porcine M. longissimus dorsi and associated with meat quality traits. BMC Genomics. 2010;11:572.PubMedPubMed CentralView ArticleGoogle Scholar
- Steibel JP, Bates RO, Rosa GJ, Tempelman RJ, Rilington VD, Ragavendran A, et al. Genome-wide linkage analysis of global gene expression in loin muscle tissue identifies candidate genes in pigs. PLoS One. 2011;6:e16766.PubMedPubMed CentralView ArticleGoogle Scholar
- Ponsuksili S, Jonas E, Murani E, Phatsara C, Srikanchai T, Walz C, et al. Trait correlated expression combined with expression QTL analysis reveals biological pathways and candidate genes affecting water holding capacity of muscle. BMC Genomics. 2008;9:367.PubMedPubMed CentralView ArticleGoogle Scholar
- Kogelman LJ, Zhernakova DV, Westra HJ, Cirera S, Fredholm M, Franke L, et al. An integrative systems genetics approach reveals potential causal genes and pathways related to obesity. Genome Med. 2015;7:105.PubMedPubMed CentralView ArticleGoogle Scholar
- Tsaih SW, Holl K, Jia S, Kaldunski M, Tschannen M, He H, et al. Identification of a novel gene for diabetic traits in rats, mice, and humans. Genetics. 2014;198:17–29.PubMedPubMed CentralView ArticleGoogle Scholar
- Buchner DA, Nadeau JH. Contrasting genetic architectures in different mouse reference populations used for studying complex traits. Genome Res. 2015;25:775–91.PubMedPubMed CentralView ArticleGoogle Scholar
- Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–23.PubMedPubMed CentralView ArticleGoogle Scholar
- Jinwei Z, Qipin X, Jing Y, Shumin Y, Suizhong C. CRISPR/Cas9 genome editing technique and its application in site-directed genome modification of animals. Hereditas. 2015;37:1011–20.PubMedGoogle Scholar
- Whitelaw CB, Sheets TP, Lillico SG, Telugu BP. Engineering large animal models of human disease. J Pathol. 2015;238:247–56.PubMedPubMed CentralView ArticleGoogle Scholar
- Peng J, Wang Y, Jiang J, Zhou X, Song L, Wang L, et al. Production of human albumin in pigs through CRISPR/Cas9-mediated knockin of human cDNA into swine albumin locus in the zygotes. Sci Rep. 2015;5:16705.PubMedPubMed CentralView ArticleGoogle Scholar
- Wang J, Ma MC, Mennie AK, Pettus JM, Xu Y, Lin L, et al. Systems biology with high-throughput sequencing reveals genetic mechanisms underlying the metabolic syndrome in the Lyon hypertensive rat. Circ Cardiovasc Genet. 2015;8:316–26.PubMedPubMed CentralView ArticleGoogle Scholar
- Kraus WE, Muoio DM, Stevens R, Craig D, Bain JR, Grass E, et al. Metabolomic quantitative trait loci (mQTL) mapping implicates the ubiquitin proteasome system in cardiovascular disease pathogenesis. PLoS Genet. 2015;11:e1005553.PubMedPubMed CentralView ArticleGoogle Scholar
- Hedjazi L, Gauguier D, Zalloua PA, Nicholson JK, Dumas ME, Cazier JB. mQTL.NMR: an integrated suite for genetic mapping of quantitative variations of 1H NMR-based metabolic profiles. Anal Chem. 2015;87:4377–84.PubMedView ArticleGoogle Scholar
- Pant SD, Karlskov-Mortensen P, Jacobsen MJ, Cirera S, Kogelman LJ, Bruun CS, et al. Comparative analyses of QTLs influencing obesity and metabolic phenotypes in pigs and humans. PLoS One. 2015;10:e0137356.PubMedPubMed CentralView ArticleGoogle Scholar
- Gauguier D. Application of quantitative metabolomics in systems genetics in rodent models of complex phenotypes. Arch Biochem Biophys. 2016;589:158–67.PubMedView ArticleGoogle Scholar
- Dumas ME. Metabolome 2.0: quantitative genetics and network biology of metabolic phenotypes. Mol BioSyst. 2012;8:2494–502.PubMedView ArticleGoogle Scholar
- Holdt LM, von Delft A, Nicolaou A, Baumann S, Kostrzewa M, Thiery J, et al. Quantitative trait loci mapping of the mouse plasma proteome (pQTL). Genetics. 2013;193:601–8.PubMedPubMed CentralView ArticleGoogle Scholar
- Stark AL, Hause RJ Jr, Gorsic LK, Antao NN, Wong SS, Chung SH, et al. Protein quantitative trait loci identify novel candidates modulating cellular response to chemotherapy. PLoS Genet. 2014;10:e1004192.PubMedPubMed CentralView ArticleGoogle Scholar