The advantages of using mouse models in biomedicine have been discussed extensively. Some benefits are the following: (i) the availability of genetic tools for creating disease models by transgenic, knockout, and knock-in technologies; (ii) inbred mouse strains are nearly isogenic, enabling to study how the same genetic mutation modifies a phenotype of interest in different genetic backgrounds; (iii) mouse tissues are available for omics studies which can be challenging to obtain from humans. Some limitations include different evolutive pressures for mice and humans; therefore, some systems, such as the immune system, do not function similarly in both species.
1. Hybrid Mouse Diversity Panel
Currently available resources in rodents to find modifiers genes by association studies can be defined in two categories: (i) reference panels, consisting of inbred strains such as the Hybrid Mouse Diversity Panel (HMDP) and the Collaborative Cross (CC); (ii) populations derived from pseudo-random breeding of inbred strains, such as the Diversity Outbred (DO) and Heterogeneous Stock (HS) (Figure 1).
Figure 1. Breeding schemes for inbred (CC) and outbred (HS and DO) mice populations: Inbred founder strains for each panel are indicated in the right box. CC and DO populations share the same eight founder strains, five of which are standard laboratory inbred strains, while three are wild-derived strains. Colors represent the genotypes of strain chromosomes. The first steps include the combination of all eight founder genomes (outcrosses). CC is then generated as a recombinant inbred (RI) after multiple brother–sister breeding. HS and DO panels were developed as high-diversity outbred panels by over 40 generations of random outcrosses. DO was created from partially inbred Collaborative Cross (CC) mice. Quantitative phenotyping can be performed in the strains and used for gene mapping. Some signals in chromosomal locations will probably pass the threshold of significance (red line) in the LOD plot. The functional relevance of these variants can be assessed in animal models such as knockout mice and induced pluripotent stem cells (iPSC) derived from patients.
HMDP is a large panel of approximately 100 commercially available (
https://www.jax.org (accessed on 22 May 2022)) and fully sequenced (
www.sanger.ac.uk/science/data/mouse-genomes-project (accessed on 22 May 2022)) inbred strains: ~30 classical inbred strains and ~70 recombinant inbred (RI) strains derived mainly from crosses between C57BL/6J and DBA mice and A/J and C57BL/6J mice
[1].
Advantages of using the HMDP panel are the following: (i) their genomes are known (
http://mouse.cs.ucla.edu/mouseHapMap/ (accessed on 22 May 2022)); thus, it is unnecessary to spend funds performing this step; (ii) HMDP possesses ~4 million common single-nucleotide variants (SNVs), which is similar to the number present in humans
[2]; (iii) high-resolution association mapping
[1], which is at least an order of magnitude higher than in linkage analysis; (iv) it is possible to integrate gene mapping with other omics (transcriptomics, proteomics, and metabolomics data)
[3]; (v) commercially available (from The Jackson Laboratory, Harlan, and others); (vi) sufficient bioinformatics tools for data mining of complex mouse and human disease traits, such as the Systems Genetics Resource (SGR) (
http://systems.genetics.ucla.edu (accessed on 22 May 2022)); (vii) servers to perform association mapping and statistical power simulation, which are also available in R to run them in house
[4].
The HMDP also has limitations. For example, extensive linkage disequilibrium (LD) blocks are observed, both within and between chromosomes, probably as a result of the selection of allelic combinations conceding higher fitness during the inbreeding
[5]. Consequently, regions in LD can lead to false-positive associations in GWAS analyses. Although the HMDP has a high mapping resolution, the statistical power to detect the effect of loci is small (estimated at 50% to variants explaining 10% of the trait variance)
[1]. Since most loci contributing to a complex trait have an effect size below 5%
[6], variants with subtle effects cannot always be detected by the HMDP. Power can be enhanced by including additional inbred and RI strains and performing meta-analyses from other panels such as the CC or traditional crosses
[7].
An exciting application of the use of mouse panels in translational research comes from crossing the classical Alzheimer’s disease (AD) mouse model (5XFAD) bearing mutations in APP and PSEN1 with 28 different strains of the BXD panel (AD-BXD). The F1 represents isogenic lines that were studied in a controlled environment. The AD-BXD panel mimicked several signs of the AD patients, including phenotypic variation in disease onset and severity. As in humans, the Apoe allele significantly affected spatial memory and other behavioral tests in the AD-BXD panel. Furthermore, hippocampal gene expression in the severe and mild lines agrees with transcriptomic changes observed in patients
[8].
2. The Collaborative Cross (CC) Panel
The CC is a large panel of RI mouse strains obtained through systematically outcrossing eight founder strains, followed by randomized breeding
[9]. The founder strains of the CC include five of the widely used classical inbred laboratory strains (A/J, C57BL/6J, NOD/ShiLtJ, 129S1/SvImJ, and NZO/HILtJ), as well as three wild-derived strains descendent of three
M musculus subspecies (WSB, Castaneous, and PWK) (
Figure 1). These eight strains have been fully sequenced and carry ~45 million SNVs, four times more than those of classical laboratory mouse strains
[10].
The genomes of the CC panel are known (
http://csbio.unc.edu/CCstatus/CCGenomes (accessed on 22 May 2022)), which is helpful for genetic association studies. Haplotypes can be easily visualized or reconstructed as a mosaic of the genomes of the founders
[11]. Parental strains capture approximately 90% of the genetic diversity seen in the
Mus musculus species
[12]. This high genetic diversity significantly reduces false candidate loci. Additionally, randomized breeding substantially increases mapping resolution by reducing population structure effects
[13]. CC strains have been used to map quantitative trait loci (QTLs) to less than 5 Mb intervals
[14]. Online tools are available to perform GWAS and linkage analyses
[15]. Several aspects of human genetics and behavioral factors can be modeled in this system, including the heterogeneities observed in neurodevelopmental disorders such as autistic spectrum disorders (ASDs)
[16]. The CC panel allowed the discovery of novel candidate severity modifiers of ASD, e.g.,
Bai3, considered a potential target for pharmacological intervention
[16].
Some considerations associated with using the CC panel are the following: (i) unique outlier phenotypes can arise in large studies, probably due to the complex genetic regulatory networks involving multiple loci with epistatic interactions
[17]; in such cases, the preferred approach for identifying causal genes is traditional F2 analysis or backcrosses
[18]; (ii) because identifying loci could be time-consuming, it is suggested to perform a pilot study and expand as necessary
[17]; (iii) creating a panel like the CC can generate breeding complications and infertility, mainly caused by genomic incompatibility introduced by the wild-derived strains. For that reason, the initial CC project aimed to produce 1000 strains but finished with only ~100 and inspired the creation of the Diversity Outbred (DO) population.
CC lines have been used for genetic association studies of many complex traits. QTL mapping for 15 metabolism- and exercise-related traits revealed five significant loci for body weight, some of which overlapped with previous human studies
[19]. Gene mapping of rotarod (exercise) performance and body weight identified 45 loci, many of them related to neurological disorders and obesity in humans, suggesting a link between physical activity and neurodegeneration
[20]. A study of glucose tolerance response in the CC panel identified, only in female mice, a genomic region comprising 51 genes. This research highlighted sex differences in glucose response which should be considered in human studies
[21]. The CC panel is also a valuable and reliable resource for studying host–pathogen interactions
[17]. For example, to map genetic modifiers affecting the severity of
Pseudomonas aeruginosa lung infections, 39 CC lines were inoculated with this pathogen. The phenotypic variability was enormous, ranging from complete resistance to lethality. It is particularly relevant to study the resistant lines since they have the biological secrets to design novel therapies for the susceptible. Genomic mapping and functional validation identified dihydropyrimidine dehydrogenase (
Dpyd) and sphingosine-1-phosphate receptor 1 (
S1pr1) as modifier genes. In a cohort of patients with cystic fibrosis, two SNVs in the
S1PR1 gene are associated with
Pseudomonas aeruginosa infection
[22], again indicating the translational relevance of multigenetic background studies in animal organisms.
3. Heterogeneous Stock and Diversity Outbred Populations
Both HS and DO are high-diversity outbred mice populations. The HS was established by breeding eight inbred strains and then outbreeding them in either a circular strategy or using random crosses (
Figure 1) to minimize inbreeding
[23]. After 50 or more generations, the HS-generated mice were a genetic mosaic of the founders’ haplotypes
[24][25]. On the other hand, the DO was established from partially inbred CC lines and is maintained indefinitely through pseudorandomized fashion non-sibling mating
[26] (
Figure 1). Since the DO is derived from the same eight founders as the CC, it presents the same allelic diversity as the CC strains. It can be used as a complementary tool in genetic association studies
[27].
There are several advantages of using HS or DO mice compared to classical inbred mice. The outbred randomized mating increases the number of additional recombination sites compared to those of classically inbred mice; thus, each HS or DO mouse has a unique genome, which is a mosaic of the original eight founder lines, resembling human heterozygosity and allows high-resolution genetic mapping
[27]. HS and DO mice have been used to finely map to intervals of 2.7 Mb
[28] and less than 2 Mb
[27], respectively. In addition, outbred animals are more vigorous and less prone to both early and late recessive allelic effects
[29]. This genetic variability within both HS and DO populations results in a high degree of phenotypic variability; thus, outbred models enable the fine mapping of many phenotypic traits. Since the founders of CC and DO lines include wild-derived strains, unique behaviors can be observed compared to classical laboratory strains and represent a valuable tool for genetic behavior association studies
[10]. A repository of DO QTL studies can be shared between laboratories (
https://dodb.jax.org (accessed on 22 May 2022)). Lastly, the founders of the HS and DO populations have been sequenced
[30], reducing time and expense in locating the sequences.
Alternatively, some considerations must be made in the case of HS and DO mice. Since each outbred animal is genetically and phenotypically distinct, each HS and DO mouse requires genotyping and haplotype reconstruction to perform each QTL analysis
[26]. High-resolution mapping can be achieved with these panels, but analyzing many animals is necessary for sufficient statistical power, which is not always possible
[31]. Candidate modifiers of wild behaviors can be identified with outbred mice. However, it is challenging to validate in these panels because each animal has a unique genotype, in contrast to inbred lines
[32].
An interesting translational study using the DO panel identified a diagnostic biomarker for human tuberculosis (TB). By applying machine learning algorithms to multidimensional data, the authors discovered CXCL1 as a putative biomarker of TB in the serum of mice. The biomarker was further validated in samples derived from human patients, discriminating active TB from latent infection and non-TB lung disease
[33].
This entry is adapted from the peer-reviewed paper 10.3390/ijms23147570