DNA methylation at the Igf2/H19 imprinting control region is associated with cerebellum mass in outbred mice

Background Insulin-like growth factor 2 (Igf2) is a paternally expressed imprinted gene regulating fetal growth, playing an integral role in the development of many tissues including the brain. The parent-of-origin specific expression of Igf2 is largely controlled by allele-specific DNA methylation at CTCF-binding sites in the imprinting control region (ICR), located immediately upstream of the neighboring H19 gene. Previously we reported evidence of a negative correlation between DNA methylation in this region and cerebellum weight in humans. Results We quantified cerebellar DNA methylation across all four CTCF binding sites spanning the murine Igf2/H19 ICR in an outbred population of Heterogeneous Stock (HS) mice (n = 48). We observe that DNA methylation at the second and third CTCF binding sites in the Igf2/H19 ICR shows a negative relationship with cerebellar mass, reflecting the association observed in human post-mortem cerebellum tissue. Conclusions Given the important role of the cerebellum in motor control and cognition, and the link between structural cerebellar abnormalities and neuropsychiatric phenotypes, the identification of epigenetic factors associated with cerebellum growth and development may provide important insights about the etiology of psychiatric disorders.


Background
Genomic imprinting regulates the monoallelic expression of genes in a parent-of-origin specific manner. To date >90 imprinted loci have been identified in the mouse genome [1], many residing in co-regulated clusters with other imprinted genes, and their allele-specific expression controls a diverse range of functions including growth and development [1]. Of note, the imprinted insulin-like growth factor 2 (Igf2) gene located on mouse distal chromosome 7, along with other genes in the insulin and insulin-like growth factor regulatory pathway, has been shown to be integral for fetal growth and the development of many tissues including the brain [2][3][4][5].
In most somatic cells Igf2 and the neighboring H19 gene are reciprocally imprinted; Igf2 gene expression is silenced on the maternal allele, whereas H19 is silenced on the paternal allele. In mice this allele-specific expression is associated with allele-specific DNA methylation at several differentially methylated regions (DMRs) in the Igf2 gene and at the Igf2/H19 imprinting control region (ICR) located immediately upstream of the H19 promoter [6]. The ICR contains four methylation-sensitive CCCTCbinding factor (CTCF) binding-sites mediating the assembly of a chromatin insulator that blocks interactions between the Igf2 promoter and enhancers downstream of the H19 gene. On the unmethylated maternal allele, CTCF binds to the ICR silencing Igf2 expression and stimulating the transcription of H19 (Figure 1). IGF2 is a major driver of prenatal growthfor example, placenta specific Igf2 transcripts control the growth of the placenta and supply of maternal nutrients to the developing fetus [7]. IGF2 also acts as part of the IGF signaling pathway to regulate postnatal growth of somatic tissues including the brain [5,8]. The H19 gene encodes an untranslated RNA which acts as a trans-regulator of the imprinted gene network controlling embryonic growth in mice [9].
Despite being a classically imprinted region, with clear-cut patterns of allele-specific gene expression, there is evidence of considerable epigenetic heterogeneity at the Igf2/H19 locus. Population studies have shown that DNA methylation across the homologous region in the human genome is a normally distributed quantitative trait, which can be influenced by stochastic, genetic and environmental factors [10,11]. For example, individuals conceived during famine show alterations in IGF2 methylation [12], whilst polymorphisms in the region show an independent and additive association with DNA methylation [10]. Even isogenic strains of mice raised in a controlled environment demonstrate evidence of epigenetic heterogeneity; allelic DNA methylation levels at the Igf2/H19 locus vary extensively in a tissue-specific manner during fetal development, primarily on the expressed paternal allele [13].
The cerebellum is a region of the brain with an important role in motor control and cognition [14]. The cerebellum has a protracted period of development [15] and ultimately accounts for~12% of total brain weight [16]. Only a handful of studies have investigated Igf2 expression and its regulation by genomic imprinting in the mouse cerebellum. Early work revealed that the Igf2 gene is paternally expressed in cerebellar granule and glial cells during early postnatal development [17], with expression being spatially and temporally specific, indicating that the local synthesis of IGF2 is important for cerebellar development. A recent study showed that Igf2 regulates the proliferation of granule cell precursors, which ultimately determine the final size and shape of the cerebellum [18]. Together these results indicate that the regulation of Igf2 expression during early postnatal development likely plays an important role in the development of the cerebellum.
Previously we reported evidence of a strong, negative correlation between DNA methylation at the IGF2/H19 ICR and cerebellum weight in humans [11]. However, we observed no relationship between IGF2/H19 DNA methylation in the frontal cortex and net brain mass, suggesting that the association is cerebellum-specific. In the current study we comprehensively quantify cerebellar DNA methylation at all four CTCF binding sites spanning the murine Igf2/H19 ICR in an outbred population of Heterogeneous Stock (HS) mice. We report that epigenetic variation in the region shows an association with between-individual differences in cerebellar mass, reflecting the association observed in human postmortem brain tissue.

Results
Structure of cerebellar DNA methylation across the murine ICR Figure 2 shows the correlation between DNA methylation at all 26 CpG sites spanning the four CTCF binding sites assessed in this study, using 5 amplicons: CTCF1, CTCF2, CTCF3, CTCF4_1 and CTCF4_2. Because DNA methylation is strongly positively correlated between individual CpG sites within each amplicon, the ampliconmean DNA methylation level was used for all subsequent analyses. DNA methylation was highly correlated between the two adjacent amplicons covering CTCF4 (CTCF4_1 and CTCF4_2). Interestingly, although DNA methylation is largely positively correlated across the amplicons spanning CTCF1, 2 and 3, it is negatively correlated between these amplicons and those spanning CTCF4.
Association of Igf2/H19 DNA methylation with cerebellum mass Mean DNA methylation across amplicons CTCF2 and CTCF3 was significantly lower in the high cerebellar mass group than the low cerebellar mass group (CTCF2: t(39) = −1.97, mean high cerebellum mass = 58.1%, mean low cerebellum mass = 61.9%, p = 0.028; CTCF3: IGF2/H19 ICR DMR 0 CTCF3 CTCF4 Figure 1 Schematic map of the mouse Igf2/H19 locus, with paternal allele (♂) on the top and maternal allele (♀) on the bottom. DMRs are represented by circles: filled circles indicate a typically methylated allele and empty circles a typically unmethylated allele. In the current study, DNA methylation was assessed across 5 amplicons spanning the 1st-4th CTCF binding sites of the Igf2/H19 ICR (CTCF1, CTCF2, CTCF3 and CTCF4). SNPs genotyped are shown as gray triangles (from left to right rs33816033, rs33820807, rs51306385, rs33820056, rs33815374, rs33818240, rs33816897, rs33816896, rs33815163, rs33818772, rs33817683, rs33816812, rs33818680). t(37) = −1.72, mean high cerebellum mass = 52.9%, mean low cerebellum mass = 55.9%, p = 0.047) ( Figure 3). Furthermore, cerebellar DNA methylation at CTCF2 and CTCF3 showed a significant negative relationship with cerebellum mass at both amplicons (CTCF2: Figure 3). No significant relationship was observed between DNA methylation at any other CTCF region and cerebellum mass. Of note, no association was observed between cerebellar DNA methylation and net brain mass at either amplicon, suggesting that the findings we observe are cerebellum-specific. Finally, we tested whether this effect is specific to DMRs regulating the monoallelic expression of Igf2 and H19 by quantifying DNA methylation across DMRs associated with four additional brainexpressed imprinted genes (Kcnq1ot1, Mcts2, Nap1l5, and Sgce). No between-group differences in DNA methylation were observed between animals with high and low cerebellar mass (Additional file 1: Figure S1). Multiple linear regression was used to further investigate the relationship between cerebellar mass and DNA methylation at CTCF2 and CTCF3, including animal batch and net brain mass as covariates in the analysis. For CTCF2 the overall regression model was significant (adjusted R-squared = 0.184, F = 4.00 p = 0.015) and explained approximately 18% of the variance in cerebellum mass, with DNA methylation (β = −0.31, p = 0.041) and net brain mass being significantly predictive (β = 0.38, p = 0.031). The regression model for CTCF3 was also significant (adjusted R-squared = 0.224, F = 4.65, p = 0.008), explaining approximately 22% of the variance in cerebellum mass, with DNA methylation (β = −0.36, p = 0.023) and net brain mass both being predictive (β = 0.48, p = 0.006). Hierarchical linear regression was employed to determine the unique contribution of net brain mass and DNA 0.4 0.6 0.8 CTCF2 CTCF3 CTCF4_1 CTCF4_2 Figure 2 Correlation between DNA methylation at individual CpG sites within the mouse Igf2/H19 ICR, assayed using five amplicons spanning the four CTCF binding sites. CpG sites for each amplicon are numbered according to the CpG sites in Additional file 1: Materials 1. Two CpG sites have been removed after quality control steps, as described in Additional file 1: Table S2.
methylation to the variance observed in cerebellar mass. As expected, net brain weight was significantly associated with cerebellum weight, explaining 13% of the variance (R-squared = 0.130, F = 6.993, p = 0.012) for the samples included in the CTCF2 analysis and 14% of the variance (R-squared = 0.144, F = 7.374, p = 0.010) for the samples included in the CTCF3 analysis. DNA methylation, added in the second step of the analysis, was also a significant contributor to the observed variance in cerebellar mass, with CTCF2 methylation explaining 7% of the variance (ΔR-squared = 0.071, ΔF = −0.944, p = 0.005) and CTCF3 methylation explaining 8% of the variance (ΔR-squared = 0.075, ΔF = −1.056, p = 0.004) respectively.

Genetic analyses
In addition to quantifying DNA methylation, we genotyped 13 polymorphic single nucleotide polymorphisms (SNPs) spanning the Igf2 and H19 imprinted gene region (chr7:149745781-149890507). Standard genetic association analyses showed that none of the SNPs tested were directly associated with total brain mass, cerebellum mass, or DNA methylation either individually or as multi-marker haplotypes.

Discussion
These data provide further evidence that epigenetic variation at the Igf2/H19 ICR is associated with growth and development of the cerebellum. To our knowledge this is the first detailed investigation of DNA methylation across all four CTCF binding sites in the Igf2/H19 ICR in mouse cerebellum. Our results suggest that cerebellar DNA methylation across the 2nd and 3rd CTCF-binding sites, upstream of H19, shows a negative relationship with cerebellum mass, with~7% of the variance in cerebellum mass being uniquely explained by DNA methylation at these loci. No significant relationship was observed between DNA methylation and overall net brain mass for either amplicon, indicating that the effect of cerebellar DNA methylation at these sites is likely to be cerebellum-specific. The results from this study concur with our previous data, which highlight a negative relationship between DNA methylation in the IGF2/H19 ICR in post-mortem human cerebellum tissue and cerebellum mass measured at autopsy. Although the human association was primarily mediated by altered DNA methylation at the 3rd CTCF binding site, it is hard to directly compare specific locations across species because there is incomplete sequence homology between the mouse and human genome at this regionfor example the murine ICR contains four CTCF binding sites whereas the human ICR contains seven. Epigenetic heterogeneity in the IGF2/H19 ICR is a normally distributed quantitative trait influenced by stochastic, genetic and environmental factors [19]. We observe considerably more inter-individual variation at this locus in human compared to mouse cerebellum, which may result from the fact that the HS mice used in the current study were maintained in a controlled laboratory housing environment and, although outbred, have limited genetic variation compared to humans. Indeed when we investigated cis-acting genetic factors we found no association with DNA methylation, suggesting that the reported DNA methylation differences are most likely to result from stochastic factors, trans-acting genetic factors, or unmeasured environmental factors. Of note, nutrition early in development is associated with decreased cerebellum mass and alterations to the IGF system within the cerebellum [20]. Although the HS mice in this study were given a standard controlled diet, they may have been exposed to subtle differences in their rearingfor example, maternal behavior and suckling prior to weaningthat could also have influenced DNA methylation.
The mouse cerebellum continues to grow for three weeks postnatally, at which point its circumference has reached adult size [21]. Corresponding expression data suggests that Igf2 mRNA levels are coordinated with specific growth patterns in the brain [17]. In this study we have looked at DNA methylation in adult mice to serve as a proxy for levels of gene expression during development. Investigations into the factors determining cerebellar growth and development are important as a growing body of work from human and animal studies suggests that the role of the cerebellum may extend beyond the regulation and coordination of motor function, to attention, perception, working memory and spatial orientation [14]. In humans, cerebellar abnormalities are among the most consistently reported structural findings in autism and attention deficit hyperactivity disorder [22], and progressive loss of cerebellar volume has been reported in childhood-onset schizophrenia and other types of psychosis [23].
The current study has a number of limitations. First, the absolute mean DNA methylation difference between the high and low cerebellar groups is relatively modest (~4%), but is consistent across the multiple adjacent CpG sites assessed in the CTCF2 and CTCF3 amplicons. Previous work has shown that Igf2 expression in the cerebellum is highly cell-type specific [17], and the small absolute difference we observe may result from much larger changes in a specific subpopulation of cells. Second, although the animals were maintained in a controlled environment, and we were careful to control for variables such as batch and net brain mass in our analyses, it is possible that our findings are biased by unmeasured variables or confounding influences such as maternal behavior. Third, the sodium bisulfite conversion method does not distinguish between methylated cytosines and hydroxymethylated cytosines, which may be particularly important here as hydroxymethylation is relatively enriched in the Purkinje cells of the cerebellum [24]. Fourth, we did not have access to RNA from the same samples, so we are unable to relate our epigenetic data to changes in steady-state mRNA levels. Our previous work [11] in humans, however, did not report an association between altered DNA methylation at this region and absolute levels of gene expression. It is possible that the observed DNA methylation changes result in allelic skewing of expression, rather than changes in absolute expression values or that DNA methylation may be a mark of gene expression changes occurring during development, rather than current gene expression levels. Fifth, we only analysed DNA methylation in the cerebellum itself, so we cannot assess the tissuespecificity of the findings. However, our previous work in humans looking at several brain regions [11] indicated that the association between IGF2/H19 DNA methylation and brain region weight was cerebellum-specific. Further work would be required to assess the tissuespecificity of the finding in mouse. Finally, we only assessed cerebellum mass and DNA methylation at one time point in adulthood, so could not examine changes occurring during development.
To conclude, we show that epigenetic variation across the Igf2/H19 ICR shows a significant negative relationship with cerebellar mass, with hypomethylation at specific CTCF binding motifs being associated with increased cerebellar mass, reflecting our previous work in human tissue. Future work will expand these analyses to look at brain-region and cell-type specific effects, to determine the genetic and environmental factors underlying the changes in Igf2/H19 DNA methylation and to explore the relationship between DNA methylation and gene expression and measures of behavior.

Animals and brain tissue samples
All housing and experimental procedures were performed in compliance with the UK Home Office Animals Scientific Procedures Act 1986. Male HS mice were shipped from the Institute for Behavioral Genetics, University of Colorado at Boulder (Boulder, USA) to the UK at the age of approximately eight weeks. Mice were individually identified upon arrival, and were singly housed and habituated under strictly controlled housing condition at the Institute of Psychiatry, King's College London. The animals used in this study had an average age of 134.4 days (+/− 11.6). Individual animals were weighed and killed by cervical dislocation. Immediately after cervical dislocation, the whole brain including olfactory bulbs and brainstem was carefully removed from the skull and placed on the ventral side (dorsal facing up) on a sterile petri dish containing ice-cold artificial cerebrospinal fluid over ice. The brains were transected at the caudal margin of the cerebellum and the olfactory bulbs dissected out by cutting the fissura rhinalis using a surgical blade. Wet cerebri (whole brain minus cerebellum and olfactory bulbs) were briefly rolled on filter paper to dry excess aCSF and weighed immediately to the nearest 0.1 mg (net brain mass). Similarly, the dissected cerebellums were rolled in filter paper and weighed (cerebellum mass). Tissue samples were snap frozen on dry ice and stored at −80°C until used for DNA extraction. Because samples came from three experimental batches of HS mice, cerebellar mass was scaled and centered separately by batch. The resulting distribution was approximately normal with no age effect, and using these standardized values 48 mice were selected from a total of 274 from three batches as representing extremes of cerebellar mass in order to increase the statistical power to detect an association between DNA methylation and phenotype [25]. The descriptive statistics for these animals are given in Table 1.

DNA methylation analysis
DNA was extracted from the cerebellar samples using the Qiagen AllPrep DNA/RNA Mini Kit (Qiagen, Valencia, CA, USA) according to the manufacturer's standard protocol. Sodium bisulfite conversion of 500 ng genomic DNA was performed in duplicate using the EZ-96 DNA Methylation kit (Zymo Research, CA, USA), following the manufacturer's standard protocol. Bisulfite pyrosequencing was used to quantify DNA methylation at individual CpG sites within the Igf2 ICR. Primer sequences for 4 amplicons spanning CTCF binding sites 2, 3 and 4 were acquired from a previously published study [26]. Primers for CTCF binding site 1 were designed using the Pyro-Mark Assay Design software 2.0 (Qiagen, UK). Amplicons are referred to as CTCF1, CTCF2, CTCF3, CTCF4_1 and CTCF4_2, and their locations can be seen in Figure 1. Primer sequences for each amplicon are given in Table 2 and the genomic sequences of each amplicon are given in the Additional file 1: Materials 1. Bisulfite-PCR amplification was performed in duplicate using optimized cycling conditions (see Table 2). DNA methylation was quantified using the Pyromark Q24 system (Qiagen, UK) following the manufacturer's standard instructions and the Pyro Q24 CpG 2.0.6 software. DNA methylation across DMRs associated with four additional imprinted genes (Kcnq1ot1, Mcts2, Nap1l5, and Sgce) was assessed using Sequenom EpiTYPER as described previously [27]. Assay details are given in Additional file 1: Table S1.

Statistical analysis
Prior to analysis, stringent data quality control was performed to remove potentially unreliable measurements of DNA methylation (see Additional file 1: Table S2). Non-CpG cytosines were used as internal controls to verify the efficiency of bisulfite conversion. No indication of incomplete bisulfite conversion was observed for any sample. All statistical analysis was performed within the "R" statistical programming environment (http:// www.R-project.org). Spearman's rank correlation tests were used to test whether DNA methylation was coordinated within and between the CTCF binding sites. As CpG sites within each of the amplicons were strongly and significantly correlated (see Figure 2), mean DNA methylation scores for the five amplicons were used in all subsequent analyses. One-sided t-tests were used to test our hypothesis that high cerebellar mass is associated with relative hypomethylation at the Igf2/H19 ICR CTCF binding sites. Linear regression was subsequently used to explore any observed association between cerebellar mass and DNA methylation. A simple bias correction method was used to correct for only profiling DNA methylation in mice with extremes of cerebellar mass [25]. Multiple linear regression was used to examine other predictors of cerebellum mass; additional explanatory variables in the model were batch and net brain mass. Hierarchical linear regression models were used to assess the unique contribution of DNA methylation in these models. To investigate the specificity of our analyses to the cerebellum we also used linear regression to test for an association between net brain mass and cerebellar DNA methylation.

Genotype analysis
13 polymorphic SNPs spanning the Igf2/H19 imprinted region (chr7:149745781-149890507) were selected for genotype analysis. The SNPs were extracted from existing Table 1 Uncorrected wet brain and cerebellum mass (mean ± SD) for the HS mice used in this study, split by a) batch and b) high and low cerebellum mass a) n Age (days) Cerebellum mass (g) Net brain mass (g)  microarray data obtained from 40 of the mice included in this study (20 high and 20 low cerebellum mass). Samples were genotyped using the Affymetrix Mouse Diversity Genotyping Array and analysis performed using the MouseDivGeno package in R (http://cgd.jax.org/tools/ mousedivgeno) [28]. Haploview 4.2 (http://www.broadinstitute.org/haploview) was used to establish that all 13 SNPs were in Hardy-Weinberg Equilibrium (HWE). Lewontin's D' and the linkage disequilibrium (LD) coefficient r 2 were calculated using Haploview to measure the LD between all pairs of biallelic loci. HWE p-values, minor allele frequencies and LD between the SNPs are given in Additional file 1: Table S3 and Additional file 1: Figure S2. ANOVA was used to test each SNP for an association with total brain mass, cerebellum mass, and mean percentage cerebellar DNA methylation at each of the five amplicons.
To further interrogate the association between genotype and DNA methylation we used UNPHASED 3.1.4 (http:// www.mrc-bsu.cam.ac.uk/personal/frank/software/ unphased) to test the association between haplotypes and mean cerebellum DNA methylation values.

Additional file
Additional file 1: Supplementary Materials 1. Genomic sequences of each amplicon used in this study. Figure S1. No differences in DNA methylation at four additional imprinted loci (Kcnq1ot1, Mcts2, Nap1l5 and Sgce). Figure S2. High linkage disequilibrium (LD) across the SNPs genotyped in the murine Igf2/H19 region. Table S1. Details for the Sequenom EpiTYPER assays used to profile DNA methylation across DMRs associated with Kcnq1ot1, Mcts2, Nap1l5 and Sgce. Table S2. Stringent quality control and filtering steps used in analysis of Igf2/H19 DNA methylation data. Table S3. LD between the SNPs genotyped in Igf2 and H19.