INTRODUCTION
Noncoding genetic variation is a major driver of phenotypic diversity and the risk of a broad spectrum of diseases. For example, of the common single-nucleotide polymorphisms and short insertions/deletions identified by genome-wide association studies to be linked to specific traits or diseases, ~90% are typically found to reside in noncoding regions of the genome (
1). The recent application of genome-wide approaches to define the regulatory landscapes of many different cell types and tissues allows intersection of these variants with cell-specific regulatory elements and strongly supports the concept that alteration of transcription factor binding sites at these locations is an important mechanism by which they influence gene expression (
2–
4). Despite these major advances, it remains difficult to predict the consequences of most forms of noncoding genetic variation. Major challenges that remain include defining the causal variant within a block of variants that are in high linkage disequilibrium, identifying the gene that is regulated by the causal variant, and understanding the cell type and cell state–specific regulatory landscape in which a variant might have a functional consequence (
5). For example, a variant that affects the binding of a signal-dependent transcription factor (SDTF) may only be of functional importance in a cell that is responding to a signal that activates that factor (
6). Also, sequence variants can have a range of effects on transcription factor binding motifs, from abolishing or inducing binding by affecting critical nucleotides to quantitatively changing binding by affecting an intermediate affinity motif (
7–
9).
Studies of the impact of natural genetic variation on signal-dependent gene expression have demonstrated large differences in absolute levels of gene expression under basal and stimulated conditions, which result in corresponding differences in the dynamic range of the response (
10–
12). The molecular mechanisms by which genetic variation results in these qualitatively and quantitatively different signal-dependent responses remain poorly understood but are likely to be of broad relevance to understanding how noncoding variation influences responses to signals that regulate development, homeostasis, and disease-associated patterns of gene expression.
To investigate the influence of genetic variation on signal-dependent gene expression, we performed transcriptomic and epigenetic studies of the responses of macrophages derived from five different inbred mouse strains to the anti-inflammatory cytokine interleukin-4 (IL-4) (
Fig. 1A). The selected strains include both similar and highly divergent strain pairs, allowing modeling of the degree of variation between two unrelated individuals (~4 million variants) and that observed across large human populations (>50 million variants). Using this approach, we previously showed that strain-specific variants that disrupt the recognition motif for one macrophage lineage-determining transcription factor (LDTF; e.g., PU.1), besides reducing binding of the LDTF itself, also result in decreased binding of other collaborative factors and SDTFs (
13,
14). Collectively, these findings supported a model in which relatively simple combinations of LDTFs collaborate with an ensemble of additional transcription factors to select cell-specific enhancers that provide sites of action of broadly expressed SDTFs (
15).
IL-4 has many biological roles, including regulation of innate and adaptive immunity (
16). In macrophages, IL-4 drives an “alternatively activated” program of gene expression associated with inhibition of inflammatory responses and promotion of wound repair (
17). The immediate transcriptional response to IL-4 is mediated by activation of signal transducer and activator of transcription 6 (STAT6) (
18,
19), which rapidly induces the expression of direct target genes that include effector proteins such as Arginase 1 (Arg1) and transcription factors like peroxisome proliferator–activated receptor γ (PPARγ) (
20,
21) and early growth response 2 (EGR2) (
22). However, the extent to which natural genetic variation influences the program of alternative macrophage activation has not been systematically evaluated. Here, we demonstrate highly differential IL-4–induced gene expression and enhancer activation in bone marrow–derived macrophages (BMDMs) across the five mouse strains, thereby establishing a robust model system for quantitative analysis of the effects of natural genetic variation on signal-dependent gene expression. Through the application of deep learning methods and motif mutation analysis of strain-differential IL-4–activated enhancers, we provide functional evidence for a dominant set of LDTFs and SDTFs required for late IL-4 enhancer activation, which include STAT6, PPARγ, and EGR2, and validate these findings in
Egr2-knockout BMDMs. Assessment of the quantitative effects of natural genetic variants on recognition motifs for LDTFs and SDTFs suggests general principles by which such variation affects enhancer activity patterns and dynamic signal responses.
DISCUSSION
Here, we report a systematic investigation of the effects of natural genetic variation on signal-dependent gene expression by exploiting the highly divergent responses of BMDMs from diverse strains of mice to IL-4. Unexpectedly, despite broad conservation of IL-4 signaling pathways and downstream transcription factors in all five strains, only 26 of more than 600 genes observed to be induced >2-fold by IL-4 at 24 hours reached that level of activation in all five strains, and more than half were induced in only a single strain. To the extent that this remarkable degree of variation observed in BMDMs occurs in tissue macrophages and other cell types in vivo, it is likely to have substantial phenotypic consequences with respect to innate and adaptive immunity, tissue homeostasis, and wound repair. Notably, only ~25% of the variation in response to IL-4 was due to altered dynamic ranges in the context of an equivalent level of basal expression. Nearly half of the genes showing strain-specific impairment in IL-4 responsiveness exhibited low basal activity, whereas lack of induction was associated with constitutively high basal levels of expression in the remaining ~25%. These qualitatively different patterns of strain responses to IL-4 imply distinct molecular mechanisms by which genetic variation exerts these effects.
Motif mutation analysis of strain-differential enhancer activation recovered a dominant set of motifs recognized by known LDTFs PU.1, C/EBPβ, and AP-1 family members, as well as motifs recognized by SDTFs STAT6 and PPARγ that have been previously established to play essential roles in the IL-4 response. In addition, effects of mutations in motifs for EGR, NRF, and KLF also strongly implicate these factors as playing important roles in establishing basal and induced activities of IL-4–responsive enhancers, which was genetically confirmed for EGR2 in this study and a recent study by Daniel
et al. (
22). It will be of interest in the future to perform analogous studies of NRF and KLF factors.
Analysis of strain-differentially activated enhancers revealed qualitative differences in basal and IL-4–dependent activity that were analogous to the qualitative differences observed for strain-differentially activated genes. As expected, sequence variants reducing the affinity of SDTFs STAT6, PPARγ, and EGR2 were the major forms of variation resulting in strain-differential IL-4 induction of equal basal enhancers. From the standpoint of interpreting the effects of noncoding variation, these types of sequence variants are silent in the absence of IL-4 stimulation. As also expected, sequence variants strongly reducing the binding affinity of LDTFs prevented the generation of open chromatin required for subsequent binding of SDTFs. These variants are thus expected to result in loss of enhancer function in a signal-independent manner. Of particular significance, these analyses also provide strong evidence that quantitative variation in suboptimal motif scores for LDTFs is a major determinant of differences in the absolute levels and dynamic range of high basal enhancers across strains. The importance of low-affinity motifs in establishing appropriate quantitative levels of gene expression within a given cell type and cell specificity across tissues has been extensively evaluated (
41–
43). Here, we present evidence that improvement of low-affinity motifs for LDTFs not only increases basal binding of the corresponding transcription factor but also is associated with increased basal binding of STAT6 and PPARγ, thereby rendering their actions partially or fully IL-4 independent. These findings thus provide evidence that quantitative effects of genetic variation on LDTF motif scores play major roles in establishing different absolute enhancer activity levels and dynamic ranges of their responses to IL-4 that are observed between strains.
To go beyond the discovery of mechanisms mediating the IL-4 response using natural genetic variation, a major objective of these studies was to use the resulting datasets as the basis for interpreting and predicting the effects of specific variants. As expected, enhancers exhibiting strain-specific differences in IL-4 responses were significantly enriched for sequence variants. However, the background frequencies of variants in the much larger sets of strain-similar enhancers ranged from 17 to 93%, consistent with the vast majority of these variants being silent and underscoring the challenges of discriminating them from functional variants. The application of recently developed deep learning approaches illustrates both the potential of these methods to improve predictive power and their current limitations. Nucleotides predicted by DeepLIFT to be of functional importance frequently intersected with variants at strain-differential enhancers that significantly altered LDTF or SDTF motifs, with over eightfold enrichment in enhancers with strongest strain differences (top 1% variants for C57 versus BALB comparison;
Fig. 2I), strongly suggesting causality. Although DeepLIFT scored a substantial fraction of variants present in strain-similar enhancers with low importance, a large fraction of remaining strain-similar enhancers contained variants associated with high DeepLIFT scores, most likely representing false positives. Furthermore, we found that the highest scoring variants in some cases depended on the choice of data used to train the convolutional neural network (e.g., using random versus noninduced enhancers as negative training examples). This observation has important implications with respect to application of deep learning models to identify potential functional variants in disease contexts. The datasets generated by these studies will therefore provide an important resource for further improvements in methods for interpretation of local genetic variation.
These analyses further indicated that 20 to 50% of the most divergent IL-4–responsive enhancers lacked any functional variants in the proximity of open chromatin. This fits with previous observations that variant-free enhancers can reside in cis-regulatory domains (CRD) containing functionally interacting enhancers, suggesting that a variant strongly affecting one enhancer within the CRD could have domain-wide effects (
14). This concept was supported and extended here by HiChIP experiments. In addition to demonstrating that the IL-4 response was primarily associated with preexisting enhancer-promoter connections, the HiChIP assay also captured a large number of enhancer-enhancer interactions. Examination of these connected enhancers provided evidence that a substantial fraction of strain-differential enhancers lacking local variants were connected to strain-differential enhancers containing functional variants. An important future direction will be to further investigate the significance and mechanisms underlying these associations.
Collectively, these studies reveal general mechanisms by which noncoding genetic variation influences signal-dependent enhancer activity, thereby contributing to strain-differential patterns of gene expression and phenotypic diversity. A major future goal will be to incorporate these findings into improved algorithms for prediction of absolute levels and dynamic responses of genes to IL-4 at the level of individual genes.
MATERIALS AND METHODS
Experimental design
To investigate the influence of genetic variation on signal-dependent gene expression, enhancer activation, and transcription factor binding, we performed RNA-seq, ATAC, and ChIP-seq to study the responses of macrophages derived from five different inbred mouse (C57, BALB, NOD, PWK, and SPRET) strains to the anti-inflammatory cytokine IL-4 (
Fig. 1A).
Mice
Female and male breeder mice for C57, BALB, NOD, PWK, and SPRET mice were purchased from the Jackson laboratory. F1 C57 × SPRET mice were crossed, and Egr2fl/fl mice were generously donated by Drs. V. Lazarevic and L. Warren (National Institutes of Health) and crossed to LyzM-Cre mice (the Jackson laboratory) to achieve myeloid-specific targeted deletion of Egr2. Mice were housed at the University of California San Diego animal facility on a 12-hour/12-hour light/dark cycle with free access to normal chow food and water. All animal procedures were in accordance with University of California San Diego research guidelines for the care and use of laboratory animals. Eight- to 12-week-old healthy female mice were used for all our experiments.
BMDM culture
Femur, tibia, and iliac bones from the different mouse strains were flushed with Dulbecco’s modified Eagle’s medium (DMEM) high glucose (Corning), and red blood cells were lysed using red blood cell lysis buffer (eBioscience). After counting, 20 million bone marrow cells were seeded per 15-cm nontissue culture plates in DMEM high glucose (50%) with 20% fetal bovine serum (FBS; Omega Biosciences), 30% L929 cell–conditioned laboratory-made media [as source of macrophage colony-stimulating factor (M-CSF), as described before (
14)], penicillin/streptomycin +
l-glutamine (100 U/ml; Gibco), and amphotericin B (2.5 μg/ml; HyClone). After 4 days of differentiation, mouse M-CSF (16.7 ng/ml; Shenandoah Biotechnology) was added to the media. After an additional 2 days of culture, adherent cells were scraped and subsequently seeded onto tissue culture–treated petri dishes in DMEM containing 10% FBS, penicillin/streptomycin +
l-glutamine (100 U/ml), amphotericin B (2.5 μg/ml), and M-CSF (16.7 ng/ml). Macrophages were left untreated or treated with mouse recombinant IL-4 (20 ng/ml; PeproTech) for 1, 6, or 24 hours.
Immunofluorescence
Cells were fixed with Cytofix/Cytoperm Buffer (BD Biosciences, BD554714) for 10 min at room temperature. Cytofix/Cytoperm buffer was removed, and cells were washed twice with Hanks’ balanced salt solution containing 2% bovine serum albumin (BSA) and 1 mM EDTA. Cells were kept in permeabilization/wash buffer (BD Biosciences, BD554714) for 1 hour at 4°C or until the experiment was performed. Fixed cells were blocked using 3% BSA, 0.1% Triton–phosphate-buffered saline (PBS) for 30 min at room temperature and then with 1/200 of the EGR2 antibody (Abcam) overnight at 4°C. The next day, cells were washed with 0.1% Triton-PBS and incubated with 1/200 donkey anti-rabbit 555 (Thermo Fisher Scientific, no. A31572) secondary antibody and phalloidin (Abcam, ab176759) for staining actin filaments, and nuclei were counterstained with 4′,6-diamidino-2-phenylindole. After washing with 0.1% Triton-PBS, slides were mounted with ProLong Gold Antifade Reagent (Life Technologies, no. 10144). Images were taken using a Leica SP8 with light deconvolution microscope.
RNA-seq library preparation
Total RNA was isolated from cells and purified using RNA Direct-zol microprep columns and ribonuclease (RNase)–free deoxyribonuclease digestion according to the manufacturer’s instructions (Zymo Research). Sequencing libraries were prepared in biological replicates from polyadenylate [poly(A)]–enriched mRNA as previously described (
14). Libraries were polymerase chain reaction (PCR)–amplified for 9 to 14 cycles, size-selected using tris-boric acid-EDTA (TBE) gels or one-sided 0.8× AMPure cleanup, quantified by the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific), and 75-bp single-end–sequenced on a HiSeq 4000 or NextSeq 500 (Illumina).
Cross-linking for ChIP-seq
For histone marks, PU.1, C/EBPβ, and RNA Pol II ChIP-seqs, culture media were removed and plates were washed once with PBS and then fixed for 10 min with 1% formaldehyde (Thermo Fisher Scientific) in PBS at room temperature; reaction was then quenched by adding glycine (Thermo Fisher Scientific) to 0.125 M. For STAT6, PPARγ, and EGR2 ChIP-seq, cells were cross-linked for 30 min with 2 mM DSG (Pierce) in PBS at room temperature. Subsequently, cells were fixed for 10 min with 1% formaldehyde at room temperature, and the reaction was quenched with 0.125 M glycine. After fixation, cells were washed once with cold PBS and then scraped into supernatant using a rubber policeman and pelleted for 5 min at 400g at 4°C. Cells were transferred to Eppendorf DNA LoBind tubes and pelleted at 700g for 5 min at 4°C, snap-frozen in liquid nitrogen, and stored at −80°C until ready for ChIP-seq protocol preparation.
Chromatin immunoprecipitation
ChIP was performed in biological replicates as described previously (
44). Samples were sonicated using a probe sonicator in 500-μl lysis buffer [10 mM tris-HCl (pH 7.5), 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.1% deoxycholate, 0.5% sarkozyl, and 1× protease inhibitor cocktail]. After sonication, 10% Triton X-100 was added to 1% final concentration, and lysates were spun at full speed for 10 min. One percent was taken as input DNA, and immunoprecipitation was carried out overnight with 20-μl Protein A Dynabeads (Invitrogen) and 2-μg specific antibodies for PU.1 (Santa Cruz Biotechnology, sc-352X), H3K4me2 (Millipore, 07-030), H3K4me3 (Millipore, 04-745), H3K27ac (Active Motif, 39135), RNA Pol II (GeneTex, GTX102535), STAT6 (Santa Cruz Biotechnology, sc-374021), EGR2 (Abcam, ab43020), and C/EBPβ (Santa Cruz Biotechnology, sc-150). Beads were washed three times each with wash buffer I (20 mM tris-HCl, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, and 2 mM EDTA), wash buffer II (10 mM tris-HCl, 250 mM LiCl, 1% IGEPAL CA-630, 0.7% Na-deoxycholate, and 1 mM EDTA), tris-EDTA (TE) 0.2% Triton X-100, and TE 50 mM NaCl and subsequently resuspended 25-μl 10 mM tris-HCl (pH 8.0) and 0.05% Tween 20, and sequencing libraries were prepared on the Dynabeads as described below.
For PPARγ ChIP-seq, fixed cells were lysed in 500-μl radioimmunoprecipitation assay (RIPA) lysis buffer [20 mM tris-HCl (pH 7.5), 1 mM EDTA, 0.5 mM EGTA, 0.1% SDS, 0.4% Na-deoxycholate, 1% NP-40 alternative, 0.5 mM dithiothreitol (DTT), and 1× protease inhibitor cocktail (Sigma-Aldrich)] and chromatin was sheared using a probe sonicator. One percent was taken as input DNA, and immunoprecipitation was carried out overnight with 20-μl Protein A Dynabeads (Invitrogen) and 2 μg of both PPARγ antibodies (Santa Cruz Biotechnology, sc-271392 and sc-7273). Beads were then collected using a magnet and washed with 175-μl ice cold buffer as indicated by incubating samples on ice for 3 min: three times with RIPA wash buffer [20 mM tris-HCl (pH 7.5), 1 mM EDTA, 0.5 mM EGTA, 0.1% SDS, 0.4% Na-deoxycholate, 1% NP-40 alternative, 0.5 mM DTT, and 1× protease inhibitor cocktail (Sigma-Aldrich)], six times with LiCl wash buffer [10 mM tris-HCl (pH 7.5), 250 mM LiCl, 1 mM EDTA, 0.7% Na-deoxycholate, 1% NP-40 alternative, and 1× protease inhibitor cocktail (Sigma-Aldrich)], twice with TET [10 mM tris-HCl (pH 8.0), 1 mM EDTA, 0.2% Tween 20, and 1× protease inhibitor cocktail (Sigma-Aldrich)], and once with TE-NaCl [10 mM tris-HCl (pH 8.0), 0.1 mM EDTA, 50 mM NaCl, and 1× protease inhibitor cocktail (Sigma-Aldrich)]. Bead complexes were resuspended in 25-μl TT [10 mM tris-HCl (pH 8.0) and 0.05% Tween 20], and sequencing libraries were prepared on the Dynabeads as described below.
ChIP-seq library preparation
ChIP libraries were prepared while bound to Dynabeads using a NEBNext Ultra II Library preparation kit (NEB) using half reactions. DNA was polished, poly(A)–tailed and ligated after which dual UDI (IDT) or single (Bioo Scientific) barcodes were ligated to it. Libraries were eluted and cross-links reversed by adding to the 46.5-μl NEB reaction, 16-μl water, 4-μl 10% SDS, 4.5-μl 5 M NaCl, 3-μl 0.5 M EDTA, 4-μl 0.2 M EGTA, 1-μl RNAse (10 mg/ml), and 1-μl proteinase K (20 mg/ml), followed by incubation at 55°C for 1 hour and 75°C for 30 min in a thermal cycler. Dynabeads were removed from the library using a magnet, and libraries were cleaned up by adding 2-μl SpeedBeads 3 EDAC (Thermo Fisher Scientific) in 124-μl 20% polyethylene glycol, molecular weight 8000/1.5 M NaCl, mixing well and then incubating at room temperature for 10 min. SpeedBeads were collected on a magnet and washed two times with 150-μl 80% ethanol for 30 s. Beads were collected, and ethanol was removed following each wash. After the second ethanol wash, beads were air-dried and DNA-eluted in 12.25-μl 10 mM tris-HCl (pH 8.0) and 0.05% Tween 20. DNA was amplified by PCR for 14 cycles in a 25-μl reaction volume using NEBNext Ultra II PCR master mix and 0.5 μM each Solexa 1GA and Solexa 1GB primers. Libraries were size-selected using TBE gels for 200 to 500 bp, and DNA was eluted using gel diffusion buffer [500 mM ammonium acetate (pH 8.0), 0.1% SDS, 1 mM EDTA, and 10 mM magnesium acetate] and purified using ChIP DNA Clean & Concentrator (Zymo Research). Sample concentrations were quantified by the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific) and 75-bp single-end–sequenced on HiSeq 4000 or NextSeq 500 (Illumina).
ATAC-seq library preparation
Approximately 80K cells were lysed in 50-μl room temperature ATAC lysis buffer [10 mM tris-HCl, (pH 7.4), 10 mM NaCl, 3 mM MgCl2, and 0.1% IGEPAL CA-630], 2.5-μl DNA Tagmentation Enzyme mix (Nextera DNA Library Preparation Kit, Illumina) was added. The mixture was incubated at 37°C for 30 min and subsequently purified using the ChIP DNA purification kit (Zymo Research) as described by the manufacturer. DNA was amplified using the Nextera Primer Ad1 and a unique Ad2.n barcoding primer using NEBNext High-Fidelity 2× PCR MM for 8 to 14 cycles. PCRs were size-selected using TBE gels for 175 to 350 bp, and DNA was eluted using gel diffusion buffer [500 mM ammonium acetate (pH 8.0), 0.1% SDS, 1 mM EDTA, and 10 mM magnesium acetate] and purified using ChIP DNA Clean & Concentrator (Zymo Research). Samples were quantified by the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific) and 75-bp single-end–sequenced on HiSeq 4000 or NextSeq 500 (Illumina).
H3K4me3 HiChIP
For H3K4me3 HiChIP, 10 million formaldehyde cross-linked cells per condition in biological replicates were used. HiChIP was performed as described before (
29). In our experiments, 375 U of Mbo I (NEB, R0147M) restriction enzyme was used for chromatin digestion. Shearing was performed in three Covaris microtubes per sample and using the following parameters on a Covaris E220 (Fill Level = 6, Duty Cycle = 5, PIP = 140, Cycles/Burst = 200, Time = 200 s). H3K4me3 IP was performed using 7.5 μg of antibody (Millipore, 04-745). Final PCR was performed using NEBNext High-Fidelity PCR MM and Nextera general Primer Ad1 and specific Nextera Primer Ad2.n. PCR product was run on a TBE gel (Invitrogen), and libraries were size-selected from 250 to 700 bp and cleaned up using 150-μl gel diffusion buffer [500 mM ammonium acetate (pH 8.0), 0.1% SDS, 1 mM EDTA, and 10 mM magnesium acetate] and purified using ChIP DNA Clean & Concentrator (Zymo Research). Samples were quantified by the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific) and 75-bp paired-end–sequenced on a NextSeq 500 (Illumina).
Data mapping
Custom genomes were generated for BALB, NOD, PWK, and SPRET mice from the C57 or mm10 genome as before (
14) using MMARGE v1.0 (
45) and the variant call format (VCF) files from the Mouse Genomes Project (
46). Data generated from different mouse strains were first mapped to their respective genomes using STAR v2.5.3 (
47) for RNA-seq data or bowtie2 v2.2.9 (
48) for ATAC-seq, ChIP-seq, and HiChIP data. Then, the mapped data were shifted to the mm10 genome using the MMARGE v1.0 “shift” function (
45) for downstream comparative analyses.
RNA-seq data analysis
RNA-seq data processing
Transcripts were quantified using HOMER v4.11.1 “analyzeRepeats” script (
15). Transcripts per kilobase million (TPM) values were reported by using the parameters -count exons -condenseGenes -tpm. Log-scaled TPM values were computed by log
2(TPM + 1). Raw read counts within transcripts were reported by using the parameters -count exons -condenseGenes -noadj. Differentially expressed genes were identified by feeding raw read counts into DESeq2 (
49) through the “getDiffExpression” script of HOMER. IL-4–induced and IL-4–repressed genes were called by fold changes greater than 2 or less than half, respectively, together with
q values smaller than 0.05. Gene ontology analysis was performed using Metascape (
50).
Categorization of strain-differential genes
Strain-differential genes were defined on the basis of pairwise comparisons between C57 and one of the other strains as being called IL-4–induced or IL-4–repressed in one strain but not in the other. Strain-differential IL-4–induced genes were further classified into three categories based on the relative level of basal expression between the induced strain versus the noninduced strain: high basal, equal basal, and low basal. In the high basal group, the noninduced strain has at least 1.5-fold greater basal expression level than the induced strain. The direction of difference flipped for the low basal group where the induced strain has over 1.5-fold greater basal expression than the noninduced strain. The genes in between are categorized into the equal basal group.
F1 mice data processing
RNA-seq data from F1 mice were mapped to both parental genomes (C57 and SPRET) and analyzed in the same way as before (
14). In short, the read counts for each transcript were multiplied by the ratio of reads overlapping mutations times 10 and assigned to the parental genomes. Transcripts without any assigned reads in one of the F1 alleles were filtered out. To determine cis versus trans effects of genetic variation on gene expression, the difference of fold change between parental alleles and F1 alleles were calculated. The genes with majorly cis effects were defined by −1 < log
2(parental fold change) – log
2(F1 fold change) < 1, while those with majorly trans effects were defined by F1 fold change < parental fold change for genes with over ±2 fold-change in parental alleles.
WGCNA analysis
For each strain, a differential gene expression analysis was performed to compare IL-4 to basal with Limma Voom (
51). A linear model was fit for all five differential comparisons at once, and 1912 genes that were significant with
q value below 0.05 and an absolute fold change of 1.5 in any comparison were included in a WGCNA (
52). WGCNA was performed with a softpower value of 20, and a signed network was generated. Modules were cut with minimum module size of 50 and a cut height of 0.999 including PAM-stage. Nine modules were detected of which two genes were part of the gray (nonconnected) module that was subsequently excluded. Module Eigengenes (ME) were calculated and visualized using the verbose-boxplots function that also performed a Kruskal-Wallis significance test to test whether all ME values belong to the same distribution, and all modules were significantly different between conditions (all
P values below <0.0012). Two modules exhibited consistent differential expression between IL-4 and notx across strains, while the other six modules were most prominently influenced in a strain-specific manner. Modules were annotated with Metascape (
50).
ATAC-seq and ChIP-seq data analysis
On the basis of the HOMER tag directories created from mapped sequencing data, the reproducible ATAC-seq and transcription factor ChIP-seq peaks were identified by using HOMER to call unfiltered 200-bp peaks (parameters -L 0 -C 0 -fdr 0.9 -size 200) and running IDR v2.0.3 on replicates of the same sample with the default parameters (
53). The levels of histone modifications and RNA Pol II were quantified within ±500 bp around the centers of reproducible ATAC-seq peaks using HOMER annotatePeak.pl with parameters “-size -500,500 -norm 1e7.” The transcription factor binding intensities were quantified within ±300 bp around the identified ChIP-seq peaks using parameters “-size -150,150 -norm 1e7.” For comparisons across multiple samples (e.g., different time points, mouse strains, and transcription factors), we merged the set of peaks first using HOMER mergePeaks “-d given” before quantifying the features above. To visualize the average profile of a dataset around a certain set of peaks, we used HOMER annotatePeaks.pl with parameters “-norm 1e7 -size 4000 -hist 20” to help compute the histograms of 20-bp bins within ±2000 bp regions.
Identification of IL-4–responsive regulatory elements
IL-4–responsive enhancers were identified by the strong fold changes of H3K27ac and RNA Pol II at intergenic or intronic open chromatin. Reproducible ATAC-seq peaks called from each mouse strain for the basal and IL-4 conditions were first merged and then annotated for genomic positions and the enrichment of H3K27ac and RNA Pol II within ±500 bp using HOMER v4.11.1. On the basis of the genomic annotations from HOMER annotatePeaks.pl, we classified regions at promoter-transcription start sites (TSS) as promoters and regions at intergenic or intronic positions as enhancers. Regions with less than 16 normalized tags of H3K27ac or less than 8 normalized tags of RNA Pol II were filtered out. For the remaining promoters and enhancers, we computed the fold changes of the normalized tags of H3K27ac and RNA Pol II between basal and IL-4 conditions for each mouse strain. Regions were called IL-4–induced or IL-4–repressed if there were at least 2.5-fold increases or decreases, respectively, from basal to IL-4 state for both histone markers. Regions with less than 1.4-fold changes were called neutral elements.
Super enhancer
We used ROSE to call super enhancers for the five mouse strains (
25). The active enhancers were first merged within each strain for both basal and IL-4 conditions to obtain a set of starting conventional enhancers. Then, the ROSE algorithm was run for each strain on the mapped H3K27ac ChIP-seq data with the parameter “-t 2500” to exclude TSS. The overall activity of a super enhancer was quantified by the H3K27ac ChIP-seq read counts within the entire identified super enhancer region.
H3K4me3 HiChIP
H3K4me3 ChIP-seq HiChIP reference preprocessing
H3K4me3 ChIP-seqs from basal and 24-hour IL-4–stimulated macrophages were performed in duplicate with input controls. Fastq files were aligned with bowtie2 (
48) to the mm10 reference genome, and peak calling was done with MACS (
54) for each replicate separately. Significant peaks were merged using bedtools (
55) into a general bed file that was used as corresponding peak file for MAPS.
H3K4me3 HiChIP preprocessing
HiChIP sequencing (HiChIP-seq) data were processed with MAPS (
56) at 5000-bp resolution as described previously for proximity ligation-assisted ChIP-seq (PLAC-seq) (
28) for all four samples separately, basal and 24-hour IL-4 duplicate samples combined, and a merge of all four samples.
Differential analysis
To identify interactions that were significantly stronger in Il-4 or control, a differential analysis was performed as described in (
28). Briefly, significant interactions that were identified in the combined duplicate analysis of IL-4 and notx were merged in a general interaction set. Paired-end read counts that fell within these interactions were quantified for each sample separately. The quantified matrix of all significant interactions for all cell types was used as input for Limma (
57) differential interaction analysis. A linear model was fit, with one pairwise contrast (IL-4 versus control), with and without batch correction. No interactions were identified that were significantly different between IL-4 and control by either method (false discovery rate < 0.1, and absolute log
2 fold change > 1). Hence, the combined interaction set (generated using both IL-4 and control samples) was used for downstream analysis.
Interactions among promoters and enhancers
Significant interactions captured by HiChIP-seq were overlapped with previously identified active promoters and enhancers for the five mouse strains using HOMER mergePeaks “-d 2500” to identify three categories of interactive pairs: enhancer-enhancer, enhancer-promoter, and promoter-promoter. Enhancer-promoter interactions have enhancers on one end and promoters on the other end, while enhancer-enhancer or promoter-promoter interactions are the linked pairs of enhancers or promoters, respectively. We ended up with 145,907 enhancer-enhancer interactions, 81,411 enhancer-promoter interactions, and 10,710 promoter-promoter interactions. To better understand the regulatory landscape associated with IL-4 stimulation, we subsequently focused on enhancer-promoter interactions that contained IL-4–induced, IL-4–repressed, and/or IL-4–neutral promoters on one end and IL-4–induced, IL-4–neutral, and/or IL-4–repressed enhancers on the other end, and quantified the number of interactions between these possible promoter-enhancer combinations in nine categories as a contingency table. Fisher’s exact test was applied to the contingency table to determine whether any of the categories were significantly different for three comparisons of interest: IL-4–induced enhancer/promoter interactions versus noninduced enhancer/promoters; IL-4–repressed enhancer/promoter interactions versus nonrepressed enhancer/promoters; and IL-4–induced enhancer/promoter interactions versus IL-4–repressed enhancer/promoter interactions. For enhancer-enhancer interactions, we preselected enhancers that have at least fourfold difference in H3K27ac ChIP-seq tags between any two strains under the 24-hour IL-4 condition to obtain a set of strongly strain-differential enhancers. We then computed the Pearson correlation of H3K27ac tags across the five strains for every pair of interactive enhancers among the preselected set. To obtain noninteractive enhancers, we either randomly paired preselected enhancers on the same chromosome (same-chromosome random enhancers) or looked for enhancers within certain distances but not connected based on our data (distance-matched random enhancers).
Genetic variants at local and connected enhancers
Genetic variation between C57 and the other four strains at strain-differential enhancers was extracted using MMARGE annotate_mutations (
45), which was based on the VCF files from the Mouse Genomes Project (
46). Variants were searched within ±150 bp around the centers of enhancers. At least one genetic variant from the comparative strain needs to be present within the search area for such enhancer to be counted as having variants.
Motif analysis
Motif enrichment analysis
Given a certain set of peaks, we used HOMER findMotifsGenome.pl with parameters “-size 200 -mask” to identify de novo motifs and their matched known motifs (
15). The background sequences were either the default random sequences or a different set of peaks from a comparative condition in the main text and in the figure legends.
Motif mutation analysis
To integrate the genetic variation across mouse strains into motif analysis, we used MAGGIE, which is able to identify functional motifs out of the currently known motifs by testing for the association between motif mutations and the changes in specific epigenomic features (
32). The known motifs are obtained from the JASPAR database (
58). We applied this tool to strain-differential IL-4–responsive enhancers and transcription factor binding sites. Strain-differential IL-4–responsive enhancers were defined as previously described for KLA-responsive enhancers (
32). In brief, from every pairwise comparison across the five strains, enhancers identified as “IL-4 activated” or “IL-4 repressed” only in one of the compared strains were called strain-differential and were pooled together. For enhancer sites to be included in the analysis, enhancer activity had to be differentially regulated between two strains. As required by MAGGIE, sequences from the genomes of the responsive strains were input as “positive sequences,” and those from the other strains as “negative sequences.” Strain-differential transcription factor binding sites were defined by reproducible ChIP-seq peaks called in one strain but not in the other. Positive sequences and negative sequences were specified as sequences from the bound and unbound strains, respectively. The output
P values with signs indicating directional associations were averaged for clusters of motifs grouped by a maximum correlation of motif score differences larger than 0.6. Only motif clusters with at least one member showing a corresponding gene expression larger than 2 TPM in BMDMs were shown in figures.
Categorization of IL-4–induced enhancers
Among the strain-differential IL-4–induced enhancers as described above, we further split them into three categories based on the level of H3K27ac under the basal condition in noninduced strains. High basal enhancers have more than twofold stronger H3K27ac in noninduced strains, while low basal enhancers have more than twofold stronger H3K27ac in induced strains (lower basal H3K27ac in noninduced strains). Equal basal enhancers are those in between.
Deep learning
Neural network training
We adapted a similar strategy as AgentBind (
59) for our training procedure. We started with a pretrained DeepSEA (
26) model consisting of three convolutional layers and two fully connected layers and then fine-tuned it to generate three models based on our data: IL-4 active enhancers versus random backgrounds (auROC = 0.894), IL-4–induced enhancers versus random backgrounds (auROC = 0.919), and IL-4–induced enhancers versus noninduced enhancers (auROC = 0.796). The enhancer sequences were extended to 300 bp long. In all experiments, we left out sequences on chromosome 8 for cross-validation and sequences on chromosome 9 for testing. IL-4 active enhancers and noninduced enhancers were from C57 mice, while IL-4–induced enhancers were pooled from all the five strains to reach a comparable sample size. Random genomic backgrounds were generated by randomly selecting nearby guanine-cytosine (GC) content-matched equal-length sequences on the mm10 genome. We applied binary cross-entropy as the loss function. During each training, the initial learning rate was set as 1 × 10
−4 and reduced by a factor of 0.9 when learning stagnated. The training process stopped when the loss value had not decreased for more than 20 epochs.
DeepLIFT and importance score
We used DeepLIFT (
27) to generate importance scores with single-nucleotide resolution using uniform nucleotide backgrounds. For each input sequence, we generated two sets of scores, one for the original sequence and the other for its reverse complement. The final scores were the absolute maximum at each aligned position. We defined predicted functional nucleotides by the top 20% (i.e., top 60) positions within each input 300-bp sequence. To interpret the most important sequence patterns learned by neural networks, we computed the odds ratio of each 5-mer within top 10% of all 5-mers (
59). Fisher’s exact test was performed to determine whether 5-mers were enriched. We used TOMTOM (
60) to match 5-mers with known transcription factor binding motifs.
Data and code availability
All sequencing data have been made available by deposition in the Gene Expression Omnibus (GEO) database: GSE159630. The UCSC genome browser was used to visualize sequencing data. The codes for neural network model training and interpretation are available on our Github repository:
https://github.com/zeyang-shen/macrophage_IL4Response.
Statistical analysis
Two independent groups were tested using Mann-Whitney
U test for medians and using Levene’s test for variance. Gene expression comparisons were reported by adjusted
P values (i.e.,
q values) from DESeq2 (
49). Enrichment was computed by odds ratio and tested by Fisher’s exact test. Effect sizes were reported by Cohen’s
d. All gene expression data are displayed as means with 95% confidence interval. All data distributions are shown with means, 25th percentiles, and 75th percentiles.
Acknowledgments
We would like to thank J. Collier and J. Chang for technical assistance, the IGM core for library sequencing, L. Van Ael for assistance with manuscript preparation, and Drs. L. Warren and V. Lazarevic for donating Egr2flfl mice. Funding: These studies were supported by NIH grants DK091183 and HL147835 and a Leducq Transatlantic Network grant 16CVD01 to C.K.G. Sequencing costs were partially supported by DK063491. M.A.H. was supported by a Rubicon grant from the Netherlands Organization for Scientific Research and postdoctoral grants from the Amsterdam Cardiovascular Sciences Institute and the American Heart Association. Author contributions: Conceptualization: M.A.H., Z.S., and C.K.G. Formal analysis: Z.S., M.A.H., I.R.H., and A.Z. Investigation: M.A.H., I.C., and N.J.S. Writing: M.A.H., Z.S., and C.K.G. Visualization: Z.S., M.A.H., I.R.H., and I.C. Supervision: C.K.G. and M.G. Funding acquisition: C.K.G. Competing interests: The authors declare that they have no competing interests. Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and the sequencing data were deposited in the GEO database: GSE159630. Additional data related to this paper may be requested from the authors.