Enhancer RNAs (eRNAs) are non-coding RNAs produced from active enhancers. An optimal approach to study eRNA functions is to manipulate their levels in the native chromatin regions. Here we introduce a robust system for eRNA studies by using CRISPR-dCas9-fused transcriptional activators to induce the expression of eRNAs of interest.
Enhancers are pivotal genomic elements scattered through the mammalian genome and dictate tissue-specific gene expression programs. Increasing evidence has shown that enhancers not only provide DNA binding motifs for transcription factors (TFs) but also generate non-coding RNAs that are referred to as eRNAs. Studies have demonstrated that eRNA transcripts can play significant roles in gene regulation in both physiology and disease. Commonly used methods to investigate the function of eRNAs are constrained to “loss-of-function” approaches by knockdown of eRNAs, or by chemical inhibition of the enhancer transcription. There has not been a robust method to conduct “gain-of-function” studies of eRNAs to mimic specific disease conditions such as human cancer, where eRNAs are often overexpressed. Here, we introduce a method for precisely and robustly activating eRNAs for functional interrogation of their roles by applying the dCas9 mediated Synergistic Activation Mediators (SAM) system. We present the entire workflow of eRNA activation, from the selection of eRNAs, the design of gRNAs to the validation of eRNA activation by RT-qPCR. This method represents a unique approach to study the roles of a particular eRNA in gene regulation and disease development. In addition, this system can be employed for unbiased CRISPR screening to identify phenotype-driving eRNA targets in the context of a specific disease.
The human genome contains a constellation of regulatory elements1,2,3. Among these, enhancers emerge to be one of the most critical categories4,5,6. Enhancers play essential roles in regulating development, and are responsible for generating spatial-temporal gene expression programs to determine cell identity5,6,7. Conventionally, enhancers are only considered to be DNA elements that provide binding motifs for transcription factors (TFs), which then control target gene expression6,8. However, a series of studies found that many active enhancers also transcribe non-coding enhancer RNAs (i.e., eRNAs)4,9,10.
The level of eRNA transcription was found to correlate with the activity of an enhancer4,10. Active enhancers produce more eRNA transcripts and show higher levels of epigenome markers associated with active transcription, such as H3K27ac and H3K4me19,11,12. Some studies have demonstrated that eRNA transcripts can play important roles in transcriptional activation of target genes10,12. A large number of eRNAs were identified to be deregulated in human cancers13,14,15,16, many of which exhibited high cancer type specificity and clinical relevance. These findings bring opportunities that the elucidation of eRNAs that can drive/promote tumorigenesis may offer novel targets for therapeutic intervention13,15.
Current methods to study eRNA functions are almost exclusively based on knockdown strategies that used small interference RNAs (siRNA), short hairpin RNAs (shRNAs), or antisense oligonucleotides (ASOs, of which locked nucleic acids (LNAs) are the commonly used type in research)10,12,17. However, human diseases such as cancer predominantly show overexpression of eRNAs as compared to their adjacent normal tissue15, demanding tools to “overexpress” eRNAs to mimic their disease-relevant expression patterns for functional studies. To achieve this, a plasmid-based ectopic overexpression system is not optimal because the exact transcription start and termination sites of eRNAs remain largely unclear. In addition, a plasmid expression system may alter the location of eRNAs, causing potential artifacts of their functions18. Here we provide a detailed protocol to facilitate the functional characterization of eRNAs by enforcing their “overexpression” in the native genomic locus of their production (i.e., in situ), which is based on the CRISPR/dCas9-Synergistic Activation Mediators System (SAM).
The SAM system was initially developed for activating coding genes and long intergenic non-coding RNAs (lincRNAs) associated with BRAF inhibitor resistance in melanoma cells19. Unlike other CRISPR activation (CRISPRa) technologies, the SAM system consists of a combination of transcription activators to confer robust transcriptional activation of target regions. These activators include: an enzymatically dead Cas9 (dCas9) fused with VP64 (i.e., dCas9-VP64); a guide RNA containing two MS2 RNA aptamers, and an MS2-p65-HSF1 fusion activator protein. The presence of the MS2 aptamers in the gRNA can recruit the MS2-p65-HSF1 fusion protein to the vicinity of dCas9/gRNA binding sites. Among these, VP64 is an engineered tetramer of the herpes simplex VP16 transcriptional activator domain, which has been shown to strongly activate gene transcription by recruiting general transcription factors20,21,22. The MS2-p65-HSF1 fusion protein consists of three parts. The first part, the MS2-N55K, is a mutant form of MS2 binding protein that has a stronger affinity23; the other two parts of this fusion protein are the transactivation domain of p65 and heat shock factor 1 (HSF1), both of which are transcription factors that possess strong transactivation domains and can induce robust transcription programs24,25. Therefore, the SAM system essentially created a highly potent activator complex to activate transcription of designated coding genes and lincRNAs19.
The entire workflow of this protocol is shown in Figure 1.
1. Enhancer RNA (eRNA) selection
2. gRNA design
3. Clone gRNAs into a lentiviral construct
4. gRNA efficiency test
NOTE: Although it may not be necessary for every gRNA, it is recommended that researchers examine the quality of gRNA by performing Surveyor assay (i.e., mismatch cleavage assay) to detect indels or mutations that can only be efficiently generated by good quality gRNAs33,34. Other methods such as Tracking of Indels by Decomposition (TIDE) can also be used to determine gRNA efficiency30,35. Surveyor nuclease is a member of a family of mismatch-specific endonucleases that can cut double-strand DNA with mismatches (Figure 3A). The quality of gRNAs can be revealed by the efficacy of producing smaller DNA species. Practically, surveyor cutting efficacy can also be affected by the transfection efficiency of gRNAs and Cas9.
5. Lentivirus generation
6. Cell culture
7. Cell infection and selection
8. RNA extraction and quantitative RT-PCR to examine eRNA levels
9. dCas9 ChIP and qPCR
NOTE: This step is an optional experiment to validate the binding of dCas9/SAM-gRNA complex to the target enhancer by the specific gRNAs. While it is encouraged that users perform this step, it is not necessary to test every single gRNA. Refer to an example shown in Figure 5B. Refer to primers listed in Supplementary Table 1.
10. Cell growth assay and other functional tests of eRNA over-activation
Figure 1 illustrates the overall workflow of this protocol. Our focus was on a representative eRNA, NET1e15, which is overexpressed in breast cancer, for which SAM system was used to activate and study it’s biological role in regulating gene expression, cell proliferation and cancer drug response. For this NET1 enhancer, several p300 ChIP-Seq peaks, flanked by transcribed eRNA transcripts (Figure 2A,B) were marked. Based on strong GRO-Seq signals, one of the p300 ChIP-Seq peaks was selected as the enhancer core (where bi-direction eRNA transcripts originate, Figure 2B). NET1 enhancer region was also marked by dense ChIP-Seq peaks of H3K4me1 and H3K27ac. Multiple gRNAs were designed within this enhancer core p300 peak, with two of the gRNAs denoted by arrows in the inlet (Figure 2B). A chromatin loop between NET1 enhancer core and NET1 promoter can be detected in the data of Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET, black arch, Figure 2B), suggesting the direct regulatory relationship between the NET1e and the NET1 gene.
The quality of designed gRNA can be tested by two different methods i.e., Surveyor assay (Figure 3) or ChIP-qPCR (Figure 5). In Figure 3A, a schematic is used to illustrate the procedure of surveyor assay used to examine the quality of gRNAs. In surveyor assay, a PCR product spanning the gRNA recognition site was amplified from 60 ng genomic DNA, which was subjected to denaturing and rehybridization, and subsequently surveyor cleavage. In the control cells transfected with Cas9 plasmid only (pSpCas9(BB)-2A-Puro, Addgene #62988), only one band of 1,205 bp (indicated by the blue asterisk, Lane 1 in Figure 3B) was detected. By contrast, the assay using genomic DNAs from cells transfected with NET1e gRNA#1 together with Cas9 generated two cleavage products of 777 bp and 428 bp (denoted by orange asterisks), respectively (lane 2, Figure 3B). Similarly, shown in lane 3, the co-transfection of NET1e gRNA#2 with Cas9 caused Surveyor digestion of the 1,205 bp PCR product into two digested products of 905 bp and 300 bp (marked by red asterisks), respectively (Figure 3B). The sizes of these cleavage products matched the expected sizes of digestion due to the in-dels or mutations generated by the gRNAs, indicating that gRNAs and Cas9 can cut our targeted enhancer DNA in cells.
MCF7 parental SAM cell line was successfully generated, which expresses the two effector proteins as shown both by western blotting (Figure 4A) and RT-qPCR (Figure 4B). After infecting this parental line with NT-gRNAs or specific gRNAs targeting NET1 enhancer, ChIP-qPCR was conducted to test if the gRNA can recruit dCas9-VP64 (one of the effectors) to our target site. Prior to ChIP-qPCR, the amplification linearity of ChIP-qPCR primers needs to be tested by a PCR-cycle versus diluted-cDNA/ChIP-DNA standard curve, as shown in Figure 5A. For each pre-diluted standard shown in the x axis of Figure 5A, four folds of serial dilution were conducted to generate them from sonicated genomic DNAs. A standard curve was plotted between delta cycle threshold (∆CT) values (y axis) against the relative quantity (log2) of each diluted standard to the initial amount of genomic DNA (Figure 5A). This specific primer set conferred an R square of 0.994, and we generally recommend primer sets with an R square higher than 0.95. We then conducted ChIP by using a ChIP-quality antibody targeting Cas9 and found that the dCas9-VP64 protein was recruited by our NET1e gRNA to the target NET1 enhancer region, but not to another irrelevant genomic region (Figure 5B).
RT-qPCR was used to examine the expression levels of the targeted NET1e RNA after SAM activation by two different gRNAs in MCF7 SAM cells. Figure 6A shows that in cells with engineered SAM system targeting NET1 enhancer, >30-fold of NET1e upregulation was successfully achieved with two different gRNAs. Because NET1e was overexpressed in breast tumors than adjacent normal tissue based on our analysis of the Cancer Genome Atlas (TCGA) datasets15, and because its overexpression correlated with poor survival and altered response to a set of cancer drugs15, we tested if the enforced overexpression of NET1e by SAM can directly alter cell proliferation or cellular response to cancer drugs. We conducted cell growth assay and measured cell confluence by a live cell imager. The confluences were measured every 6 h and normalized to that at 0 h, which showed that SAM-enforced overexpression of NET1e accelerated cell growth (Figure 6B). We selected the BEZ235 and the Obatoclax to examine cell response to cancer drugs because NET1e expression is positively correlated with IC50 of these two drugs based on the analysis from Cancer Therapeutics Response Portal (CTRP) and Genomics of Drug Sensitivity in Cancer (GDSC). The result suggested that NET1e overexpression directly conferred resistance to specific cancer drugs (Figure 6C).
Figure 1: A work-flow chart demonstrating procedures used to generate the SAM system for enforced activation of eRNAs of interest. Please click here to view a larger version of this figure.
Figure 2: Epigenetic features of NET1 enhancer and its neighborhood in MCF7 cells. (A) A diagram showing the structure of the “Enhancer core” to denote the DNA region bound by transcription factors and cofactors, which is used for gRNA design to anchor SAM system in step 2 of the protocol. (B) A snapshot of genome browser tracks of ChIP-Seq, GRO-Seq, and ChIA-PET of indicated factors at NET1 and NET1e loci in MCF7 cells. GRO-Seq is stranded (Red: Watson strand, Orange: Crick strand). The zoom-in inlet to the right demonstrates the specific “enhancer core” used for designing two gRNAs (red lines and pointed by arrows). ChIA-PET track indicates a potential regulatory relationship between NET1e and NET1 gene, with the black and grey arch denoting chromatin loops. Figure 2B is modified from Zhang, Z. et al.15. The MCF7 ChIA-PET dataset is from ENCODE data portal1. Please click here to view a larger version of this figure.
Figure 3: Surveyor nuclease digestion assay to test gRNA quality. (A) The workflow of the Surveyor assay. The red dot on the DNA indicates in-dels or mutations generated by CRISPR/Cas9 to the target region. The forward primer was designed to be ~300-400 bp from the cutting site and the reverse primer ~800-900 bp from the cutting site, allowing easy discerning of their cutting in gel electrophoresis. (B) An agarose gel picture of Surveyor digestion assay. Blue asterisk indicates uncleaved DNA from PCR, while orange and red asterisks indicated cleaved products by Surveyor. Please click here to view a larger version of this figure.
Figure 4: Examination of dCas9-VP64 and MS2-p65-HSF1 expression in stable MCF7 SAM cell line. (A) Western blots with a Cas9 antibody in parental MCF7 cells (MCF7 WT), and in the MCF7 cell line expressing dCas9-VP64 and MS2-p65-HSF1 proteins (MCF7 SAM). GAPDH was used as a loading control. (B) RT-qPCR results show the mRNA levels of dCas9-VP64 and MS2-p65-HSF1 in MCF7 WT and MCF7 SAM, respectively. The values were normalized to GAPDH. Error bars represent mean ± SD; N=3. P value: Student’s t-test. Please click here to view a larger version of this figure.
Figure 5: ChIP-qPCR to validate dCas9-VP64 recruitment to NET1 enhancer. (A) Amplification linearity test of the primer set used for the NET1 enhancer ChIP-qPCR. (B) ChIP-qPCR based on Cas9 antibody using primers specific for the NET1 enhancer region, and an irrelevant genomic region as a negative control (hg19, chr14:35,025,431-35,025,595). Error bars represent mean ± SD; N=3. P value: Student’s t-test. Please click here to view a larger version of this figure.
Figure 6: NET1e expression level, cell growth assay, and responses to anti-cancer drugs in MCF7 NET1e-SAM cells. (A) RT-qPCR result showing the expression levels of NET1e, or its neighboring gene, NET1, in the SAM cell lines expressing non-targeting gRNA (NT-gRNA), NET1e gRNA#1 or NET1e gRNA#2. The values are normalized to GAPDH. Error bars represent mean ± SD; N=3. P value: Student’s t-test. (B) Normalized cell confluence of MCF7 SAM cells expressing gRNAs as indicated. (C) The IC50 of two drugs (Obatoclax and BEZ235) in SAM cell lines expressing NET1e gRNA#1 or NT-gRNA. The relative cell amount was measured with live cell imager. Error bars represent mean ± SD. Student’s t-test was used to calculate the p values. This figure is modified from Zhang et al15. Please click here to view a larger version of this figure.
Supplementary Table 1: List of primer sequences for gRNAs, Surveyor assay, RT-qPCR, and ChIP qPCR. Please click here to download this file.
Supplementary Table 2: PCR and RT-qPCR experimental setup and reaction conditions. Please click here to download this file.
Based on our data, we conclude that the SAM system is suitable for studying the role of eRNAs in regulating cellular phenotypes, e.g., cell growth or drug resistance. However, careful gRNA designing is required for robust eRNA activation, due to the following reasons. First of all, the transcription start site (TSS) of eRNA in each specific cell lines/types remains less clearly annotated. Due to this, epigenomic information (e.g., ChIP-Seq of H3K27ac, of transcription factors, or of p300), transcriptional activity depicted by GRO-Seq (or additional CAGE4 or GRO-cap26 datasets if available) need to be utilized to deduce the transcription starting sites so that gRNAs can be designed in a way to avoid the potential impediment of normal enhancer transcription by the dCas9/SAM complex. Second, gRNAs provided by current design tools may show differential eRNA activation potency in SAM systems, the basis of which remains mechanistically unclear. It can be due to different binding dynamics of distinct gRNA sequences30, or due to the relative location of the gRNA/dCas9 binding site versus the other critical regulatory DNA elements near the enhancer core region. To ensure robust eRNA activation and to reduce off-targeting, we recommend readers to design multiple gRNAs (n>3) and test their actual activation efficacy by RT-qPCR. If a consistent result is generated by multiple gRNAs, it is considered conclusive. Furthermore, the process of generating stable lines also varies based on the choice of cell lines, which may require different selection time, length, and antibiotic concentrations. Finally, we found it important to ensure the quality of primer sets for examining eRNAs in RT-qPCR, as their abundance is sometimes one or two magnitudes lower than common mRNAs. The primers’ performance in qPCR and their linear range of amplification should be carefully examined. It is also important and necessary to remove the trace of genomic DNAs in RNA samples when the levels of eRNAs are examined, because these DNAs may confound RT-qPCR results.
We largely followed the original method developed by the group of Feng Zhang et al.19. The modifications here is to target the dCas9-SAM complex to enhancers and to activate eRNAs instead of lincRNAs or mRNAs.
Here we introduce a highly effective method to activate the transcription of targeted enhancers at its native chromosomal sites (i.e., in situ). This method complements conventional approaches to study eRNAs in gene regulation that often emphasize on the knockdown of eRNAs by shRNAs, siRNAs, or ASOs10,17. Indeed, it has been largely infeasible to apply strategies to activate the transcription of eRNAs for functional interrogation until the advent of CRISPR mediated epigenetic tools. This is because ectopic expression of an eRNA driven by a heterologous promoter may not be optimal in two aspects: 1) the full-length transcript of eRNAs from their transcriptional start to end sites (particularly the ends) are difficult to be determined or are possible of mixed species in a cell; 2) the ectopic expression may alter the location of eRNAs, or may not contain proper structural features of the RNAs due to the lack of genomic/epigenomic contexts. CRISPR-dCas9 based epigenetic activation of eRNAs overcomes the above problems and provides an ideal method to “over-express” eRNAs in situ. Importantly, the enhanced eRNA expression recapitulates the disease-relevant enhancer over-activation commonly observed in human cancers13,14,15,16, permitting subsequent studies of the pathological consequence.
Here we used NT-gRNA as a control for eRNA-targeting gRNAs to deduce the effects of the eRNA over-activation. However, there are other controls that can be employed. For example, cells with eRNA-targeting gRNAs with or without one of the two effector proteins may also be considered to deduce the genes affected by the effector proteins. Additional genome-wide tools such as ChIP-seq may be utilized to examine the global binding patterns of dCas9 effectors together with a targeting gRNA to further test the specificity of the SAM activator. Regardless, we consider that a robust conclusion should always be based on at least two separate gRNAs generating consistent results. Currently, little knowledge is available in terms of how the enforced binding of dCas9 effectors may impede the normal binding of transcription factors or cofactors on enhancers, which should be taken into consideration when users are designing and/or interpreting their experiments. This should be an important research question for future work to better take advantage of this robust tool. While our mainly focus was on the oncogenic functions of eRNAs in this paper, the SAM system can be applied to examine eRNA functions in other biological or pathological settings40,41,42. One limit of this system is that it is not rapidly inducible, nor is it reversible, which may elicit indirect effects on cell growth that is not directly caused by eRNA activation. Therefore, one of the future directions to improve this protocol is to make it inducible and reversible. This will permit our manipulation of enhancers and eRNAs with higher temporal or spatial precision.
The content is solely the responsibility of the authors and does not necessarily represent the official views of the Cancer Prevention and Research Institute of Texas.
This work is supported by grants to W.L (Cancer Prevention and Research Institute of Texas, CPRIT RR160083 and RP180734; NCI K22CA204468; NIGMS R21GM132778; The University of Texas UT Stars Award; and the Welch foundation AU-2000-20190330) and a post-doctoral fellowship to J.L (UTHealth Innovation for Cancer Prevention Research Training Program Post-doctoral Fellowship, CPRIT RP160015). We acknowledge the original publicataion15 where some of our current figures were adopted from (with modifications), which follows the Creative Commons license (https://creativecommons.org/licenses/by/4.0/).
Name | Company | Catalog Number | Comments |
Blasticidin | Goldbio | B-800-100 | |
BsmBI restriction enzyme | New England BioLabs Inc. | R0580S | |
Cas9 mAb | Active Motif | 61757 | Lot: 10216001 |
Deoxynucleotide (dNTP) Solution Mix | New England BioLabs Inc. | N0447S | |
Dulbecco’s Modified Eagle Medium | Corning | 10-013-CM | |
Dynabeads Protein G | Thermo Fisher Scientific | 65002 | |
EDTA | Thermo Fisher Scientific | BP118-500 | |
EGTA | Sigma | E3889 | |
Fetal Bovine Serum | GenDEPOT | F0900-050 | |
Glycogen | Thermo Fisher Scientific | 10814010 | |
Hepes-KOH | Thermo Fisher Scientific | BP310-100 | |
Hexadimethrine Bromide | Sigma | H9268 | |
Hygromycin B | Goldbio | H-270-25 | |
IGEPAL CA630 | Sigma | D6750 | |
IncuCyte live-cell imager | Essen BioScience | IncuCyte S3 Live-Cell Analysis System | |
lenti_dCAS-VP64_Blast | Addgene | 61425 | |
lenti_gRNA(MS2)_zeo backbone | Addgene | 61427 | |
lenti_MS2-p65-HSF1_Hygro | Addgene | 61426 | |
LiCL | Sigma | L9650 | |
Lipofectamine 2000 | Thermo Fisher Scientific | 11668-500 | |
NaCl | Sigma | S3014 | |
Na-Deoxycholate | Sigma | D6750 | |
NaHCO3 | Thermo Fisher Scientific | BP328-500 | |
N-lauroylsarcosine | Sigma | 97-78-9 | |
Opti-MEM Reduced Serum Medium | Thermo Fisher Scientific | 31985070 | |
PES syringe filter | BASIX | 13-1001-07 | |
Protease Inhibitor Cocktail Tablet | Roche Diagnostic | 11836145001 | |
pSpCas9(BB)-2A-Puro | Addgene | 62988 | |
Q5 High-Fidelity DNA Polymerase | New England BioLabs Inc. | M0491S | |
Q5 Reaction Buffer | New England BioLabs Inc. | B9027S | |
Quick-DNA Miniprep | ZYMO Research | D3025 | |
Quick-RNA Miniprep | ZYMO Research | R1054 | |
Restriction enzyme buffer | New England BioLabs Inc. | B7203S | |
RT-qPCR Detection System | Thermo Fisher Scientific | Quant Studio3 | |
SDS | Thermo Fisher Scientific | BP359-500 | |
Sonicator | Qsonica | Q800R2 | |
Sso Advanced Universal SYBR Green Supermix | Bio-Rad Laboratories | 1725274 | |
Stbl3 competent cell | Thermo Fisher Scientific | C7373-03 | |
Superscript IV reverse transcript | Thermo Fisher Scientific | 719000 | |
Surveyor Mutation Detection Kits | Integrated DNA Technologies | 706020 | |
T4 DNA Ligase | New England BioLabs Inc. | M0202S | |
T4 DNA Ligase Reaction Buffer | New England BioLabs Inc. | B0202S | |
TE buffer | Thermo Fisher Scientific | 46009CM | |
Thermal cycler | Bio-Rad Laboratories | T100 | |
Thermomixer | Sigma | 5384000020 | |
Zeocin | Thermo Fisher Scientific | ant-zn-1p |
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved