CRISPR-sciATAC

CRISPR-sciATAC is a multi-omic method to perform pooled sgRNA screen using single-cell ATAC-seq as readout. The data give you the chromatin accessibility profile of each single cell and the sgRNA information from the same cell. You can think of this as the the pooled version of Purterb-ATAC. It uses the CROP-seq vector backbone for the sgRNA, meaning it profiles the sgRNA via the transcripts. The authors also identified a new ViPar transposase with mutations P50K and M53Q, which they called TnY, and it gives higher yield during protein purification and has similar tagmentation activity compared to the commonly used Tn5. Then they use TnY with the combinatorial indexing strategy to achieve high cell throughput.

The workflow drawn in the page is based on their Supplementary Figure 1. The full barcoded oligo sequences can be found from their Supplementary Table 2.


sgRNA cloning

You can start with the vector backbone of CROPseq-Guide-Puro which the CROP-seq authors have deposited into Addgene #86708:

sgRNA vector backbone

The idea is the same as a regular sgRNA vector construction. We clone the sgRNA spacer into the plasmid between the two BsmBI sites. We order the oligos and anneal them like this:


5'- CACCG[sgRNA-Spacer]
        C[sgRNA-Spacer]CAAA -5'


Digest and ligate to the vector backbone. During the screen, the transcripts that we are interested in are transcribed by the EF-1a promoter. The transcripts are in the following structure (note the full U6 promoter sequence is too long (241 bp) to write in full, so I only write the relevant bits here):


                                                                                                                                                                               
5'-[PuroR]-[WPRE]-...UGCAUAUACGAUACAAGGCUGUUAG...UUGUGGAAAGGACGAAACACCG[sgRNA-Spacer]GUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUU-[Poly-A] -3'
                  |-------------- U6 promoter ---------------|                                                       CROP-seq sgRNA scaffold


Adapter and primer sequences:

pMENT (ME): 5'-/Phos/ CTGTCTCTTATACACATCT -3'

T5 tagment sciATAC: 5'- TCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAG -3'

Barcoded T7 tagment ATAC: 5'- GTCTCGTGGGCTCGGCTGTCCCTGTCC[8-bp barcode1]GCGATCGAGGACGGCAGATGTGTATAAGAGACAG -3'

Barcoded RTi sgRNA: 5'- CCGGGAGCTGCATGT[8-bp barcode1]TTTTTCAAGTTGATAA -3'

Barcoded P5i ATAC PCR1: 5'- AATGATACGGCGACCACCGAGATCTACAC[8-bp barcode2]TCGTCGGCAGCGTC -3'

Barcoded P7i ATAC PCR1: 5'- CAAGCAGAAGACGGCATACGAGAT[8-bp barcode3]GTCTCGTGGGCTCGG -3'

* U6 outer sgRNA PCR1: 5'- TGCATATACGATCACAAGGCTGTTAG -3'

Barcoded P5i sgRNA PCR1: 5'- CGCCGTCCAGCTGA[8-bp barcode2]CCGGGAGCTGCATGT -3'

Staggered P5 sgRNA PCR2: 5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT[None/A/TA/GTA/NNNNNNNN]CGCCGTCCAGCTGA -3'

Barcoded P7i sgRNA PCR2: 5'- CAAGCAGAAGACGGCATACGAGAT[8-bp barcode3]TTGTGGAAAGGACGAAACAC -3'

Read 1 ATACseq: 5'- CACCGTCTCCGCCTCAGATGTGTATAAGAGACAG -'3

Index 1 ATACseq: 5'- CTGTCTCTTATACACATCTGCCGTCCTCGATCGC -3'

Read 2 ATACseq: 5'- GCGATCGAGGACGGCAGATGTGTATAAGAGACAG -3'

TruSeq Read 1: 5'- ACACTCTTTCCCTACACGACGCTCTTCCGATCT -3'

Illumina P5 adapter: 5'- AATGATACGGCGACCACCGAGATCTACAC -3'

Illumina P7 adapter: 5'- CAAGCAGAAGACGGCATACGAGAT -3'

* Here, I think there is an extra C (marked in red) in the primer that will not anneal. This might be a mistake, but I'm not sure. It does not affect the final library structure.


Step-by-step library generation

(1) Anneal pMENT (ME) with T5 tagment sciATAC and Barcoded T7 tagment ATAC, respectively, to form the barcoded TnY:

TnY dimer

(2) Open chromatin TnY tagging by incubation of nuclei and barcoded TnY, and use the barcoded RTi sgRNA to capture the sgRNA transcript.

(2.1) DNA in the open chromatin region is tagged by TnY. There are 3 different products (will create 9 bp gap):


Product 1 (T5 at both ends, not amplifiable due to semi-suppressiev PCR):

5'- TCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXXXXXXXXXXX...XXX         CTGTCTCTTATACACATCT
                                                TCTACACATATTCTCTGTC         XXX...XXXXXXXXXXXXGACAGAGAATATGTGTAGACTCCGCCTCTGCCACGCAACCAACGCACCTCTGCGACGGCTGCT -5'


Product 2 (T7 at both ends, not amplifiable due to semi-suppressiev PCR):

5'- GTCTCGTGGGCTCGGCTGTCCCTGTCC[8-bp barcode1]GCGATCGAGGACGGCAGATGTGTATAAGAGACAGXXXXXXXXXXXX...XXX         CTGTCTCTTATACACATCT
                                                             TCTACACATATTCTCTGTC         XXX...XXXXXXXXXXXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCG[8-bp barcode1]CCTGTCCCTGTCGGCTCGGGTGCTCTG -5'


Product 3 (different ends, amplifiable):

5'- TCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXXXXXXXXXXX...XXX         CTGTCTCTTATACACATCT
                                                TCTACACATATTCTCTGTC         XXX...XXXXXXXXXXXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCG[8-bp barcode1]CCTGTCCCTGTCGGCTCGGGTGCTCTG -5'

(2.2) The mRNA transcript that contains the sgRNA information:


Anneal RT primer and reverse transcription:

  5'- CCGGGAGCTGCATGT[8-bp barcode1]
                                    TTTTTCAAGTTGATAA---------------------->
3'- [Poly-A]-UUUUUCGUGGCUGAGCCACGGUGAAAAAGUUCAACUAUUGCCUGAUCGGAAUAAAAUUGAACGAUAAAGAUCGAGAUUUUG[sgRNA-Spacer]GCCACAAAGCAGGAAAGGUGUU...GAUUGUCGGAACAUAGCAUAUACGU...-[WPRE]-[PuroR] -5'


This the first strand cDNA:

5'- CCGGGAGCTGCATGT[8-bp barcode1]TTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC[sgRNA-Spacer]CGGTGTTTCGTCCTTTCCACAA...CTAACAGCCTTGTATCGTATATGCA...-[WPRE]-[PuroR] -3'


(3) Stop reactions, pool nuclei redistribute into new plates, purify DNA and use a mixture of four primers (P5i ATAC PCR1, P7i ATAC PCR1, U6 outer sgRNA PCR1 and P5i sgRNA PCR1) together to amplify ATAC and sgRNA fragments:

(3.1) ATAC fragments are amplified by P5i ATAC PCR1 + P7i ATAC PCR1. Gaps need be filled first (72 degree 5 mins):


5'- AATGATACGGCGACCACCGAGATCTACAC[8-bp barcode2]TCGTCGGCAGCGTC----------->
                                            5'- TCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXXXXXXXXXXX...XXX         CTGTCTCTTATACACATCT
                                                                                            TCTACACATATTCTCTGTC         XXX...XXXXXXXXXXXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCG[8-bp barcode1]CCTGTCCCTGTCGGCTCGGGTGCTCTG -5'
                                                                                                                                                                                            <----------GGCTCGGGTGCTCTG[8-bp barcode3]TAGAGCATACGGCAGAAGACGAAC -5'

(3.2) sgRNA cDNA is amplified by U6 outer sgRNA PCR1 and P5i sgRNA PCR1:


5'- CGCCGTCCAGCTGA[8-bp barcode2]CCGGGAGCTGCATGT--------->
                             5'- CCGGGAGCTGCATGT[8-bp barcode1]TTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC[sgRNA-Spacer]CGGTGTTTCGTCCTTTCCACAA...CTAACAGCCTTGTATCGTATATGCA...-[WPRE]-[PuroR] -3'
                                                                                                                                                      <---------GATTGTCGGAACATAGCATATACGT -5'
                                                                                                                                                                             C <- this is the extra C that won't anneal, not sure if this is a mistake


(4) These are the PCR products:


ATAC (ready to sequence):

5'- AATGATACGGCGACCACCGAGATCTACAC[8-bp barcode2]TCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXX...XXXCTGTCTCTTATACACATCTGCCGTCCTCGATCGC[8-bp barcode1]GGACAGGGACAGCCGAGCCCACGAGAC[8-bp barcode3]ATCTCGTATGCCGTCTTCTGCTTG
    TTACTATGCCGCTGGTGGCTCTAGATGTG[8-bp barcode2]AGCAGCCGTCGCAGAGGTGCGTTGGTTGCGTGGCAGAGGCGGAGTCTACACATATTCTCTGTCXXX...XXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCG[8-bp barcode1]CCTGTCCCTGTCGGCTCGGGTGCTCTG[8-bp barcode3]TAGAGCATACGGCAGAAGACGAAC -5'



sgRNA (need another round of PCR, see the next step):

5'- CGCCGTCCAGCTGA[8-bp barcode2]CCGGGAGCTGCATGT[8-bp barcode1]TTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC[sgRNA-Spacer]CGGTGTTTCGTCCTTTCCACAA...CTAACAGCCTTGTGATCGTATATGCA
    GCGGCAGGTCGACT[8-bp barcode2]GGCCCTCGACGTACA[8-bp barcode1]AAAAAGTTCAACTATTGCCTGATCGGAATAAAATTGAACGATAAAGATCGAGATTTTG[sgRNA-Spacer]GCCACAAAGCAGGAAAGGTGTT...GATTGTCGGAACACTAGCATATACGT -5'



(5) Take a fraction (2 uL) from the reaction above and use Staggered P5 sgRNA PCR2 + P7i sgRNA PCR2 to amplify sgRNA fragment:


sgRNA:

5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT[None/A/TA/GTA/NNNNNNNN]CGCCGTCCAGCTGA---------------->
                                                                                  5'- CGCCGTCCAGCTGA[8-bp barcode2]CCGGGAGCTGCATGT[8-bp barcode1]TTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC[sgRNA-Spacer]CGGTGTTTCGTCCTTTCCACAA...CTAACAGCCTTGTGATCGTATATGCA
                                                                                      GCGGCAGGTCGACT[8-bp barcode2]GGCCCTCGACGTACA[8-bp barcode1]AAAAAGTTCAACTATTGCCTGATCGGAATAAAATTGAACGATAAAGATCGAGATTTTG[sgRNA-Spacer]GCCACAAAGCAGGAAAGGTGTT...GATTGTCGGAACACTAGCATATACGT -5'
                                                                                                                                                                                                              <------------CACAAAGCAGGAAAGGTGTT
                                                                                                                                                                                                                                               [8-bp barcode3]TAGAGCATACGGCAGAAGACGAAC -5'


(6) Final library structure:


ATAC:

5'- AATGATACGGCGACCACCGAGATCTACACNNNNNNNNTCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXX...XXXCTGTCTCTTATACACATCTGCCGTCCTCGATCGCNNNNNNNNGGACAGGGACAGCCGAGCCCACGAGACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGNNNNNNNNAGCAGCCGTCGCAGAGGTGCGTTGGTTGCGTGGCAGAGGCGGAGTCTACACATATTCTCTGTCXXX...XXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCGNNNNNNNNCCTGTCCCTGTCGGCTCGGGTGCTCTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'
           Illumina P5             8-bp           s5                                          ME           gDNA           ME                         8-bp             s7                8-bp        Illumina P7
                                 barcode2                                                                                                          barcode1                           barcode3


sgRNA:

5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT...CGCCGTCCAGCTGANNNNNNNNCCGGGAGCTGCATGTNNNNNNNNTTTTTCAAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAAAAC[sgRNA-Spacer]CGGTGTTTCGTCCTTTCCACAANNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGA...GCGGCAGGTCGACTNNNNNNNNGGCCCTCGACGTACANNNNNNNNAAAAAGTTCAACTATTGCCTGATCGGAATAAAATTGAACGATAAAGATCGAGATTTTG[sgRNA-Spacer]GCCACAAAGCAGGAAAGGTGTTNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'
           Illumina P5                  TruSeq Read 1          |                 8-bp                   8-bp                  CROP-seq sgRNA scaffold                                                         8-bp          Illumina P7
                                                        variable bases:        barcode2               barcode1                                                                                              barcode3
                                                  None, A, TA, GTA or 8 Ns


ATAC Library sequencing using MiSeq (slightly different from modern Illumina sequencing schemes in terms of how the i5 index is sequenced):

(1) Add Read 1 ATACseq primer to sequence the first read (bottom strand as template, 45 cycles, open chromatin read):


                                                                  5'- CACCGTCTCCGCCTCAGATGTGTATAAGAGACAG------->
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGNNNNNNNNAGCAGCCGTCGCAGAGGTGCGTTGGTTGCGTGGCAGAGGCGGAGTCTACACATATTCTCTGTCXXX...XXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCGNNNNNNNNCCTGTCCCTGTCGGCTCGGGTGCTCTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'

(2) Add Index 1 ATACseq primer to sequence barcode1 + barcode3 (bottom strand as template, 43 cycles, the first 8 bp are barcode1 and the last 8 bp are barcode3):


                                                                                                             5'- CTGTCTCTTATACACATCTGCCGTCCTCGATCGC------------------------------------------>
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGNNNNNNNNAGCAGCCGTCGCAGAGGTGCGTTGGTTGCGTGGCAGAGGCGGAGTCTACACATATTCTCTGTCXXX...XXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCGNNNNNNNNCCTGTCCCTGTCGGCTCGGGTGCTCTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'

(3) Bend over and anneal to P5 on the flow cell to sequence the barcode2 (bottom strand as template, 8 cycles):


5'- AATGATACGGCGACCACCGAGATCTACAC------->
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGNNNNNNNNAGCAGCCGTCGCAGAGGTGCGTTGGTTGCGTGGCAGAGGCGGAGTCTACACATATTCTCTGTCXXX...XXXGACAGAGAATATGTGTAGACGGCAGGAGCTAGCGNNNNNNNNCCTGTCCCTGTCGGCTCGGGTGCTCTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'

(4) Cluster regeneration, add Read 2 ATACseq primer to sequence the second read (top strand as template, 45 cycles, open chromatin read):


5'- AATGATACGGCGACCACCGAGATCTACACNNNNNNNNTCGTCGGCAGCGTCTCCACGCAACCAACGCACCGTCTCCGCCTCAGATGTGTATAAGAGACAGXXX...XXXCTGTCTCTTATACACATCTGCCGTCCTCGATCGCNNNNNNNNGGACAGGGACAGCCGAGCCCACGAGACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
                                                                                                          <------GACAGAGAATATGTGTAGACGGCAGGAGCTAGCG -5'


sgRNA Library sequencing:

There is only one single step here: add Illumina TruSeq Read 1 primer to get the first read (bottom strand as template, 162 cycles, the read contains everything, including sgRNA-Spacer and three rounds of barcodes):


                         5'- ACACTCTTTCCCTACACGACGCTCTTCCGATCT----------------------------------------------------------------------------------------------------------------------------------------------------->
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGA...GCGGCAGGTCGACTNNNNNNNNGGCCCTCGACGTACANNNNNNNNAAAAAGTTCAACTATTGCCTGATCGGAATAAAATTGAACGATAAAGATCGAGATTTTG[sgRNA-Spacer]GCCACAAAGCAGGAAAGGTGTTNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'