10x Chromium 5' Gene Expression

The 5' Gene Expression kit is similar to the 3' scRNA-seq methods in general. Instead of using barcoded RT primers on the beads, the 5' Gene Expression kit use a universal RT primer for the reverse transcription, but with barcoded Template Switching Oligos (TSO) on their gel beads. The 5' Gene Expression kit is often coupled with profiling V(D)J from T/B cells (with TCR/BCR primers), but you can simply use this kit for gene expression purposes. Conceptually, this kit is very similar to STRT-seq.

Like the 3' Gene Expression kit, there are multiple versions of the 10x Chromium 5' Gene Expression kit based on slightly different chemistry. The library structure of v1 and v2 chemistries are exactly the same, and they are shown on this page. Oligo sequence information is taken from The 10x Genomics Technical Note. The 16-bp cell barcodes are the same as the 3' Gene Expression kit v2, and you can download from here: 737K-august-2016.txt.gz. This file is copied from Cell Ranger (using Cell Ranger v2.1.0 as an example) /path/to/cellranger-2.1.0/cellranger-cs/2.1.0/tenkit/lib/python/tenkit/barcodes.

Around 2023-2024, the v3 chemistry was introduced. It has better cell recovery and sensitivity (number of detected genes per cell and TRA/TRB UMI counts) compared to v1 and v2. See their product sheet for a more detailed overview. The cell barcodes have changed in the v3 chemistry, and the file with all 16-bp cell barcodes can be found here: 3M-5pgex-jan-2023.txt.gz. This file is copied from Cell Ranger (using Cell Ranger v8.0.1 as an example) /path/to/cellranger-8.0.1/lib/python/cellranger/barcodes/. The library structure of 5' v3 is exactly the same as 5' v1 and v2, except that the UMI is 10-bp long in v1 and v2 but 12-bp in v3.


Adapter and primer sequences:

Beads-TSO:


          V1 (PN-220112) & V2 (PN-1000264): |--5'- CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATrGrGrG -3'

                           V3 (PN-2001129): |--5'- CTACACGACGCTCTTCCGATCT[16-bp cell barcode][12-bp UMI]TTTCTTATATrGrGrG -3'

Poly-dT RT primer (PN-2000007): 5'- AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTVN -3'

cDNA Primer Mix (for cDNA amplification, PN-220106):


        Forward primer: 5'- CTACACGACGCTCTTCCGATCT -3'

        Reverse primer: 5'- AAGCAGTGGTATCAACGCAG -3'

Illumina Truseq Read 1 primer: 5'- ACACTCTTTCCCTACACGACGCTCTTCCGATCT -3'

Illumina Truseq Read 2 primer: 5'- GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT -3'

Truseq adapter (double stranded DNA with a T overhang, PN-220026):


        5'-  GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
        3'- TCTAGCCTTCTCG -5'

Library PCR primer 1 (PN-220111): 5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTC -3'

Library PCR primer 2 (PN-220103): 5'- CAAGCAGAAGACGGCATACGAGAT[8-bp sample index]GTGACTGGAGTTCAGACGTGT -3'

Sample index sequencing primer: 5'- GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'

Illumina P5 adapter: 5'- AATGATACGGCGACCACCGAGATCTACAC -3'

Illumina P7 adapter: 5'- CAAGCAGAAGACGGCATACGAGAT -3'


Step-by-step library generation

(1) Reverse transcription with Poly-dT RT primer using MMLV:


                   <--------NV(T)30CATGAGACGCAACTATGGTGACGAA -5'
    5'- XXXXXXXXXXXXXXXXXXXXXB(A)30

(2) The terminal transferase activity of MMLV adds extra Cs:


     CCCXXXXXXXXXXXXXXXXXXXXXNV(T)30CATGAGACGCAACTATGGTGACGAA -5'
    5'- XXXXXXXXXXXXXXXXXXXXXXB(A)30

(3) cDNA capture by gel bead barcoded TSO:


                                                          <-----------CCCXXXXXXXXXXXXXXXXXXXXXNV(T)30CATGAGACGCAACTATGGTGACGAA -5'
|--5'- CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATGGGXXXXXXXXXXXXXXXXXXXXXXB(A)30------->

(4) Adding cDNA Primer Mix to amplify full length cDNA:


   5'- CTACACGACGCTCTTCCGATCT-------->
|--5'- CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATGGGXXXXXXXXX...XXXXXXXXXB(pA)GTACTCTGCGTTGATACCACTGCTT -3'
   3'- GATGTGCTGCGAGAAGGCTAGA[16-bp cell barcode][10-bp UMI]AAAGAATATACCCXXXXXXXXX...XXXXXXXXXV(dT)CATGAGACGCAACTATGGTGACGAA -5'
                                                                                               <--------GACGCAACTATGGTGACGAA -5'

(5) Use Fragmentase to fragment cDNA and perform A-tailing:


Product 1 (TSO, cell barcode, UMI plus 5'-end of cDNA):


5'-   CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATGGGXXXXXXXXX...XXXXXXXXX*A -3'
3'- A*GATGTGCTGCGAGAAGGCTAGA[16-bp cell barcode][10-bp UMI]AAAGAATATACCCXXXXXXXXX...XXXXXXXXX   -5'


Product 2 (middle of cDNA):


5'-   XXXXXXXXXXXXXXXXXXXXXXXXX...XXXXXXXXXXXXXXXXXXXXXXXXX*A -3'
3'- A*XXXXXXXXXXXXXXXXXXXXXXXXX...XXXXXXXXXXXXXXXXXXXXXXXXX   -5'


Product 3 (3' of cDNA plus Poly-dT RT sequence):


5'-   AAGCAGTGGTATCAACGCAGAGTAC(dT)VXXXXXXXXX...XXXXXXXXX*A -3'
3'- A*TTCGTCACCATAGTTGCGTCTCATG(pA)BXXXXXXXXX...XXXXXXXXX   -5'


(6) Add double stranded Illumina Truseq adapter with a T overhang (PN-220026) for ligation:


Product 1 (I assume the 5' end of Poly-dT RT primer is blocked, so the adapter can only be ligated to the cDNA end. This is the only ampliable fragment):


5'-   CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATGGGXXX...XXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
3'- A*GATGTGCTGCGAGAAGGCTAGA[16-bp cell barcode][10-bp UMI]AAAGAATATACCCXXX...XXXTCTAGCCTTCTCG -5'


Product 2 (will not amplify efficiently due to semi-suppressive PCR??? not really sure about this):


5'-                      GCTCTTCCGATCTXXXXXXXXXXXXXXXXXXXX...XXXXXXXXXXXXXXXXXXXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
3'- CACTGACCTCAAGTCTGCACACGAGAAGGCTAGAXXXXXXXXXXXXXXXXXXXX...XXXXXXXXXXXXXXXXXXXXTCTAGCCTTCTCG -5'


Product 3 (I assume the 5' end of TSO is blocked. This produce is not amplifiable due to the use of the specific primers for amplification, see the next step):


5'-   AAGCAGTGGTATCAACGCAGAGTAC(dT)VXXXXXXXXX...XXXXXXXXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
3'- A*TTCGTCACCATAGTTGCGTCTCATG(pA)BXXXXXXXXX...XXXXXXXXXTCTAGCCTTCTCG -5'



(7) Add Library PCR Primers 1 & 2 to amplify library:


5'-  AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTC--------->
                                   5'-   CTACACGACGCTCTTCCGATCT[16-bp cell barcode][10-bp UMI]TTTCTTATATGGGXXX...XXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -3'
                                   3'- A*GATGTGCTGCGAGAAGGCTAGA[16-bp cell barcode][10-bp UMI]AAAGAATATACCCXXX...XXXTCTAGCCTTCTCG                      -5'
                                                                                                                     <-----------TGTGCAGACTTGAGGTCAGTG[8-bp sample index]TAGAGCATACGGCAGAAGACGAAC -5'

(8) Final library structure:

V1 and V2 library:


5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNNNNNNNNNNNNNNNNNNNNNNNTTTCTTATATGGGXXX...XXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNNNNNNNNNNNNNNNNNNNNNNNNNAAAGAATATACCCXXX...XXXTCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'
          Illumina P5                   Truseq Read 1               16 bp       10 bp                   cDNA          Truseq Read 2                8 bp        Illumina P7
                                                                cell barcode     UMI                                                           Sample Index

V3 library:


5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTCTTATATGGGXXX...XXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAGAATATACCCXXX...XXXTCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'
          Illumina P5                   Truseq Read 1               16 bp        12 bp                   cDNA          Truseq Read 2                8 bp        Illumina P7
                                                                cell barcode      UMI                                                           Sample Index


Library sequencing:

(1) Add Truseq Read 1 primer to sequence the first read (bottom strand as template, sequence 16-bp cell barcode and UMI, 26 cycles for V1 and V2, 28 cycles for V3):


                         5'- ACACTCTTTCCCTACACGACGCTCTTCCGATCT------------------------>
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNNNNNNNNNNNNNNNNNNNNNNNNNAAAGAATATACCCXXX...XXXTCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'

(2) Add Sample Index sequencing primer to sequence the sample index (bottom strand as template):


                                                                                                           5'- GATCGGAAGAGCACACGTCTGAACTCCAGTCAC------->
3'- TTACTATGCCGCTGGTGGCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNNNNNNNNNNNNNNNNNNNNNNNNNAAAGAATATACCCXXX...XXXTCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTGNNNNNNNNTAGAGCATACGGCAGAAGACGAAC -5'

(3) Cluster regeneration, add Truseq Read 2 primer to sequence the second read (top strand as template, sequence cDNA, 98 cycles):


5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNNNNNNNNNNNNNNNNNNNNNNNTTTCTTATATGGGXXX...XXXAGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG -3'
                                                                                                        <-----TCTAGCCTTCTCGTGTGCAGACTTGAGGTCAGTG -5'