There had been two transformative technologies in modern systems biology: genomics, which allows all of genes and proteins in an organism to be monitored simultaneously, and single cell biology, which follows a few specific genes in individual cells with high precision in their native micro-environments.  Both techniques are powerful, but have complementary limitations: genomics averages over the heterogeneity and spatial complexity of a cell population, and single cell techniques can only probe a few genes at a time.  Integrating genomics with single cell is the next major challenge in biology. Our lab sets out to synthesize the genomics and single cell approaches.

Single cell in situ profiling by sequential hybridization and barcoding
.   Our lab's current focus is to profile gene expression in single cells via in situ "sequencing" by FISH [link]. To detect individual mRNAs, we use single molecule fluorescence in situ hybridization (smFISH) with 20mer oligonucleotide probes complementary to the mRNA sequence. By putting up to 48 fluorophore labeled probes on an mRNA, single transcripts in cells become readily detectable in situ. We have shown that almost all mRNAs that can be detected are observed by smFISH .

To distinguish different mRNA species, we barcode mRNAs with the FISH probes using sequential rounds of hybridization. During a round of hybridization, each transcript is targeted by a set of FISH probes labeled with a single type of fluorophore. The sample is imaged and the FISH probes are removed by enzymatic digestion. Then the mRNA is hybridized in a subsequent round with the same FISH probes, but now labeled with a different dye (Fig. 1). As the transcripts are fixed in cells, the fluorescent spots corresponding to single mRNAs remain in place during multiple rounds of hybridization, and can be aligned to read out a color sequence. Each mRNA species is therefore assigned a unique barcode. Thus, the number of each transcript in a given cell can be determined by counting the number of the corresponding barcode.

Figure 1. Sequential barcoding FISH (seqFISH). (a) Schematic of sequential barcoding. In each round of hybridization, 24 probes are hybridized on each transcript, imaged and then stripped by DNAse I treatment. The same probe sequences are used in different rounds of hybridization, but probes are coupled to different fluorophores. (b) Schematic of the FISH images of the cell. In each round of hybridization, the same spots are detected, but the dye associated with the transcript changes. The identity of the mRNA is encoded in the temporal sequence of dyes hybridized. (c) Data from 3 rounds of hybridizations on six yeast cells. 12 genes are encoded by 2 rounds of hybridization, with the 3rd hybridization using the same probes as hybridization 1. The boxed regions are magnified in the bottom panels. The matching spots are shown and barcodes are extracted. Different barcodes encode for different transcripts. Note the first and third elements of the barcodes are the same, corresponding to the same probes used in hybridization 1 and 3. Spots without colocalization are due to nonspecific binding of probes as well as mis-hybridization. The number of each barcode corresponds to the abundance of transcripts in single cells.

The sequential FISH (seqFISH) scheme is conceptually akin to sequencing transcripts in single cells with FISH probes. Our method takes advantage of the high hybridization efficiency of FISH (>95% of the mRNAs are detected) and the fact that base pair resolution is usually not needed to identify a transcript. The number of barcodes available with this approach scales as F^N, where F is the number of distinct fluorophores and N is the number of hybridization rounds. With 5 distinct dyes and 3 rounds of hybridization, one can detect 125 unique genes. While in principle the entire transcriptome can be covered by 6 rounds of hybridization (56=15,625), super-resolution microscopy is needed to resolve all of the transcripts in the cell.


go to top

Caltech    last update: 07/13/17