Hello all,
We have recently begun working with 10x Xenium data, and have been comparing normalization methods for our pipeline. We have noticed oddities in how SCTransform behaves for the data in comparison to traditional scRNA-seq data. These data make us doubt the appropriateness of SCTransform for Xenium data, so we wanted to reach out to see your opinion.
The SCTransform adds a few columns to the Seurat object metadata including nCount_SCT. According to our understanding, nCount_SCT represents the total "normalized counts" for each cell, and contrasts nicely with the raw counts (nCount_RNA for scRNA-seq, and nCount_Xenium for 10x Xenium).
Plotting the raw counts (nCount_RNA) vs the nCount_SCT allows for a high-level comparison of how the model transformed the counts across cells.
- Using scRNA-seq data from your vignette (replicated as well using our own scRNA-seq experiments) yields a pattern similar to this:

- However, using Xenium data, also from your vignette, we see a stratified set of distributions

This issue is even stronger within our own data, with some samples showing more distinct separation within nCount_SCT.
When you look into spatial plotting, you can see even more strongly the concern.

There is a grid-like pattern within the physical image data post-SCTransformation, seemingly associated with the different "strata" in the SCT counts seen above. We see similar and stronger patterns within our own data following the same methodology.
This clearly cannot represent biological variation, given the patterning, and so we hope that you can provide some insight into whether this data is expected, and if so, why?
Lastly, when looking into the counts for specific genes, we saw that 0-count genes were given non-0 values following SCTransform as well. While this makes sense conceptually for scRNA-seq, we are unsure whether such count abundance estimates are appropriate for Xenium, as an image and in-situ hybridization-based technology.
Please let us know your thoughts on this as well. Thank you
Hello all,
We have recently begun working with 10x Xenium data, and have been comparing normalization methods for our pipeline. We have noticed oddities in how
SCTransformbehaves for the data in comparison to traditional scRNA-seq data. These data make us doubt the appropriateness ofSCTransformfor Xenium data, so we wanted to reach out to see your opinion.The
SCTransformadds a few columns to theSeuratobject metadata includingnCount_SCT. According to our understanding,nCount_SCTrepresents the total "normalized counts" for each cell, and contrasts nicely with the raw counts (nCount_RNAfor scRNA-seq, andnCount_Xeniumfor 10x Xenium).Plotting the raw counts (
nCount_RNA) vs thenCount_SCTallows for a high-level comparison of how the model transformed the counts across cells.This issue is even stronger within our own data, with some samples showing more distinct separation within
nCount_SCT.When you look into spatial plotting, you can see even more strongly the concern.
There is a grid-like pattern within the physical image data post-SCTransformation, seemingly associated with the different "strata" in the SCT counts seen above. We see similar and stronger patterns within our own data following the same methodology.
This clearly cannot represent biological variation, given the patterning, and so we hope that you can provide some insight into whether this data is expected, and if so, why?
Lastly, when looking into the counts for specific genes, we saw that 0-count genes were given non-0 values following SCTransform as well. While this makes sense conceptually for scRNA-seq, we are unsure whether such count abundance estimates are appropriate for Xenium, as an image and in-situ hybridization-based technology.
Please let us know your thoughts on this as well. Thank you