Skip to content

Concern regarding method use for Xenium data #203

@LPotter21

Description

@LPotter21

Hello all,

We have recently begun working with 10x Xenium data, and have been comparing normalization methods for our pipeline. We have noticed oddities in how SCTransform behaves for the data in comparison to traditional scRNA-seq data. These data make us doubt the appropriateness of SCTransform for Xenium data, so we wanted to reach out to see your opinion.

The SCTransform adds a few columns to the Seurat object metadata including nCount_SCT. According to our understanding, nCount_SCT represents the total "normalized counts" for each cell, and contrasts nicely with the raw counts (nCount_RNA for scRNA-seq, and nCount_Xenium for 10x Xenium).

Plotting the raw counts (nCount_RNA) vs the nCount_SCT allows for a high-level comparison of how the model transformed the counts across cells.

  • Using scRNA-seq data from your vignette (replicated as well using our own scRNA-seq experiments) yields a pattern similar to this:

scRNA_SCTvsRawCounts

  • However, using Xenium data, also from your vignette, we see a stratified set of distributions

Screen Shot 2024-12-10 at 4 29 29 PM

This issue is even stronger within our own data, with some samples showing more distinct separation within nCount_SCT.

When you look into spatial plotting, you can see even more strongly the concern.

Screen Shot 2024-12-10 at 4 29 41 PM

There is a grid-like pattern within the physical image data post-SCTransformation, seemingly associated with the different "strata" in the SCT counts seen above. We see similar and stronger patterns within our own data following the same methodology.

This clearly cannot represent biological variation, given the patterning, and so we hope that you can provide some insight into whether this data is expected, and if so, why?

Lastly, when looking into the counts for specific genes, we saw that 0-count genes were given non-0 values following SCTransform as well. While this makes sense conceptually for scRNA-seq, we are unsure whether such count abundance estimates are appropriate for Xenium, as an image and in-situ hybridization-based technology.

Please let us know your thoughts on this as well. Thank you

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions