Concern regarding method use for Xenium data

Hello all,

We have recently begun working with 10x Xenium data and have been comparing normalization methods for our pipeline. We have noticed oddities in how `SCTransform` behaves on these data compared with traditional scRNA-seq data. These observations make us doubt whether `SCTransform` is appropriate for Xenium data, so we wanted to reach out for your opinion.

`SCTransform` adds a few columns to the `Seurat` object metadata, including `nCount_SCT`. As we understand it, `nCount_SCT` is the total "normalized counts" for each cell, which contrasts nicely with the raw counts (`nCount_RNA` for scRNA-seq and `nCount_Xenium` for 10x Xenium).

Plotting the raw counts (`nCount_RNA` or `nCount_Xenium`) against `nCount_SCT` gives a high-level view of how the model transformed the counts across cells.
- Using scRNA-seq data from your vignette (and reproduced with our own scRNA-seq experiments), we see a pattern similar to this:

![scRNA_SCTvsRawCounts](https://github.com/user-attachments/assets/107d3f8c-1094-4dea-bf19-079202be060c)

- However, using Xenium data, also from your vignette, we see a stratified set of distributions:

![Screen Shot 2024-12-10 at 4 29 29 PM](https://github.com/user-attachments/assets/66eae34b-e822-4705-9cad-6be3b1899439)

This effect is even stronger in our own data, with some samples showing more distinct separation in `nCount_SCT`.
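
For anyone who wants to reproduce this kind of comparison, here is a minimal sketch of how such a scatter plot could be drawn from the per-cell metadata. The helper name `plot_raw_vs_sct` and its arguments are hypothetical, and this is not necessarily the exact code behind the figures above; it only assumes a metadata data frame that contains both a raw-count column and `nCount_SCT`.

```r
library(ggplot2)

# Hypothetical helper: scatter per-cell raw totals against SCT totals.
# `md` is a per-cell metadata data.frame, e.g. obtained via `obj[[]]`
# from a Seurat object that has already been run through SCTransform().
# `raw_col` would be "nCount_RNA" for scRNA-seq or "nCount_Xenium" for Xenium.
plot_raw_vs_sct <- function(md, raw_col = "nCount_Xenium", sct_col = "nCount_SCT") {
  stopifnot(all(c(raw_col, sct_col) %in% colnames(md)))
  df <- data.frame(raw = md[[raw_col]], sct = md[[sct_col]])
  ggplot(df, aes(x = raw, y = sct)) +
    geom_point(size = 0.3, alpha = 0.3) +
    labs(x = raw_col, y = sct_col)
}
```

Usage would look like `plot_raw_vs_sct(xenium.obj[[]])`, where `xenium.obj` is an assumed name for the Xenium `Seurat` object.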

The concern becomes even clearer when the values are plotted spatially.

![Screen Shot 2024-12-10 at 4 29 41 PM](https://github.com/user-attachments/assets/681d7f4b-8162-46ec-9023-822e734dc4ea)

There is a grid-like pattern in the spatial plot after `SCTransform`, seemingly associated with the different "strata" in the SCT counts seen above. We see similar and stronger patterns in our own data following the same methodology.
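
A spatial view like the one above could be produced roughly as follows. This is a sketch under assumptions: the wrapper name `plot_sct_totals_spatial` is hypothetical, and it assumes that Seurat's `ImageFeaturePlot` accepts a metadata column such as `nCount_SCT` as a feature for an object with a Xenium field of view.

```r
# Hypothetical wrapper: colour each cell in the spatial image by a metadata column.
# `obj` is assumed to be a Seurat object with a spatial FOV (e.g. loaded with
# LoadXenium()) that has already been run through SCTransform().
plot_sct_totals_spatial <- function(obj, feature = "nCount_SCT") {
  stopifnot(feature %in% colnames(obj[[]]))
  Seurat::ImageFeaturePlot(obj, features = feature)
}
```

Plotting `nCount_Xenium` the same way, by passing `feature = "nCount_Xenium"`, gives a side-by-side check of whether the grid is already present in the raw totals.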

Given the patterning, this clearly cannot represent biological variation, so we hope you can provide some insight into whether this behavior is expected and, if so, why.

Lastly, when looking at the counts for specific genes, we saw that genes with raw counts of 0 were given non-zero values after `SCTransform`. While this makes sense conceptually for scRNA-seq, we are unsure whether such abundance estimates are appropriate for Xenium, an imaging-based in situ hybridization technology.
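
One way to quantify this is to count, per gene, the cells whose raw count is 0 but whose SCT value is non-zero. The sketch below is illustrative only: the helper name `zero_to_nonzero_per_gene` is hypothetical, and it assumes two gene-by-cell sparse matrices with matching row and column names.

```r
library(Matrix)

# Hypothetical helper: for each gene, count cells where the raw count is 0
# but the SCT value is non-zero. `raw` and `sct` are gene x cell sparse
# matrices with dimnames; only shared genes and cells are compared.
zero_to_nonzero_per_gene <- function(raw, sct) {
  genes <- intersect(rownames(raw), rownames(sct))
  cells <- intersect(colnames(raw), colnames(sct))
  raw_nz <- raw[genes, cells, drop = FALSE] != 0
  sct_nz <- sct[genes, cells, drop = FALSE] != 0
  # non-zero in SCT, minus those that were also non-zero in the raw data
  Matrix::rowSums(sct_nz) - Matrix::rowSums(sct_nz & raw_nz)
}
```

With Seurat v5, the two inputs could be pulled with `GetAssayData(obj, assay = "Xenium", layer = "counts")` and `GetAssayData(obj, assay = "SCT", layer = "counts")`; the assay name `"Xenium"` and the choice of the `counts` layer are assumptions here.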

Please let us know your thoughts on this as well. Thank you!
