Releases · ArcInstitute/cell-load

21 Feb 16:13

v0.10.3

6866990

v0.10.3 Latest

Latest

Adds a consecutive data loading option for training on huge datasets. This packs cell sets so that within a condition, they are consecutive on disk, leading to around a 3x improvement for HVG training (e.g. output space = gene), and closer to 12-15x improvement for full transcriptome training (output space = all). Code will error if underlying data is not sorted by condition

Assets 2

23 Jan 21:45

abhinadduri

v0.8.7

34a61cc

v0.8.7

updates batch codes to be stored as categoricals in metadata cache, if not already done, to properly deal with codes across multiple datasets

Assets 2

10 Dec 05:19

abhinadduri

v0.8.6

59d4594

v0.8.6

defaults to using /var/feature if /var/_index is not available in anndata files

Assets 2

21 Sep 22:35

abhinadduri

v0.8.5

1702978

v0.8.5

see v0.8.4. hotfix to fix small bug

Assets 2

21 Sep 21:58

abhinadduri

v0.8.4

b323bfd

v0.8.4

enables training on all data (no test subset required)

leave toml arrays empty for this functionality

Assets 2

17 Sep 14:18

abhinadduri

v0.8.3

1536769

fix reversion in vcc

resolved by community user @bzrry, this fixes a reversion in the vcc notebook for the zeroshot setting: #64

Contributors

bzrry

Assets 2

17 Sep 01:28

abhinadduri

v0.8.2

c8e1bff

allow embedding as output space

this adds an 'embedding' option for data.kwargs.output_space. if set, the getitem call only yields embeddings and not counts. this is paired with a new option in state where users can train only on embedding spaces (no need to force a decoder to counts)

also fixes a bug with filter_on_target_knockdown

Assets 2

16 Sep 20:06

Rive-001

v0.8.1

029d123

Enabling training on observational data

What's Changed

Fix indexing in batch map strategy by @fctb12 in #61
Speed up batch mapping by @fctb12 in #62
fix: Enable training on observational data for zero shot and few shot by @Rive-001 in #60