multithreading bugfix; memory efficient string alleles for GWAS #212

Merged
olivierlabayle merged 7 commits into main from gwas_features
Feb 13, 2026
Conversation

@joshua-slaughter
Member

v 0.12.1

Fixes a multithreading bug in input building that occurs when there are fewer variants than threads. This can happen more often than expected with a sparse BED file or when a large number of variants do not pass QC.
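
The PR text does not show the fix itself. As a minimal sketch of how this class of bug can be guarded against, assuming the work is split into one chunk of variant indices per thread, the chunk count can be capped at the number of items so that no thread receives an empty chunk. `chunk_ranges` is a hypothetical helper for illustration, not a function from TargeneCore.jl.

```julia
# Hypothetical helper, not TargeneCore.jl's actual implementation.
# Splits 1:n_items into at most n_chunks contiguous, non-empty ranges.
function chunk_ranges(n_items::Int, n_chunks::Int)
    n_items == 0 && return UnitRange{Int}[]
    # Cap the number of chunks at the number of items so no chunk is empty.
    n = min(n_items, max(n_chunks, 1))
    base, extra = divrem(n_items, n)
    ranges = UnitRange{Int}[]
    start = 1
    for i in 1:n
        # The first `extra` chunks take one additional item.
        len = base + (i <= extra ? 1 : 0)
        push!(ranges, start:(start + len - 1))
        start += len
    end
    return ranges
end

# Illustrative workload: three variants, possibly many more threads.
variants = ["rs1", "rs2", "rs3"]
results = Vector{String}(undef, length(variants))
Threads.@threads for r in chunk_ranges(length(variants), Threads.nthreads())
    for i in r
        results[i] = uppercase(variants[i])
    end
end
```

With three variants and eight requested chunks, the helper returns three single-item ranges rather than eight ranges of which five are empty.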

Furthermore, we now use PooledArrays for the columns created in the genotype dataset, which matches the memory burden of the former UInt8 implementation.
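
To illustrate why pooled string columns can match a UInt8 encoding in memory, here is a small sketch using PooledArrays.jl. It assumes the references are compressed with `compress=true`; the PR does not state which options are used, and the genotype values are made up for the example.

```julia
using PooledArrays

# Illustrative genotype column with only three distinct string values.
genotypes = rand(["AA", "AG", "GG"], 100_000)

# Each distinct string is stored once in the pool; every element is a small
# integer reference. compress=true picks the smallest reference type that fits.
pooled = PooledArray(genotypes; compress=true)

ref_type = eltype(pooled.refs)            # UInt8 for so few distinct values
plain_bytes = Base.summarysize(genotypes)
pooled_bytes = Base.summarysize(pooled)
```

The pooled column still behaves like a vector of strings, so downstream code can read allele strings directly instead of decoding integer codes.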

@joshua-slaughter joshua-slaughter linked an issue Jan 29, 2026 that may be closed by this pull request
Member

@olivierlabayle olivierlabayle left a comment

Could you add some tests to support the changes?

Comment thread src/TargeneCore.jl
Comment thread test/inputs_from_gwas_config.jl
Comment thread test/inputs_from_gwis_config.jl
…(faster counting available in DataFrames.jl)
@joshua-slaughter
Member Author

@olivierlabayle Sorry about the tests. I thought the change was straightforward, but testing and experimentation did turn up bugs 😆 Tests have been implemented and behavior is now as expected: controls are always the major homozygous genotype, and cases are the addition of the "alternate" allele based on dataset frequencies. I also found that including StatsBase.jl was completely unnecessary, as there are faster level counting methods in DataFrames.jl.
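
The comment does not say which DataFrames.jl method replaced StatsBase.jl. One common way to count levels with DataFrames.jl alone is `groupby` combined with `nrow`, sketched below on a made-up genotype column.

```julia
using DataFrames

# Toy genotype column; the values are illustrative only.
df = DataFrame(genotype = ["AA", "AG", "AA", "GG", "AA", "AG"])

# Count rows per genotype level with groupby + nrow.
counts = combine(groupby(df, :genotype), nrow => :count)

# Put the most frequent genotype first.
sort!(counts, :count, rev = true)
most_common = first(counts.genotype)
```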

@joshua-slaughter joshua-slaughter linked an issue Feb 10, 2026 that may be closed by this pull request
@joshua-slaughter
Member Author

@olivierlabayle Should be good to go here!

Member

@olivierlabayle olivierlabayle left a comment

@olivierlabayle olivierlabayle merged commit 7e0e792 into main Feb 13, 2026
5 checks passed

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Multithreading in inputs_from_config.jl bug
Genotype changes in GWAS are ambiguous

2 participants