Conversation
…line - Parallelize area-at-the-glass computation using joblib with retry logic and exponential backoff for network resilience - Fix SOX2 half-maximal calculation to use first timepoint intensity value instead of max(), and convert to hours with *(30/60) - Add fixed-cell (single timepoint) image support in area computation - Add metadata-only movies (those without All Cells Mask) to final manifest - Vectorize normalized Z-plane subtraction (replace slow apply+lambda) - Reduce memory in area computation by passing only needed columns - Round migration onset times to nearest 0.5 hour for time grid alignment - Reorder output columns: key analysis columns first, then metadata - Include multi-channel intensity columns (Channel 2, Channel 3) in output - Add CLI arguments (--local, --local-csv) for local manifest loading - Save CSV without row index - Update io.py: add load_from_aws and local_path parameters to load_imaging_and_segmentation_dataset() - Add joblib>=1.3.0 to pyproject.toml dependencies - Add [tool.setuptools.packages.find] to fix flat-layout package discovery - Update README with gene metric definitions and pipeline details Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
vianamp
approved these changes
Feb 4, 2026
kiryteo
approved these changes
Feb 4, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Details
Parallel processing: add_bottom_mip_migration now processes movies in parallel via joblib.Parallel instead of sequential tqdm loop. Each movie is handled by _process_single_movie_area with up to 5 retries and exponential backoff (1s, 2s, 4s, 8s, 16s) for S3 network errors.
Bug fixes:
df_id.int_smooth.values[0] (first timepoint value) as the upper bound of the dynamic range, and converts to hours with *(30/60)
Pipeline improvements:
Test plan