-
Notifications
You must be signed in to change notification settings - Fork 2
Generate sequence structure mapping files for any organism #53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
jmbilodeaux
wants to merge
45
commits into
seq2struct
Choose a base branch
from
main
base: seq2struct
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Too chirpy. Run `snakefmt workflow/rules/*`
Member
|
I am a little confused. This PR seems to only have recent changes that I have already merged into main. |
Collaborator
Author
|
@jayhesselberth hmm.. okay im not sure what went wrong. let me retry |
Closes #40 * Set file extension * Eliminate input_format option
And reorganize rule files and summary outputs
Always build from source. The linux dist doesn't run on bodhi because glibc is out of date.
And snakefmt
Removing it causes it to fail.
TODO: compute these values directly from the BAM file.
Co-authored-by: Claude <noreply@anthropic.com>
* Add comprehensive CI/CD build and test checks This commit implements a complete testing and continuous integration infrastructure for the aa-tRNA-seq-pipeline: New CI/CD Components: - GitHub Actions CI workflow (.github/workflows/ci.yml) - Snakemake syntax validation with dry-run - Pipeline integration testing (merge_pods rule) - Configuration file validation - Automated tool setup (dorado, modkit) - GitHub Actions Lint workflow (.github/workflows/lint.yml) - Snakemake formatting checks (snakefmt) - Python code quality checks (black, flake8) - Markdown linting - YAML linting - Pre-commit hooks configuration (.pre-commit-config.yaml) - Automatic code quality checks before commits - Python formatting and linting - Snakemake formatting - General file hygiene checks - Local test script (.tests/run_local_tests.sh) - Quick validation script for local development - Checks Snakemake installation and syntax - Validates configuration files - Verifies directory structure Documentation: - Comprehensive testing guide (.github/TESTING.md) - CI/CD workflow documentation - Local testing instructions - Troubleshooting guide - Contributing guidelines - Updated README with CI status badges The CI workflow runs on all pushes and pull requests to main branches and claude/** branches. GPU-intensive rules are not tested in CI due to runner limitations but can be tested locally or on HPC clusters. * Fix CI workflow by replacing deprecated Mambaforge with Miniforge3 The Mambaforge variant is no longer available (404 error), causing the Snakemake Syntax Check to fail. Updated to use Miniforge3 as recommended by the setup-miniconda action warning. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: lkwhite <laura.k.white@cuanschutz.edu>
This completes the test and build check infrastructure by adding: - SessionStart hook that automatically sets up the conda environment - Validates Snakemake installation and config files - Installs pre-commit hooks if not present - Runs quick syntax checks on session start - Comprehensive documentation in .claude/SETUP.md The hook ensures developers have a properly configured environment before starting work, reducing setup friction and catching issues early. Co-authored-by: Claude <noreply@anthropic.com>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
test data is charged sequencing data from T4 infection of WT E. coli.
test.py adds the charged adapter sequence to the 5' or 3' end of each sequence. This needs to be fixed since we use 'charged' and 'uncharged' adapters that have different 3' sequences.
The input is a converted .sto file (esl-reformat sto -> afa) containing conserved secondary structure across tRNAs, including phage. The reference sequence is used to create a mapping of sequence to structure position that can be used to correct nanopore sequence space position to structural position.