Conversation
```json
"errorMessage": "PacBio Index file for BAM subreads cannot contain spaces and must have extension '.bam.pbi' or be empty"
},
"reads": {
    "start_from": {
```
I think we can find a better name for this field
I have been scratching my head over naming this field.
I used 'entrypoint', but I didn't feel it was very clear for users.
'start_from' might not be the best name either, but at least its meaning is clear.
Do you have any suggestions?
I added a subworkflow to chunk the fasta files (from lima, isoseq refine, and the mapping start) before the mapping steps.
A new update of the CHUNKER applies it twice in the pipeline.
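A minimal sketch of what such a chunking subworkflow could look like, using Nextflow's built-in `splitFasta` operator. The subworkflow name, channel shape (`[meta, fasta]` tuples), and the `params.chunk_size` parameter are assumptions for illustration, not the PR's actual implementation:

```nextflow
// Hypothetical chunking subworkflow: split each fasta into fixed-size
// chunks so the downstream mapping step can run on them in parallel.
workflow CHUNK_FASTA {
    take:
    ch_fasta            // channel: [ meta, fasta ] (assumed shape)

    main:
    ch_chunks = ch_fasta
        // elem: 1 tells the splitter to operate on the fasta file,
        // the second element of the tuple; chunk size is a guess.
        .splitFasta(by: params.chunk_size ?: 1000, file: true, elem: 1)

    emit:
    chunks = ch_chunks  // channel: [ meta, fasta_chunk ]
}
```

Calling it twice in the pipeline, as the PR describes, would then just mean invoking `CHUNK_FASTA` on the channel feeding each of the two mapping steps.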
PR checklist
- [ ] Code lints (`nf-core lint`).
- [ ] Test suite passes (`nextflow run . -profile test,docker --outdir <OUTDIR>`).
- [ ] Debug profile runs (`nextflow run . -profile debug,test,docker --outdir <OUTDIR>`).
- [ ] `docs/usage.md` is updated.
- [ ] `docs/output.md` is updated.
- [ ] `CHANGELOG.md` is updated.
- [ ] `README.md` is updated (including new tool citations and authors/contributors).

IsoSeq providers deliver sequences in many different formats depending on the pre-processing they apply (Subreads, CCS, Full-Length isoseq). This is even more true with the new MAS-seq.
I had implemented the ability to deal with these formats through options. However, these options, combined with the ability to skip IsoSeq processing and alignment, made the samplesheet and the usage of the pipeline complex.
In this PR, I changed the way input sequences are injected into the pipeline. It is now possible to start the analysis from ccs, lima, isoseq refine, or the mapping step. The different types of inputs can even be mixed in the samplesheet.
This modification simplifies the usage as well as the code.
It is no longer necessary to deal with the different entrypoints separately: the input files are injected at the right moment into the main channel paths.
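One way to picture this injection mechanism is with Nextflow's `branch` operator. This is a hedged sketch, not the PR's code: the channel names, the `meta.start_from` field, and the stage labels (`ccs`, `lima`, `refine`, `map`) are assumptions based on the description above:

```nextflow
// Hypothetical routing of samplesheet rows by their starting stage.
// Each row is assumed to be a [meta, reads] tuple where meta.start_from
// names the stage the sample should enter the pipeline at.
ch_input
    .branch { meta, reads ->
        ccs:    meta.start_from == 'ccs'
        lima:   meta.start_from == 'lima'
        refine: meta.start_from == 'refine'
        map:    meta.start_from == 'map'
    }
    .set { ch_stages }

// Each branch is then mixed into the main channel just before the
// matching step, e.g. (hypothetically):
//   LIMA( CCS.out.bam.mix(ch_stages.lima) )
```

Because every sample enters the same main channel path at its own stage, the downstream processes never need to know which entrypoint a sample came from.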