Some of the data making its way into the pipeline will be .vcf files. In most cases attempts will be made to use raw gentoype data, but in some cases only .vcf files will be available. I propose that we add a step in the CombineGeneticDatasets workflow that accepts vcf file inputs before subselecting the common variants.
This is a long-term goal as the number of patients being added to the pipeline in this way is very small (<50) and manual (documented) steps can be taken to merge these data.
Some of the data making its way into the pipeline will be .vcf files. In most cases attempts will be made to use raw gentoype data, but in some cases only .vcf files will be available. I propose that we add a step in the CombineGeneticDatasets workflow that accepts vcf file inputs before subselecting the common variants.
This is a long-term goal as the number of patients being added to the pipeline in this way is very small (<50) and manual (documented) steps can be taken to merge these data.