Headers and data rows do not match up in batch_90-samples_Lowcov.bed #4

@bpow

There are 11 items in the header description of this file (chr, start, stop, length, gene, prev, synonymous, refseq, ccds, vega, miRNA) but 13 items per row (based on there being 12 delimiters per data line).

  1. There is an extra tab between the data for 'length' and 'gene' for each row.
  2. There is a column at the end which happens to be 'length'-1, but does not have a corresponding header.
  3. The second point raises another issue: depending on whether 0-based half-open or 1-based closed intervals are used, the length of the region is either the final column (which has no header) or the column actually marked 'length'. Since this is a BED file, it would probably be best to stick with the BED convention and use 0-based half-open coordinates, but at the very least the choice should be documented.
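
For anyone who wants to reproduce the mismatch, here is a rough sketch in Python. It assumes the file is tab-delimited with a single header line; the sample header and data row below are made up for illustration (only the column names and the 11-versus-13 field counts come from this report), and `count_fields`, `mismatched_rows`, and `interval_length` are hypothetical helper names, not part of any existing tool.

```python
def count_fields(line, delimiter="\t"):
    """Number of delimiter-separated fields in one line (line ending ignored)."""
    return len(line.rstrip("\r\n").split(delimiter))


def mismatched_rows(header_line, data_lines, delimiter="\t"):
    """Return (row_number, field_count) for rows whose count differs from the header."""
    expected = count_fields(header_line, delimiter)
    return [
        (number, count_fields(row, delimiter))
        for number, row in enumerate(data_lines, start=1)
        if count_fields(row, delimiter) != expected
    ]


def interval_length(start, stop, zero_based_half_open=True):
    """Length of [start, stop) in BED-style coordinates, or of [start, stop] if 1-based closed."""
    return stop - start if zero_based_half_open else stop - start + 1


# Illustrative data only: column names are from this issue, the values are invented.
header = "\t".join(
    ["chr", "start", "stop", "length", "gene", "prev",
     "synonymous", "refseq", "ccds", "vega", "miRNA"]
)
# Extra empty field after 'length' and an unlabeled trailing 'length'-1 column.
row = "\t".join(
    ["chr1", "100", "200", "101", "", "GENE1",
     ".", ".", ".", ".", ".", ".", "100"]
)

print(count_fields(header), count_fields(row))
print(mismatched_rows(header, [row]))
print(interval_length(100, 200), interval_length(100, 200, zero_based_half_open=False))
```

In this invented row, the 'length' column matches the 1-based closed reading and the unlabeled last column matches the 0-based half-open reading, which is the ambiguity described in point 3.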
