This repository was archived by the owner on Feb 16, 2019. It is now read-only.

Error in convertGenbank2table.py if /pseudo in genbank #81

@michoug

Dear All,

I was parsing a GenBank file with convertGenbank2table.py (more precisely, a gbff file: ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/009/365/GCF_000009365.1_ASM936v1/GCF_000009365.1_ASM936v1_genomic.gbff.gz). Because some genes are annotated as /pseudo, I get several errors such as:

WARNING: CDS found with no translation
Qualifiers:
locus_tag ['ABO_RS05435']
inference ['EXISTENCE: similar to AA sequence:RefSeq:WP_011588335.1']
old_locus_tag ['ABO_1051']
codon_start ['1']
pseudo ['']
exception ['unextendable partial coding region']
transl_table ['11']
note ['Derived by automated computational analysis using gene prediction method: Protein Homology.']

with coordinates : complement(<1191967..1192467)

and then

BAD - the gene in the following location did not have a match in the table file!
<1191966:1192467
ATGCCAAAACGGATATTCCGCAAATACCTGCCAACACCCGAACGTATTCGACAAACCAAGTCGTTGAGTTTTCTCGGGGAAGTGCTATCGGACCCAAACCTTTGGCACATTAACCGTCGTTCACTGGCCGGCGCAGCCTTTATCGGTATCTTCTCCGGTCTGTTACCTATCCCCCTGCAAATGGGGTTGGCCGCCCTGTTGGCGGTGCGCTTCCACTGCAACCTGCCGCTTTCGATTATGCTGGTATGGATTTCCAATCCAGTGACTTATGTGCCGATTTTCTATTTCACCTACCGCATTGGCGCCTGGCTGCTGGGAATGCCGCCCCACAGCGGTGAAGGCATCACCGTCGCCTGGTTTGTGGAACAGCTCATCCCATTATGGGTAGGTTCAATGCTATGCGCATTCGGGTTTGGCGGATTAGCGTACATGGCAGTAAAAGTCAGCTGGCGGCTAGCTGTAATTCGAAGTTGGAATCTACGCGCGCACCGGCGCGCACGG
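
In case it helps, here is a minimal sketch of the kind of check that might avoid both messages. It assumes the CDS qualifiers are available as a dict of lists, as in the warning printed above. The function name `classify_cds` and the idea of skipping pseudo entries are my own assumptions, not something taken from convertGenbank2table.py.

```python
def classify_cds(qualifiers):
    """Classify a CDS from its qualifier dict (values are lists of strings).

    Hypothetical helper, not part of convertGenbank2table.py.
    Returns "ok", "pseudo", or "missing_translation".
    """
    translation = qualifiers.get("translation", [""])
    if translation and translation[0]:
        return "ok"
    # /pseudo and /pseudogene entries normally carry no /translation,
    # so they can be reported separately instead of raising a warning.
    if "pseudo" in qualifiers or "pseudogene" in qualifiers:
        return "pseudo"
    return "missing_translation"


# Qualifiers copied (abridged) from the warning above.
example = {
    "locus_tag": ["ABO_RS05435"],
    "old_locus_tag": ["ABO_1051"],
    "codon_start": ["1"],
    "pseudo": [""],
    "transl_table": ["11"],
}

print(classify_cds(example))  # prints "pseudo"
```

With something like this, a CDS classified as "pseudo" could be skipped or written to the table with an empty protein sequence, so that the later lookup by location does not fail.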

Best
Greg
