This repo contains my various proofs of concept for a pre-processing algorithm that reduces memory usage in DNA searches, in particular searches for start and stop codons. PoC-14.py is the final pre-processing algorithm with hashing. PoC-14Exact.py pairs PoC-14 with exact matching, PoC-14Hamming.py pairs it with Hamming distances, and PoC-14L.py pairs it with Levenshtein distances.
The UW System Library (MINDS @ UW) record with full text can be found here.
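
For readers unfamiliar with the three matching modes named above, here is a minimal sketch of how exact, Hamming, and Levenshtein comparisons differ on a short codon search. It is not taken from the PoC-14 scripts and does not reproduce the pre-processing or hashing step; the function names (`hamming`, `levenshtein`, `find_matches`) and the sample sequences are assumptions made for illustration only.

```python
def hamming(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def levenshtein(a: str, b: str) -> int:
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x
                curr[j - 1] + 1,         # insert y
                prev[j - 1] + (x != y),  # substitute (free if equal)
            ))
        prev = curr
    return prev[-1]


def find_matches(sequence: str, pattern: str, max_dist: int = 0) -> list[int]:
    """Return start indices of windows within max_dist mismatches of pattern.

    max_dist=0 is exact matching; larger values allow Hamming-style mismatches.
    """
    k = len(pattern)
    return [
        i for i in range(len(sequence) - k + 1)
        if hamming(sequence[i:i + k], pattern) <= max_dist
    ]


if __name__ == "__main__":
    dna = "CCATGAATGA"  # hypothetical sample sequence
    print(find_matches(dna, "ATG"))        # exact matches of the start codon
    print(find_matches("ATGTTG", "ATG", max_dist=1))
    print(levenshtein("ATG", "AG"))
```

Hamming distance only compares strings of equal length, so it suits fixed-width windows such as codons, while Levenshtein distance also tolerates insertions and deletions at a higher computational cost.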