Counts seems incorrect #2
Open
Description
Hi!
I'm a computer engineering student, and my master's thesis is essentially about improving UST (see here if interested).
I wrote a simple C++ program that extracts the canonical kmers from the simplitigs and appends each kmer's count in sequence, using the UST output files.
Then I sorted the resulting kmer list and compared it with the one computed by Jellyfish-2.
The kmers are the same, but some of the counts differ. Can you confirm this?
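For reference, here is a minimal sketch of what "canonical kmer" means in this comparison. It is not the attached kmers-extractor, only an illustration. It assumes uppercase ACGT input and takes the canonical form to be the lexicographically smaller of a kmer and its reverse complement, which is my understanding of what Jellyfish's -C option counts. The function names are hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <vector>

// Complement of a single nucleotide; anything other than A/C/G/T maps to 'N'.
char complement(char base) {
    switch (base) {
        case 'A': return 'T';
        case 'C': return 'G';
        case 'G': return 'C';
        case 'T': return 'A';
        default:  return 'N';
    }
}

// Reverse the kmer, then complement every base.
std::string reverse_complement(const std::string& kmer) {
    std::string rc(kmer.rbegin(), kmer.rend());
    std::transform(rc.begin(), rc.end(), rc.begin(), complement);
    return rc;
}

// Assumption: canonical form = lexicographically smaller of kmer and its reverse complement.
std::string canonical(const std::string& kmer) {
    const std::string rc = reverse_complement(kmer);
    return std::min(kmer, rc);
}

// All canonical kmers of one sequence (e.g. one simplitig), in left-to-right order.
std::vector<std::string> canonical_kmers(const std::string& seq, std::size_t k) {
    std::vector<std::string> out;
    if (k == 0 || seq.size() < k) return out;
    for (std::size_t i = 0; i + k <= seq.size(); ++i) {
        out.push_back(canonical(seq.substr(i, k)));
    }
    return out;
}
```

Keeping the kmers in left-to-right order matters here, since the counts are attached to them sequentially.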
How to reproduce
Extract kmers and counts from ust output files:
g++ kmers-extractor.cpp -o kmers-extractor
./kmers-extractor <kmer-size> <ust-fasta> <ust-counts>
sort ust-kmers.txt -o ust-kmers-sorted.txt
Extract kmers and counts from starting sequence (not the bcalm one):
jellyfish-linux count -m <kmer-size> -C -s 100M -L 2 <starting-fasta>
jellyfish-linux dump -c mer_counts.jf > kmers.txt
sort kmers.txt -o kmers-sorted.txt
Compare the two files:
cmp kmers-sorted.txt ust-kmers-sorted.txt
kmers-extractor is attached.
Note that kmers with abundance 1 are ignored.