Skip to content

Question regarding speed of execution  #129

@rdauria

Description

@rdauria

I mange applications on a research cluster and our researchers have been reporting issues with execution speed of your software on out cluster.

I have just run through the first step of the tutorial (https://github.com/snayfach/MIDAS/blob/master/docs/tutorial.md) and I wonder whether you could let me know whether the timing (see below) we are getting are exceedingly long.

I am running the code a HPC network storage and on one core of a Intel(R) Xeon(R) Gold 6240 node (I can also provide timings on the following steps in the tutorial if that would help). Moving the DB, the sample file and the output directory to local storage did not seem to affect the speed significantly.

Thanks and please see the timings for the first step of the tutorial below:

/usr/bin/time -p -v run_midas.py species midas_output/SAMPLE_1 -1 /u/local/apps/midas/EXAMPLE/example/sample_1.fq.gz --remove_temp 

MIDAS: Metagenomic Intra-species Diversity Analysis System
version 1.3.0; github.com/snayfach/MIDAS
Copyright (C) 2015-2016 Stephen Nayfach
Freely distributed under the GNU General Public License (GPLv3)

===========Parameters===========
Command: /u/local/apps/midas/1.3.2/MIDAS/scripts/run_midas.py species midas_output/SAMPLE_1 -1 /u/local/apps/midas/EXAMPLE/example/sample_1.fq.gz --remove_temp
Script: run_midas.py species
Database: /u/local/apps/midas/DB/midas_db_v1.2
Output directory: midas_output/SAMPLE_1
Input reads (unpaired): /u/local/apps/midas/EXAMPLE/example/sample_1.fq.gz
Remove temporary files: True
Word size for database search: 28
Minimum mapping alignment coverage: 0.75
Number of reads to use from input: use all
Number of threads for database search: 1
================================

Aligning reads to marker-genes database
  0.66 minutes
  0.75 Gb maximum memory

Classifying reads
  total alignments: 2916
  uniquely mapped reads: 1013
  ambiguously mapped reads: 47
  0.0 minutes
  0.76 Gb maximum memory

Estimating species abundance
  total marker-gene coverage: 10.637
  0.0 minutes
  0.76 Gb maximum memory
	Command being timed: "run_midas.py species midas_output/SAMPLE_1 -1 /u/local/apps/midas/EXAMPLE/example/sample_1.fq.gz --remove_temp"
	User time (seconds): 43.04
	System time (seconds): 6.53
	Percent of CPU this job got: 118%
	Elapsed (wall clock) time (h:mm:ss or m:ss): 0:41.71
	Average shared text size (kbytes): 0
	Average unshared data size (kbytes): 0
	Average stack size (kbytes): 0
	Average total size (kbytes): 0
	Maximum resident set size (kbytes): 646160
	Average resident set size (kbytes): 0
	Major (requiring I/O) page faults: 0
	Minor (reclaiming a frame) page faults: 83162
	Voluntary context switches: 18072
	Involuntary context switches: 1161
	Swaps: 0
	File system inputs: 391032
	File system outputs: 904
	Socket messages sent: 0
	Socket messages received: 0
	Signals delivered: 0
	Page size (bytes): 4096
	Exit status: 0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions