Skip to content

Run with 47K rows, 566 columns and 48 batches takes 5+ hour and it's still running #2

@fbrundu

Description

@fbrundu

I am running combat.py and I don't know if it is normal that with 47K genes, 566 samples and 48 batches it is taking more than 5 hours to terminate.
Can I help with something? Data is pretty confidential but I will provide informations if possible..
R version terminates in less than 1 hour.
I terminated combat.py, this is the output:

found 48 batches
found 0 numerical covariates...
found 0 categorical variables:

Standardizing Data across genes.
Fitting L/S model and finding priors
Finding parametric adjustments
^CTraceback (most recent call last):
  File "rm_batches.py", line 45, in <module>
    main()
  File "rm_batches.py", line 40, in main
    ebat = combat.combat(dat, bat['Batch'], None)
  File "/home/unsel/Dropbox/poli/pb/class/xeno/combat.py", line 99, in combat
    delta_hat[i], gamma_bar[i], t2[i], a_prior[i], b_prior[i])
  File "/home/unsel/Dropbox/poli/pb/class/xeno/combat.py", line 134, in it_sol
    sum2 = ((sdat - np.dot(g_new.reshape((g_new.shape[0], 1)), np.ones((1, sdat.shape[1])))) ** 2).sum(axis=1)
KeyboardInterrupt
^C
real    2767m37.090s
user    369m50.351s
sys 0m12.489s

Hope it helps..

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions