Description
I'm very interested in implementing SCASA. I find it a very interesting method.
I'm trying to implement SCASA with gencode 44 as a reference. To do this, I'm trying to create the necessary files to be able to use this reference: Xmatrix, etc. To do this, I'm trying to replicate the algorithm explained at https://github.com/eudoraleer/scasa/wiki/How-to-run-Scasa-for-a-new-annotation
However, I haven't been able to fully execute it.
First, the Rscript code $scasaPath/gen_tx2gene.R is executed.
After executing this code, the following files are generated:
ref.sqlite
ref.tsv
ref.clean.fa
tx_removed.fa (the latter is not explained as far as I know and is the cause of an error in the code).
On line 106: names(tx.export.fasta)=paste0(names(tx.export.fasta)," ","Unknown-Gene-Name")
If all the cdna_tx are in genes.tx.all$TXNAME, it gives an error.
Also, line 95 is confusing:
cat("not in gtf: ",table(!p)) # some are not matched
If P has only one value and is TRUE, the cat returns as if all are not matched. It would be more understandable to use length(which(p==FALSE)).
I was able to resolve this error without any problems.
The problem comes with the following part of the code:
perl $scasaPath/SCRIPTS/SCASA_WRAP_V1.0.0.pl --createxmatrix YES --xmatrix $xmatrixFile --ref $refCleanFile --whitelist $whiteListFn
I get a lot of errors about not being able to create a folder that already exists, and it just gets stuck and doesn't move forward.
The error that appears is the following:
SCASA_WRAP_V1.0.0.pl --createxmatrix YES \
> --xmatrix $pwd/scasa_ref/Xmatrix.RData \
> --ref $pwd/scasa_ref/gencode.v44.transcripts.clean.fa \
> --whitelist $pwd/whitelist.txt
###############################################################
# SCASA V1.0.1
# SINGLE CELL TRANSCRIPT QUANTIFICATION TOOL
# Version Date: 2022-03-24
# FOR ANY ISSUES, CONTACT: LU.PAN@KI.SE
# https://github.com/eudoraleer/scasa/
###############################################################
Directory ./ already exists. Writing into existing directory..
mkdir: cannot create directory './/SCASA_My_Project_20250603141017/': File exists
Warning message:
In dir.create(output_dir, recursive = T) :
'.//SCASA_My_Project_20250603141017/0PRESETS' already exists
Error in .Call2("C_solve_user_SEW", refwidths, start, end, width, translate.negative.coord, :
row solving 16964: negative values are not allowed in 'width'
Calls: simulate_experiment_countmat ... make_IRanges_from_windows_args -> solveUserSEW -> .Call2
Execution halted
ls: cannot access './/SCASA_My_Project_20250603141017/0PRESETS//SIMULATE_XMATRIX//*_1.fasta': No such file or directory
I hope you can answer my question.
Best regards,
Juan