Transform BAM alignments to a normalized BigWig (or HDF5-stored) coverage track.

| Arguments | Type | Description |
|---|---|---|
| bam_file | BAM/CRAM | Alignments file from which to extract coverage. |
| output_file | BigWig/HDF5 | Output coverage track, stored as BigWig or HDF5 according to the file suffix (".bw" selects BigWig). |
| hdf5_file | HDF5 | Output HDF5 file with train_in/train_out, test_in/test_out and many other keys. |
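
The suffix rule for `output_file` can be sketched in a few lines. This is only an illustration, not the script's actual logic: the helper name `coverage_format` is hypothetical, and treating every non-".bw" suffix as HDF5 is an assumption.

```python
from pathlib import Path


def coverage_format(output_file: str) -> str:
    """Pick a storage format from the output file suffix.

    Assumption: ".bw" means BigWig and anything else falls back to HDF5.
    """
    suffix = Path(output_file).suffix.lower()
    return "bigwig" if suffix == ".bw" else "hdf5"


# Hypothetical file names, used only to exercise the helper.
bw_choice = coverage_format("sample1.bw")
h5_choice = coverage_format("cov/sample1.h5")
```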

Combine a set of coverage tracks stored as BigWig or HDF5 into a single file for training and testing, parallelizing over samples per-segment using multiprocessing on a single machine.

| Arguments | Type | Description |
|---|---|---|
| fasta_file | FASTA | FASTA file of chromosome sequences. |
| sample_wigs_file | Text table | Sample labels and paths to coverage files. |
| hdf5_file | HDF5 | Output HDF5 file with train_in/train_out, test_in/test_out and many other keys. |
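
The exact layout of `sample_wigs_file` is not spelled out above. As a rough sketch, assuming a tab-separated table with the sample label in the first column and the coverage path in the second, it could be read as follows. The function name `read_sample_table`, the comment handling, and the example rows are all hypothetical.

```python
import csv
import io


def read_sample_table(handle):
    """Parse rows of (label, coverage path) from a tab-separated table.

    Assumption: column 1 is the sample label, column 2 the coverage file path.
    """
    samples = []
    for row in csv.reader(handle, delimiter="\t"):
        # Skip blank lines and comment lines starting with "#".
        if not row or row[0].startswith("#"):
            continue
        if len(row) < 2:
            raise ValueError(f"expected label and path, got: {row!r}")
        samples.append((row[0].strip(), row[1].strip()))
    return samples


# Hypothetical table contents standing in for a real file on disk.
example = io.StringIO("# label\tpath\nliver\tcov/liver.bw\n\nheart\tcov/heart.h5\n")
samples = read_sample_table(example)
```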

Combine a set of coverage tracks stored as BigWig or HDF5 into a single file for training and testing, parallelizing over samples on our SLURM cluster.

| Arguments | Type | Description |
|---|---|---|
| fasta_file | FASTA | FASTA file of chromosome sequences. |
| sample_wigs_file | Text table | Sample labels and paths to coverage files. |
| hdf5_file | HDF5 | Output HDF5 file with train_in/train_out, test_in/test_out and many other keys. |
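
To illustrate what the `fasta_file` input provides, the sketch below collects chromosome names and lengths from a FASTA stream using only the standard library. It is not taken from the scripts themselves; the helper name `fasta_lengths` and the toy sequences are assumptions for the example.

```python
import io


def fasta_lengths(handle):
    """Return {sequence name: length} for each record in a FASTA stream."""
    lengths = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: the name is the first whitespace-delimited token.
            name = line[1:].split()[0]
            lengths[name] = 0
        elif name is not None:
            # Sequence line: add its length to the current record.
            lengths[name] += len(line)
    return lengths


# Toy two-chromosome FASTA standing in for a real genome file.
example = io.StringIO(">chr1 toy\nACGTAC\nGT\n>chr2\nNNNN\n")
chrom_lengths = fasta_lengths(example)
```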

Tile a set of genes and save the result in HDF5 for Basenji processing.

| Arguments | Type | Description |
|---|---|---|
| fasta_file | FASTA | FASTA file of chromosome sequences. |
| gtf_file | GTF | Gene annotations in gene transfer format. |
| hdf5_file | HDF5 | Output HDF5 file with gene sequences and descriptions. |
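
As a rough illustration of the `gtf_file` input, the sketch below reads GTF records (nine tab-separated columns, 1-based inclusive coordinates) and collects the overall span of each gene from its `gene_id` attribute. How the script actually tiles genes is not described here, so this covers only the annotation-reading step; the function name `gene_spans` and the toy records are hypothetical.

```python
import io
import re


def gene_spans(handle):
    """Return {gene_id: (chrom, start, end, strand)} covering each gene's records."""
    spans = {}
    for line in handle:
        # Skip comments and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        chrom, strand, attrs = fields[0], fields[6], fields[8]
        start, end = int(fields[3]), int(fields[4])
        match = re.search(r'gene_id "([^"]+)"', attrs)
        if match is None:
            continue
        gene_id = match.group(1)
        if gene_id in spans:
            # Extend the stored span to include this record.
            _, old_start, old_end, _ = spans[gene_id]
            start, end = min(start, old_start), max(end, old_end)
        spans[gene_id] = (chrom, start, end, strand)
    return spans


# Toy annotation with two genes, standing in for a real GTF file.
example = io.StringIO(
    'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1";\n'
    'chr1\ttoy\texon\t300\t450\t.\t+\t.\tgene_id "G1"; transcript_id "T1";\n'
    'chr2\ttoy\texon\t50\t80\t.\t-\t.\tgene_id "G2"; transcript_id "T2";\n'
)
spans = gene_spans(example)
```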