Skip to content

Local transfers just hang after some time #3

@jordancaraballo

Description

@jordancaraballo

Hi Paul,

I am using the latest version of shift taken from the master repo and compiled. I am trying to copy the contents of one directory to another directory within a local system. The only caveat which I assume should not matter is that the original data is located on a GPFS filesystem, and is being transferred to an NFS-based file system. For simple context this is being done at the NCCS.

The command I am using is as follows (the problem occurs both if I submit the job on the backend, or if I actively wait/monitor the submission). I can confirm there are no quota issues and filesystem continues to be writable.

shiftc -r -d --wait --monitor=color /explore/nobackup/projects/ilab/data/LandsatABoVE_GLAD_ARD_Native_All /css/landsat/Collection2/GLAD_ARD/Native_Grid_Update

The first time, the data started transferring for a total of 2.21TB of a total of 100TB. The process just staled at that point. Re-submitted the job several times and it has been stale since. This is the status from the two jobs:

shiftc --status
id | state |       dirs |          files |    file size |  date |           run |     rate
   |       |       sums |          attrs |     sum size |  time |          left |
---+-------+------------+----------------+--------------+-------+---------------+---------
 7 | run   |  4409/4409 | 106761/2680298 | 2.21TB/100TB | 01/28 |     20h58m21s | 29.3MB/s
   |       |  0/5360596 |      0/2684707 |   0.0B/200TB | 17:31 | 1w4d17h29m54s |
 8 | run   |    0/4306+ |     0/2617110+ | 0.0B/97.3TB+ | 01/29 |       8h5m24s |   0.0B/s
   |       | 0/5234220+ |     0/2621416+ |  0.0B/195TB+ | 06:24 |               |

Any ideas of what could be going on? Any suggestions on how to debug this to find a solution?

Thanks,
Jordan

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions