job_resource_manager_cgroups: Nvidia devices not hidden for the first job after boot #193

@bzizou

Description

Setting Enable_devices_cg = "YES" is supposed to hide the GPU devices that are not reserved by the current job.
However, the feature does not seem to work for the first job started right after a node reboot. Subsequent jobs are fine.
Tested on Debian 9.13 nodes with V100 and A100 GPUs, rebooted several times; the problem is reproducible.
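
To make the symptom easier to check from inside a job, here is a minimal sketch of a helper that lists the per-GPU device nodes and reports which ones the current process can actually open. It assumes the hiding is enforced by the devices cgroup, so that opening a non-reserved /dev/nvidiaN fails. The function names are hypothetical and are not part of OAR.

```python
import os
import re


def nvidia_gpu_nodes(dev_dir="/dev"):
    """List per-GPU nodes such as /dev/nvidia0 (skips nvidiactl, nvidia-uvm, ...)."""
    pattern = re.compile(r"^nvidia\d+$")
    names = [n for n in os.listdir(dev_dir) if pattern.match(n)]
    return sorted(os.path.join(dev_dir, n) for n in names)


def accessible_devices(paths):
    """Return the subset of paths that the current process can open read/write."""
    openable = []
    for path in paths:
        try:
            fd = os.open(path, os.O_RDWR)
        except OSError:
            # Assumption: a device denied by the devices cgroup fails at open().
            continue
        os.close(fd)
        openable.append(path)
    return openable


if __name__ == "__main__":
    nodes = nvidia_gpu_nodes()
    print("GPU device nodes present: ", nodes)
    print("GPU device nodes openable:", accessible_devices(nodes))
```

Run inside the first job after a reboot and again inside a later job: if the hiding works, only the reserved GPUs should appear in the second list.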
