Skip to content

Can't start from interactive Slurm session #47

@kcgthb

Description

@kcgthb

Hi!

When trying to start remora from an interactive Slurm session, it immediately exits with the following error:

$ srun --pty bash
[cn] $ remora --help
REMORA Error:  Incorrect syntax: REMORA can't run in parallel
REMORA Howto
remora ./myapp [args]                     (serial applications)
remora ibrun [options] ./myapp [args]     (parallel MPI applications)

Our Slurm setup has MpiDefault=pmi2, and when starting an interactive session with srun --pty bash, the PMI environment is set, so PMI_RANK is defined in the environment, and remora thinks that it's running in a parallel mode (checks are in check_running_parallel() in aux/extra).

Unsetting the PMI_RANK variable allowsremora to start:

$ srun --pty bash
[cn] $ unset PMI_RANK
[cn] $ remora --help
 SYNOPSIS
  remora ./myapp [args]                     (serial applications)
  remora ibrun [options] ./myapp [args]     (parallel MPI applications)

 DESCRIPTION
 REMORA: REsource MOnitoring for Remote Applications
[...]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions