@trz42
Owner

trz42 commented Feb 9, 2025

First PR to start NVIDIA Grace/Hopper software stack.

@eessi-bot-devel-trz42

Instance eX3-dev-300 is configured to build for:

  • architectures: x86_64/amd/zen2, aarch64/generic
  • repositories: nessi-2023.06-swl-deb11, nessi-2023.06-cl, nessi-2023.06-swl-deb10

@eessi-bot-devel-trz42

Instance trz42-GH200-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi-2023.06

@trz42
Owner Author

trz42 commented Feb 9, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted
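
The expansion reported in this update (inst, arch and repo becoming instance, architecture and repository) can be sketched in a few lines of Python. This is only an illustration based on the two command strings in the log. The names `SHORT_TO_LONG`, `expand_command` and `parse_filters` are hypothetical and are not taken from the bot's code.

```python
# Hypothetical sketch of the short-to-long expansion seen in the log.
# The key mapping comes from the log; everything else is an assumption.
SHORT_TO_LONG = {"inst": "instance", "arch": "architecture", "repo": "repository"}


def expand_command(command: str) -> str:
    """Expand abbreviated filter keys in a command such as 'build inst:X arch:Y'."""
    action, *filters = command.split()
    expanded = []
    for item in filters:
        key, _, value = item.partition(":")
        # Unknown keys are passed through unchanged.
        expanded.append(f"{SHORT_TO_LONG.get(key, key)}:{value}")
    return " ".join([action, *expanded])


def parse_filters(expanded: str) -> dict:
    """Turn 'build key:value ...' into a {key: value} dictionary."""
    _, *filters = expanded.split()
    return dict(item.split(":", 1) for item in filters)


if __name__ == "__main__":
    short = "build inst:GH200 arch:grace repo:eessi.io-2023.06-software"
    long_form = expand_command(short)
    print(long_form)
    print(parse_filters(long_form))
```

Each bot instance could then compare the resulting filter values with its own configuration before deciding whether to submit a job. How that comparison is done is not visible in this conversation.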

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454703

date | job status | comment
Feb 09 20:30:48 UTC 2025 | submitted | job id 13454703 awaits release by job manager
Feb 09 20:31:21 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 09 20:32:23 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13454703.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Feb 09 20:32:23 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
Failed for unknown reason
Details
✅ job output file slurm-13454703.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
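
The build report above lists a set of patterns that either must or must not appear in the job output file. A minimal sketch of such a scan follows, assuming the patterns are applied as regular expressions to the whole output text. `Check`, `BUILD_CHECKS` and `run_checks` are hypothetical names, and the sample output string is invented for illustration. Later reports in this conversation show a FAILURE even when all six checks pass, so the bot's overall status evidently depends on more than these patterns.

```python
import re
from typing import NamedTuple


class Check(NamedTuple):
    pattern: str      # regular expression searched for in the job output
    must_match: bool  # True: pattern has to appear; False: it must not appear


# Patterns copied from the bot's report; how the bot applies them is an assumption.
BUILD_CHECKS = [
    Check(r"FATAL:", False),
    Check(r"ERROR:", False),
    Check(r"FAILED:", False),
    Check(r"required modules missing:", False),
    Check(r"No missing installations", True),
    Check(r"\.tar\.gz created!", True),
]


def run_checks(output: str, checks=BUILD_CHECKS):
    """Return a list of (pattern, passed) pairs for the given job output text."""
    results = []
    for check in checks:
        found = re.search(check.pattern, output) is not None
        results.append((check.pattern, found == check.must_match))
    return results


if __name__ == "__main__":
    # Invented sample: a tarball message is present, the installation message is not.
    sample = "building ...\n/tmp/example.tar.gz created!\n"
    for pattern, passed in run_checks(sample):
        print("PASS" if passed else "FAIL", pattern)
```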

@trz42
Owner Author

trz42 commented Feb 9, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454707

date | job status | comment
Feb 09 20:39:01 UTC 2025 | submitted | job id 13454707 awaits release by job manager
Feb 09 20:39:27 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 09 20:40:28 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13454707.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Feb 09 20:40:28 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
Failed for unknown reason
Details
✅ job output file slurm-13454707.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@trz42
Owner Author

trz42 commented Feb 9, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454709

date | job status | comment
Feb 09 20:42:43 UTC 2025 | submitted | job id 13454709 awaits release by job manager
Feb 09 20:43:32 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 09 20:44:34 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13454709.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Feb 09 20:44:34 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
Failed for unknown reason
Details
✅ job output file slurm-13454709.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@trz42
Owner Author

trz42 commented Feb 9, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 9, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454710

date | job status | comment
Feb 09 20:46:13 UTC 2025 | submitted | job id 13454710 awaits release by job manager
Feb 09 20:46:37 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 09 20:47:40 UTC 2025 | running | job 13454710 is running
Feb 09 21:09:10 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13454710.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Feb 09 21:09:10 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-13454710.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
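
The timestamps in the status table for job 13454710 are enough to work out how long the job ran. A small sketch, assuming the date format shown in the log and an English locale for the month abbreviation; `FMT` and `parse_ts` are hypothetical helper names.

```python
from datetime import datetime, timezone

# Format of the timestamps in the bot's status table, e.g. "Feb 09 20:47:40 UTC 2025".
# Note: %b relies on an English locale for month names such as "Feb".
FMT = "%b %d %H:%M:%S UTC %Y"


def parse_ts(text: str) -> datetime:
    """Parse a status-table timestamp into a timezone-aware UTC datetime."""
    return datetime.strptime(text, FMT).replace(tzinfo=timezone.utc)


started = parse_ts("Feb 09 20:47:40 UTC 2025")   # status: running
finished = parse_ts("Feb 09 21:09:10 UTC 2025")  # status: finished
elapsed = finished - started

print(elapsed)  # 0:21:30
```

The same calculation gives roughly one minute between release and finish for the three earlier jobs, which never reached the running status in their tables.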

@trz42
Owner Author

trz42 commented Feb 10, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454932

date | job status | comment
Feb 10 05:07:54 UTC 2025 | submitted | job id 13454932 awaits release by job manager
Feb 10 05:08:51 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 10 05:09:54 UTC 2025 | running | job 13454932 is running
Feb 10 05:21:10 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13454932.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Feb 10 05:21:10 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-13454932.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@trz42
Owner Author

trz42 commented Feb 10, 2025

bot: build inst:GH200 arch:grace repo:eessi.io-2023.06-software

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

Updates by the bot instance trz42-GH200-jr (click for details)

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

Updates by the bot instance eX3-dev-300 (click for details)
  • received bot command build inst:GH200 arch:grace repo:eessi.io-2023.06-software from trz42

    • expanded format: build instance:GH200 architecture:grace repository:eessi.io-2023.06-software
  • handling command build instance:GH200 architecture:grace repository:eessi.io-2023.06-software resulted in:

    • no jobs were submitted

@eessi-bot-devel-trz42

eessi-bot-devel-trz42 bot commented Feb 10, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.02/pr_79/13454934

date | job status | comment
Feb 10 05:29:37 UTC 2025 | submitted | job id 13454934 awaits release by job manager
Feb 10 05:30:15 UTC 2025 | released | job awaits launch by Slurm scheduler
Feb 10 05:31:18 UTC 2025 | running | job 13454934 is running
Feb 10 05:40:31 UTC 2025 | finished | 😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-13454934.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-nvidia-grace-1739165898.tar.gz
size: 118 MiB (124574365 bytes)
entries: 196847
modules under 2023.06/software/linux/aarch64/nvidia/grace/modules/all
EasyBuild/4.8.2.lua
EasyBuild/4.9.0.lua
EasyBuild/4.9.1.lua
EasyBuild/4.9.2.lua
EasyBuild/4.9.3.lua
EasyBuild/4.9.4.lua
EESSI-extend/2023.06-easybuild.lua
software under 2023.06/software/linux/aarch64/nvidia/grace/software
EasyBuild/4.8.2
EasyBuild/4.9.0
EasyBuild/4.9.1
EasyBuild/4.9.2
EasyBuild/4.9.3
EasyBuild/4.9.4
EESSI-extend/2023.06-easybuild
other under 2023.06/software/linux/aarch64/nvidia/grace
.lmod/lmodrc.lua
.lmod/SitePackage.lua
Feb 10 05:40:31 UTC 2025 | test result | 😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-13454934.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
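
The artefact name reported for job 13454934 appears to encode the EESSI version, the operating system, the CPU target and a trailing number. The sketch below splits the name and treats that number as a Unix epoch timestamp. This is an assumption, although the decoded time falls between the job's running and finished entries. The name layout in `NAME_RE` and the helper `describe_artefact` are hypothetical and inferred from this single file name.

```python
import re
from datetime import datetime, timezone

# Assumed layout: eessi-<version>-software-<os>-<cpu target>-<epoch seconds>.tar.gz
NAME_RE = re.compile(
    r"^eessi-(?P<version>[\d.]+)-software-(?P<os>[a-z]+)-"
    r"(?P<arch>.+)-(?P<epoch>\d+)\.tar\.gz$"
)


def describe_artefact(name: str, size_bytes: int) -> dict:
    """Split an artefact file name into its parts and convert the size to MiB."""
    m = NAME_RE.match(name)
    if m is None:
        raise ValueError(f"unexpected artefact name: {name}")
    created = datetime.fromtimestamp(int(m["epoch"]), tz=timezone.utc)
    return {
        "version": m["version"],
        "os": m["os"],
        "arch": m["arch"],
        "created_utc": created.strftime("%Y-%m-%d %H:%M:%S"),
        "size_mib": size_bytes // 2**20,  # whole MiB, rounded down
    }


if __name__ == "__main__":
    info = describe_artefact(
        "eessi-2023.06-software-linux-aarch64-nvidia-grace-1739165898.tar.gz",
        124574365,
    )
    print(info)
```

For this artefact the sketch yields a creation time of 2025-02-10 05:38:18 UTC and a size of 118 MiB, the latter matching the figure in the report.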

