Skip to content

Commit a8fd474

Browse files
committed
Upgrade NVIDIA drivers, CUDA and Fabric Manager
NVIDIA drivers from 470.82.01 to 470.103.01 CUDA from 11.4.3 to 11.4.4 Fabric Manager from 470.82.01 to 470.103.01 Signed-off-by: Luca Carrogu <carrogu@amazon.com>
1 parent 7b28ec8 commit a8fd474

File tree

2 files changed

+5
-5
lines changed

2 files changed

+5
-5
lines changed

CHANGELOG.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,9 @@ This file is used to list changes made in each version of the AWS ParallelCluste
1313
**CHANGES**
1414
- Upgrade Slurm to version 21.08.5.
1515
- Upgrade NICE DCV to version 2021.3-11591.
16-
- Upgrade NVIDIA driver to version 470.82.01.
17-
- Upgrade CUDA library to version 11.4.3.
18-
- Upgrade NVIDIA Fabric manager to version 470.82.01.
16+
- Upgrade NVIDIA driver to version 470.103.01.
17+
- Upgrade CUDA library to version 11.4.4.
18+
- Upgrade NVIDIA Fabric manager to version 470.103.01.
1919
- Upgrade Intel MPI Library to 2021.4.0.441.
2020
- Upgrade PMIx to version 3.2.3.
2121
- Move the configure/install recipes to separate cookbooks that are called from the main one. Existing entrypoints are maintained and backwards compatible.

attributes/default.rb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -166,12 +166,12 @@
166166

167167
# NVIDIA
168168
default['cluster']['nvidia']['enabled'] = 'no'
169-
default['cluster']['nvidia']['driver_version'] = '470.82.01'
169+
default['cluster']['nvidia']['driver_version'] = '470.103.01'
170170
default['cluster']['nvidia']['cuda_version'] = '11.4'
171171
default['cluster']['nvidia']['driver_url_architecture_id'] = arm_instance? ? 'aarch64' : 'x86_64'
172172
default['cluster']['nvidia']['cuda_url_architecture_id'] = arm_instance? ? 'linux_sbsa' : 'linux'
173173
default['cluster']['nvidia']['driver_url'] = "https://us.download.nvidia.com/tesla/#{node['cluster']['nvidia']['driver_version']}/NVIDIA-Linux-#{node['cluster']['nvidia']['driver_url_architecture_id']}-#{node['cluster']['nvidia']['driver_version']}.run"
174-
default['cluster']['nvidia']['cuda_url'] = "https://developer.download.nvidia.com/compute/cuda/11.4.3/local_installers/cuda_11.4.3_470.82.01_#{node['cluster']['nvidia']['cuda_url_architecture_id']}.run"
174+
default['cluster']['nvidia']['cuda_url'] = "https://developer.download.nvidia.com/compute/cuda/11.4.4/local_installers/cuda_11.4.4_470.82.01_#{node['cluster']['nvidia']['cuda_url_architecture_id']}.run"
175175

176176
# NVIDIA fabric-manager
177177
# The package name of Fabric Manager for alinux2 and centos7 is nvidia-fabric-manager-version

0 commit comments

Comments
 (0)