HPCSYSPROS/IntroToClusterSysadmin

IntroToClusterSysadmin

Information for the Intro to Cluster System Administration for Non-Sysadmins class

Outline

General Documentation

First part - Network services

Second part - Images

Third part - Node management

  • ssh keys/parallel shell
    • /install/postscripts/remoteshell
    • xdsh
  • User management (passwd/shadow/group files)
    • /install/postscripts/syncfiles
    • updatenode {node} -F
  • Shared file system and time synchronization (NFS/NTP)
    • Postinstall scripts
      • /install/custom/install/centos7.9/x86_64/centos7.postinstall
  • Resource manager (SLURM)
  • Firewalls (iptables) best practices
    • Lock down ssh on the login node
    • Lock down SLURM on the resource manager node
    • Leave nodes on a private subnet
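
As a rough illustration of the "lock down ssh on the login node" item above, the sketch below prints a ruleset in iptables-restore format. The function name login_node_rules and both example networks (192.0.2.0/24 as the trusted outside network, 10.0.0.0/24 as the private cluster subnet) are assumptions made for this example and are not taken from the class material; adjust the policy to your own site before loading it.

```shell
#!/bin/sh
# Hypothetical helper: print an iptables-restore ruleset for a login node.
# $1 = network allowed to reach ssh (assumed example: 192.0.2.0/24)
# $2 = private cluster subnet       (assumed example: 10.0.0.0/24)
login_node_rules() {
    trusted_cidr=$1
    cluster_cidr=$2
    cat <<EOF
*filter
:INPUT DROP [0:0]
:FORWARD DROP [0:0]
:OUTPUT ACCEPT [0:0]
-A INPUT -i lo -j ACCEPT
-A INPUT -m conntrack --ctstate ESTABLISHED,RELATED -j ACCEPT
-A INPUT -s ${cluster_cidr} -j ACCEPT
-A INPUT -p tcp -s ${trusted_cidr} --dport 22 -j ACCEPT
COMMIT
EOF
}

# Example use (as root), checking the rules before loading them:
#   login_node_rules 192.0.2.0/24 10.0.0.0/24 > /tmp/login.rules
#   iptables-restore --test < /tmp/login.rules && iptables-restore < /tmp/login.rules
```

The default INPUT policy is DROP, so anything not explicitly accepted (loopback, established connections, the cluster subnet, and ssh from the trusted network) is rejected. Printing the rules instead of applying them directly makes it possible to review or test them first.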

More Resources

Contents

  • This repo contains an example xCAT dump from a working cluster, and a copy of the /install directory with relevant changes to config files.
  • To use the xCAT config:
    restorexCATdb -p xCAT-dump-example
  • The root password in the xCAT passwd table needs to be changed.
    • Generate a root password hash:
      openssl passwd -1
    • Set root password:
      chtab key=system passwd.username=root passwd.password={Hash from openssl command}
  • To use the install directory, copy its contents into /install:
    cp -rf install-example/. /install/
  • Update the password hash for the root password in the file /install/custom/syncfiles/all/etc/shadow with the openssl hash from above.
  • The /install/post/install/otherpkgs/x86_64/centos7.9/ directory needs to be created:
    copycds {CentOS 7.9 Everything DVD.iso}
  • The /install/netboot/centos7.9/x86_64/compute/ files need to be created:
    genimage centos7.9-x86_64-netboot-compute
    packimage centos7.9-x86_64-netboot-compute
  • The ntp_nfs.txt and slurm_install.txt documents describe detailed steps taken to configure those services on the example cluster.
  • You need to create the virtual machines using the information in the xCAT vm table. See xCAT-dump-example/vm.csv.
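
The shadow-file step above can be scripted instead of edited by hand. The following is a minimal sketch, assuming a standard colon-separated shadow file; the function name set_shadow_hash is hypothetical and not part of xCAT or this repo.

```shell
#!/bin/sh
# Hypothetical helper: replace the password-hash field (field 2) for one
# user in a shadow-format file.
# Usage: set_shadow_hash FILE USER HASH
set_shadow_hash() {
    file=$1
    user=$2
    hash=$3
    tmp=$(mktemp) || return 1
    # Pass the values through the environment so the "$" characters in the
    # hash are never re-interpreted by awk's -v escape processing.
    if U="$user" H="$hash" awk -F: -v OFS=: \
        '$1 == ENVIRON["U"] { $2 = ENVIRON["H"] } { print }' "$file" > "$tmp"
    then
        # Overwrite in place so the file keeps its owner and permissions.
        cat "$tmp" > "$file"
        rm -f "$tmp"
    else
        rm -f "$tmp"
        return 1
    fi
}

# Example use with the commands from the list above:
#   hash=$(openssl passwd -1)      # prompts for the new root password
#   set_shadow_hash /install/custom/syncfiles/all/etc/shadow root "$hash"
#   chtab key=system passwd.username=root passwd.password="$hash"
```

Storing the hash in a variable means the same value goes into both the xCAT passwd table and the synced shadow file, which avoids the two drifting apart.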
