Software installers to build custom AMIs for AWS PCS
AWS provides a downloadable file that can install the AWS PCS software on an instance. AWS also provides software that can download, compile, and install relevant versions of Slurm and its dependencies. You can use these instructions to build custom AMIs for use with AWS PCS or you can use your own methods.
AWS PCS software installer
The AWS PCS software installer configures an instance to work with AWS PCS during the instance bootstrap process. You must use AWS-provided installers to install the AWS PCS software on your custom AMI.
Slurm installer
The Slurm installer downloads, compiles, and installs relevant versions of Slurm and its dependencies. You can use the Slurm installer to build custom AMIs for AWS PCS. You can also use your own mechanisms if they are consistent with the software configuration that the Slurm installer provides.
The AWS-provided software installs the following:
- Slurm at the requested major and maintenance version (currently version 23.11.10), license GPL 2
  - Slurm is built with --sysconfdir set to /etc/slurm
  - Slurm is built with the options --enable-pam and --without-munge
  - Slurm is built with the option --sharedstatedir=/run/slurm/
  - Slurm is built with PMIX and JWT support
  - Slurm is installed at /opt/aws/pcs/schedulers/slurm-23.11
- OpenPMIX (version 4.2.6)
  - OpenPMIX is installed as a subdirectory of /opt/aws/pcs/scheduler/
- libjwt (version 1.17.0), license MPL-2.0
  - libjwt is installed as a subdirectory of /opt/aws/pcs/scheduler/
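If you build Slurm with your own tooling, the build options listed above can be sketched as a single configure command line. This is an illustrative sketch under stated assumptions, not the command the AWS installer actually runs: the helper name print_slurm_configure is hypothetical, PMIX_DIR and JWT_DIR are placeholders for subdirectories that this page does not name, and using --prefix, --with-pmix, and --with-jwt to express the install location and the PMIX and JWT support is an assumption.

```shell
# Sketch: print a Slurm ./configure command that mirrors the documented build options.
# Arguments (all optional, all hypothetical defaults):
#   $1 install prefix, $2 OpenPMIX directory, $3 libjwt directory
print_slurm_configure() {
  prefix="${1:-/opt/aws/pcs/schedulers/slurm-23.11}"
  # PMIX_DIR and JWT_DIR are placeholders; the actual subdirectory names are not documented here.
  pmix_dir="${2:-/opt/aws/pcs/scheduler/PMIX_DIR}"
  jwt_dir="${3:-/opt/aws/pcs/scheduler/JWT_DIR}"
  printf '%s\n' "./configure --prefix=${prefix} --sysconfdir=/etc/slurm --enable-pam --without-munge --sharedstatedir=/run/slurm/ --with-pmix=${pmix_dir} --with-jwt=${jwt_dir}"
}
```

Calling print_slurm_configure with no arguments prints the command with the placeholder paths, which you would replace with the real locations on your build host before running it from the Slurm source directory.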
The AWS-provided software changes the system configuration as follows:
- The Slurm systemd file created by the build is copied to /etc/systemd/system/ with file name slurmd-23.11.service.
- If they don't exist, a Slurm user and group (slurm:slurm) are created with UID/GID of 401.
- On Amazon Linux 2 and Rocky Linux 9, the installation adds the EPEL repository to install the required software to build Slurm or its dependencies.
- On RHEL9, the installation enables codeready-builder-for-rhel-9-rhui-rpms and epel-release-latest-9 from fedoraproject to install the required software to build Slurm or its dependencies.
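If you use your own build mechanism, one way to confirm that an instance matches the user and group settings described above is to inspect the slurm entry in the passwd database. The following is a minimal sketch, not part of the AWS installers: the function name check_slurm_entry is hypothetical, and it assumes that the primary group field of the passwd entry carries the GID of 401.

```shell
# Sketch: succeed only if a passwd-format line describes user "slurm" with UID 401 and GID 401.
# $1 is one line in /etc/passwd format (name:password:UID:GID:comment:home:shell).
check_slurm_entry() {
  printf '%s\n' "$1" | awk -F: '
    $1 == "slurm" && $3 == 401 && $4 == 401 { ok = 1 }
    END { exit (ok ? 0 : 1) }'
}
```

On a built instance, this could be used as check_slurm_entry "$(getent passwd slurm)", optionally followed by test -f /etc/systemd/system/slurmd-23.11.service to confirm that the service file is in place.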
Supported operating systems
The AWS PCS software and Slurm installers support the following operating systems:
- Amazon Linux 2
- Red Hat Enterprise Linux 9
- Rocky Linux 9
- Ubuntu 22.04
Note
AWS Deep Learning AMIs (DLAMI) versions based on Amazon Linux 2 and Ubuntu 22.04 should be compatible with the AWS PCS software and Slurm installers. For more information, see Choosing Your DLAMI in the AWS Deep Learning AMIs Developer Guide.
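Before running the installers in an AMI build, you might want to check that the base image is one of the supported operating systems. The following sketch compares the ID and VERSION_ID values from /etc/os-release against the list above. The function name is_supported_os is hypothetical, and the mapping of ID values (amzn, rhel, rocky, ubuntu) to the listed operating systems is an assumption based on what those distributions commonly report, not something the installers document.

```shell
# Sketch: return 0 if the given os-release ID and VERSION_ID match a supported operating system.
# $1: ID from /etc/os-release, $2: VERSION_ID from /etc/os-release
is_supported_os() {
  case "$1:$2" in
    amzn:2) return 0 ;;                 # Amazon Linux 2
    rhel:9|rhel:9.*) return 0 ;;        # Red Hat Enterprise Linux 9 (any minor release)
    rocky:9|rocky:9.*) return 0 ;;      # Rocky Linux 9 (any minor release)
    ubuntu:22.04) return 0 ;;           # Ubuntu 22.04
    *) return 1 ;;
  esac
}
```

A typical call sources the file first, for example: . /etc/os-release && is_supported_os "$ID" "$VERSION_ID" || echo "unsupported operating system".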
Supported instance types
The AWS PCS software and Slurm installers support any x86_64 or arm64 instance type that can run one of the supported operating systems.
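A build script can make the same architecture check explicit. In this sketch, the function name is_supported_arch is hypothetical, and treating aarch64 as equivalent to arm64 is an assumption based on how Linux commonly reports 64-bit Arm processors through uname -m.

```shell
# Sketch: return 0 if the machine architecture is x86_64 or 64-bit Arm.
# $1 (optional): architecture string; defaults to the output of uname -m.
is_supported_arch() {
  case "${1:-$(uname -m)}" in
    x86_64|aarch64|arm64) return 0 ;;
    *) return 1 ;;
  esac
}
```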
Supported Slurm versions
The following major versions of Slurm are supported:
- Slurm 23.11
Verify installers using a checksum
You can use SHA256 checksums to verify the installer tarball (.tar.gz) files. We recommend that you do this to verify the identity of the software publisher and to check that the application has not been altered or corrupted since it was published.
To verify a tarball
Use the sha256sum utility for the SHA256 checksum and specify the tarball filename. You must run the command from the directory where you saved the tarball file.
$ sha256sum tarball_filename.tar.gz
The command should return a checksum value in the following format.
checksum_value tarball_filename.tar.gz
Compare the checksum value returned by the command with the checksum value provided in the following table. If the checksums match, then it's safe to run the installation script.
Important
If the checksums don't match, don't run the installation script. Contact AWS Support.
For example, the following command generates the SHA256 checksum for the Slurm 23.11.10 tarball.
$ sha256sum aws-pcs-slurm-23.11-installer-23.11.10-1.tar.gz
Example output:
1de7d919c8632fe8e2806611bed4fde1005a4fadc795412456e935c7bba2a9b8 aws-pcs-slurm-23.11-installer-23.11.10-1.tar.gz
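The manual comparison can also be scripted so that an AMI build stops when the checksum doesn't match. The following is a minimal sketch rather than an AWS-provided tool: the function name verify_tarball and its messages are hypothetical, and the expected checksum is whatever value you copy from the table for your installer version.

```shell
# Sketch: compare the SHA256 checksum of a tarball with an expected value.
# $1: path to the tarball, $2: expected SHA256 checksum
# Returns 0 on a match, 1 on a mismatch, 2 if the file is missing.
verify_tarball() {
  if [ ! -f "$1" ]; then
    echo "File not found: $1" >&2
    return 2
  fi
  # sha256sum prints "<checksum>  <filename>"; keep only the checksum field.
  actual=$(sha256sum "$1" | awk '{ print $1 }')
  if [ "$actual" = "$2" ]; then
    echo "OK: checksum matches for $1"
    return 0
  fi
  echo "MISMATCH: do not run the installation script for $1" >&2
  return 1
}
```

For example, verify_tarball aws-pcs-slurm-23.11-installer-23.11.10-1.tar.gz followed by the published checksum returns a nonzero status on a mismatch, so a build script can stop before it extracts or runs anything.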
The following table lists the checksums for recent versions of the installers. Replace us-east-1 with the AWS Region where you use AWS PCS.
Installer | Download URL | SHA256 checksum |
---|---|---|
AWS PCS agent 1.0.1-1 | | |
AWS PCS agent 1.0.0-1 | | |
Slurm 23.11.10-1 | | |
Slurm 23.11.9-1 | | |