Releases: aws/aws-parallelcluster-cookbook

AWS ParallelCluster v2.11.5

01 Mar 18:29

We're excited to announce the release of AWS ParallelCluster Cookbook 2.11.5

This is associated with AWS ParallelCluster v2.11.5

CHANGES

  • Drop support for SGE and Torque schedulers.
  • Remove nodewatcher, sqswatcher, jobwatcher related code.
  • Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid potential performance degradation.
  • Upgrade NVIDIA driver to version 470.103.01.
  • Upgrade CUDA library to version 11.4.4.
  • Upgrade NVIDIA Fabric manager to version 470.103.01.
  • Upgrade Intel MPI Library to 2021.4.0.441.

BUG FIXES

  • Fix DCV connection through browsers.

AWS ParallelCluster v3.1.1

10 Feb 19:02

We're excited to announce the release of AWS ParallelCluster Cookbook 3.1.1

This is associated with AWS ParallelCluster v3.1.1

ENHANCEMENTS

  • Add support for multi-user cluster environments by integrating with Active Directory (AD) domains managed via AWS Directory Service.
  • Install NVIDIA drivers and CUDA library for ARM.

CHANGES

  • Upgrade Slurm to version 21.08.5.
  • Upgrade NICE DCV to version 2021.3-11591.
  • Upgrade NVIDIA driver to version 470.103.01.
  • Upgrade CUDA library to version 11.4.4.
  • Upgrade NVIDIA Fabric manager to version 470.103.01.
  • Upgrade Intel MPI Library to 2021.4.0.441.
  • Upgrade PMIx to version 3.2.3.
  • Move the configure/install recipes to separate cookbooks that are called from the main one. Existing entrypoints are maintained and backwards compatible.
  • Download dependencies of Intel HPC platform at AMI build time to avoid contacting the internet at cluster creation time.
  • Do not strip hyphens (-) from compute resource names when configuring Slurm nodes.
  • Add cluster parameter directory_service.disabled_on_compute_nodes to disable AD integration on compute nodes.

BUG FIXES

  • Do not configure GPUs in Slurm when NVIDIA driver is not installed.
  • Fix the way ExtraChefAttributes are merged into the final configuration.
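
As an illustration of what merging extra attributes into a base configuration can involve, here is a minimal recursive merge sketch. It is not the cookbook's actual implementation (the cookbook is written in Ruby and these notes do not describe the fix), and the attribute names below are hypothetical, chosen only for the example.

```python
from copy import deepcopy


def deep_merge(base, override):
    """Return a new dict in which values from `override` take precedence.

    Nested dicts are merged key by key rather than replaced wholesale,
    so sibling keys that `override` does not mention are preserved.
    """
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


# Hypothetical attribute trees, for illustration only.
defaults = {"cluster": {"scheduler": "slurm", "dcv": {"enabled": False, "port": 8443}}}
extra = {"cluster": {"dcv": {"enabled": True}}}

merged = deep_merge(defaults, extra)
print(merged)
```

A shallow merge of the same inputs would replace the whole `cluster` entry and drop `scheduler` and `port`, which is the kind of surprise a recursive merge avoids.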

AWS ParallelCluster v3.0.3

17 Jan 13:50

We're excited to announce the release of AWS ParallelCluster Cookbook 3.0.3

This is associated with AWS ParallelCluster v3.0.3

CHANGES

  • Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid potential performance degradation.

AWS ParallelCluster v2.11.4

20 Dec 17:02

We're excited to announce the release of AWS ParallelCluster Cookbook 2.11.4

This is associated with AWS ParallelCluster v2.11.4

CHANGES

  • CentOS 8 is no longer supported (EOL on December 31st, 2021).
  • Upgrade Slurm to version 20.11.8.
  • Upgrade Cinc Client to version 17.2.29.
  • Upgrade NICE DCV to version 2021.2-11190.
  • Upgrade NVIDIA driver to version 470.82.01.
  • Upgrade CUDA library to version 11.4.3.
  • Upgrade NVIDIA Fabric manager to 470.82.01.
  • Disable unattended package updates on Ubuntu.
  • Install Python 3 version of aws-cfn-bootstrap scripts on CentOS 7 and Ubuntu 18.04, aligning with Ubuntu 20.04 and Amazon Linux 2.

AWS ParallelCluster v3.0.2

05 Nov 18:27

We're excited to announce the release of AWS ParallelCluster Cookbook 3.0.2

This is associated with AWS ParallelCluster v3.0.2

3.0.2

CHANGES

  • Upgrade EFA installer to version 1.14.1. From this version on, EFA enables GDR support by default on supported
    instance types, so ParallelCluster no longer reinstalls EFA during node start. Previously, EFA was reinstalled if
    GdrSupport had been turned on in the configuration file.
    • EFA configuration: efa-config-1.9-1
    • EFA profile: efa-profile-1.5-1
    • EFA kernel module: efa-1.14.2
    • RDMA core: rdma-core-37.0
    • Libfabric: libfabric-1.13.2
    • Open MPI: openmpi40-aws-4.1.1-2

BUG FIXES

  • Fix issue that prevented cluster names from starting with the parallelcluster- prefix.

AWS ParallelCluster v2.11.3

03 Nov 17:57

We're excited to announce the release of AWS ParallelCluster Cookbook 2.11.3

This is associated with AWS ParallelCluster v2.11.3

2.11.3

CHANGES

  • Upgrade EFA installer to version 1.14.1. From this version on, EFA enables GDR support by default on supported
    instance types, so ParallelCluster no longer reinstalls EFA during node start. Previously, EFA was reinstalled if
    enable_efa_gdr had been turned on in the configuration file.
    • EFA configuration: efa-config-1.9-1
    • EFA profile: efa-profile-1.5-1
    • EFA kernel module: efa-1.14.2
    • RDMA core: rdma-core-37.0
    • Libfabric: libfabric-1.13.2
    • Open MPI: openmpi40-aws-4.1.1-2

BUG FIXES

  • Fix failure when building AMI, due to SGE sources not being available at arc.liv.ac.uk.
  • Fix cluster update when using proxy setup.
  • Update ca-certificates package during AMI build time and prevent Chef from using outdated/distrusted CA certificates.

AWS ParallelCluster v3.0.1

27 Oct 14:24

We're excited to announce the release of AWS ParallelCluster Cookbook 3.0.1

This is associated with AWS ParallelCluster v3.0.1

CHANGES

  • Change supervisord service script from SysVinit to Systemd.
  • Drop support for SysVinit. Only Systemd is supported.

BUG FIXES

  • Fix supervisord service not enabled on Ubuntu.
  • Update ca-certificates package during AMI build time and prevent Chef from using outdated/distrusted CA certificates.

AWS ParallelCluster v3.0.0

10 Sep 15:52

We're excited to announce the release of AWS ParallelCluster Cookbook 3.0.0

This is associated with AWS ParallelCluster v3.0.0

3.0.0

ENHANCEMENTS

  • Support restart/reboot for instance types with instance store (ephemeral drives).
  • Compile Slurm with jobcomp/elasticsearch support.

CHANGES

  • Drop support for SGE and Torque schedulers.
  • Drop support for CentOS 8.
  • Remove nodewatcher, sqswatcher, jobwatcher related code.
  • Remove Ganglia support.
  • Install ParallelCluster AWS Batch CLI at AMI build time.
  • Run daemons as cluster admin user (not root).
  • Add explicit assignment of names, uids, gids for slurm, munge and dcvextauth users.
  • Remove packer.
  • Restrict access to IMDS to root and cluster admin users only.
  • Make PATH include required directories for every user and recipe context.
  • Fail cluster creation when IMDS lockdown is not working correctly.
  • Make sudoers secure_path include the same directories on every platform.
  • Remove option for instance store software encryption (encrypted_ephemeral).
  • Add support for iptables restore on instance reboot.
  • Allow IMDS access for dcv user when dcv is enabled.
  • Restore the noatime option, which has a positive impact on NFS filesystem performance.
  • Upgrade NICE DCV to version 2021.1-10851.
  • Upgrade Slurm to version 20.11.8.
  • Upgrade Cinc Client to version 17.2.29.
  • Upgrade NVIDIA driver to version 470.57.02.
  • Upgrade CUDA library to version 11.4.0.
  • Avoid installing MPICH and FFTW packages.
  • Upgrade EFA installer to version 1.13.0.
    • Update rdma-core to v35.0.
    • Update libfabric to v1.13.0amzn1.0.

BUG FIXES

  • Fix cluster update when using proxy setup.

AWS ParallelCluster v2.11.2

26 Aug 17:02

We're excited to announce the release of AWS ParallelCluster Cookbook 2.11.2

This is associated with AWS ParallelCluster v2.11.2

2.11.2

CHANGES

  • When using a custom AMI with a preinstalled EFA package, no action is taken at node bootstrap time when GPUDirect RDMA is enabled. The original EFA package deployment is preserved as it was during the createami process.
  • Upgrade EFA installer to version 1.13.0.
    • Update rdma-core to v35.0.
    • Update libfabric to v1.13.0amzn1.0.

BUG FIXES

  • Lock the version of the nvidia-fabricmanager package to that of the installed NVIDIA drivers to prevent updates and misalignments.

AWS ParallelCluster v2.11.1

23 Jul 23:52

We're excited to announce the release of AWS ParallelCluster Cookbook 2.11.1

This is associated with AWS ParallelCluster v2.11.1

ENHANCEMENTS

  • Retry failed installations of aws-parallelcluster package on head node of clusters using AWS Batch as the scheduler.

CHANGES

  • Restore the noatime option, which has a positive impact on NFS filesystem performance.

BUG FIXES

  • Pin the CloudWatch agent to version 1.247347 due to the performance impact of the latest CW agent version, 1.247348.
  • Avoid failures when building SGE using instance types with 32 or more vCPUs.