Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{lib}[GCCcore/13.2.0] UCC-CUDA v1.2.0 w/ CUDA 12.4.0 #21565

Merged

Conversation

tanmoy1989
Copy link
Contributor

(created using eb --new-pr)

@bedroge
Copy link
Contributor

bedroge commented Nov 28, 2024

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Collaborator

@bedroge: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=21565 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_21565 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 5347

Test results coming soon (I hope)...

- notification for comment with ID 2506383715 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 555.42.06, Python 3.9.19
See https://gist.github.com/boegelbot/05bd9c65201f298b88538e2de51245d3 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
bob-Latitude-5300 - Linux Ubuntu 24.04.1 LTS (Noble Numbat), x86_64, Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, Python 3.12.3
See https://gist.github.com/bedroge/10ee75ec519bee1c85a89de46b075833 for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

Test report by @bedroge
SUCCESS
Build succeeded for 2 out of 2 (1 easyconfigs in total)
l40sgpu1 - Linux Rocky Linux 8.10, x86_64, Intel(R) Xeon(R) Gold 6240R CPU @ 2.40GHz (cascadelake), 1 x NVIDIA NVIDIA L40S, 555.42.06, Python 3.6.8
See https://gist.github.com/bedroge/9582990b43a8f9541c241c8dc6fbf7ca for a full test report.

@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

@boegelbot please test @ generoso

@boegelbot
Copy link
Collaborator

@bedroge: Request for testing this PR well received on login1

PR test command 'EB_PR=21565 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs /opt/software/slurm/bin/sbatch --job-name test_PR_21565 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 14781

Test results coming soon (I hope)...

- notification for comment with ID 2507325141 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

@boegelbot please test @ jsc-zen3

@boegelbot
Copy link
Collaborator

@bedroge: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=21565 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_21565 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 5352

Test results coming soon (I hope)...

- notification for comment with ID 2507331890 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (1 easyconfigs in total)
cns1 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/boegelbot/5456fe736e6e1a336d6b040da2a1aa2e for a full test report.

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.5, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.19
See https://gist.github.com/boegelbot/13a1e6a6f59da8a9fa035e1504e689f9 for a full test report.

@bedroge bedroge added the update label Nov 29, 2024
Copy link

Updated software UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb

Diff against UCC-CUDA-1.3.0-GCCcore-13.3.0-CUDA-12.6.0.eb

easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.3.0-GCCcore-13.3.0-CUDA-12.6.0.eb

diff --git a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.3.0-GCCcore-13.3.0-CUDA-12.6.0.eb b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
index a0b4865a72..f8bba05195 100644
--- a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.3.0-GCCcore-13.3.0-CUDA-12.6.0.eb
+++ b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
@@ -1,7 +1,7 @@
 easyblock = 'ConfigureMake'
 
 name = 'UCC-CUDA'
-version = '1.3.0'
+version = '1.2.0'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://www.openucx.org/'
@@ -12,7 +12,7 @@ feature-rich for current and emerging programming models and runtimes.
 This module adds the UCC CUDA support.
 """
 
-toolchain = {'name': 'GCCcore', 'version': '13.3.0'}
+toolchain = {'name': 'GCCcore', 'version': '13.2.0'}
 toolchainopts = {'pic': True}
 
 source_urls = ['https://github.com/openucx/ucc/archive/refs/tags']
@@ -21,21 +21,21 @@ patches = [
     '%(name)s-%(version)s_link_against_existing_UCC_libs.patch',
 ]
 checksums = [
-    {'v1.3.0.tar.gz': 'b56379abe5f1c125bfa83be305d78d81a64aa271b7b5fff0ac17b86725ff3acf'},
-    {'UCC-CUDA-1.3.0_link_against_existing_UCC_libs.patch':
-     '758228357ce2a6ae50fb26a0b43e9176feaf379e266365f38205ce679267fc0d'},
+    {'v1.2.0.tar.gz': 'c1552797600835c0cf401b82dc89c4d27d5717f4fb805d41daca8e19f65e509d'},
+    {'UCC-CUDA-1.2.0_link_against_existing_UCC_libs.patch':
+     '84157be5eae96d2501df076bcf0598b104adf80abeca028a144c4fb098638207'},
 ]
 
 builddependencies = [
-    ('binutils', '2.42'),
-    ('Autotools', '20231222'),
+    ('binutils', '2.40'),
+    ('Autotools', '20220317'),
 ]
 
 dependencies = [
     ('UCC', version),
-    ('CUDA',  '12.6.0', '', SYSTEM),
-    ('UCX-CUDA', '1.16.0', '-CUDA-%(cudaver)s'),
-    ('NCCL', '2.22.3', '-CUDA-%(cudaver)s'),
+    ('CUDA',  '12.4.0', '', SYSTEM),
+    ('UCX-CUDA', '1.15.0', '-CUDA-%(cudaver)s'),
+    ('NCCL', '2.20.5', '-CUDA-%(cudaver)s'),
 ]
 
 preconfigopts = "./autogen.sh && "
Diff against UCC-CUDA-1.2.0-GCCcore-12.3.0-CUDA-12.1.1.eb

easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-12.3.0-CUDA-12.1.1.eb

diff --git a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-12.3.0-CUDA-12.1.1.eb b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
index 8594d50984..f8bba05195 100644
--- a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-12.3.0-CUDA-12.1.1.eb
+++ b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
@@ -12,7 +12,7 @@ feature-rich for current and emerging programming models and runtimes.
 This module adds the UCC CUDA support.
 """
 
-toolchain = {'name': 'GCCcore', 'version': '12.3.0'}
+toolchain = {'name': 'GCCcore', 'version': '13.2.0'}
 toolchainopts = {'pic': True}
 
 source_urls = ['https://github.com/openucx/ucc/archive/refs/tags']
@@ -33,9 +33,9 @@ builddependencies = [
 
 dependencies = [
     ('UCC', version),
-    ('CUDA',  '12.1.1', '', SYSTEM),
-    ('UCX-CUDA', '1.14.1', '-CUDA-%(cudaver)s'),
-    ('NCCL', '2.18.3', '-CUDA-%(cudaver)s'),
+    ('CUDA',  '12.4.0', '', SYSTEM),
+    ('UCX-CUDA', '1.15.0', '-CUDA-%(cudaver)s'),
+    ('NCCL', '2.20.5', '-CUDA-%(cudaver)s'),
 ]
 
 preconfigopts = "./autogen.sh && "
Diff against UCC-CUDA-1.1.0-GCCcore-12.2.0-CUDA-12.0.0.eb

easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.1.0-GCCcore-12.2.0-CUDA-12.0.0.eb

diff --git a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.1.0-GCCcore-12.2.0-CUDA-12.0.0.eb b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
index bfe211063d..f8bba05195 100644
--- a/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.1.0-GCCcore-12.2.0-CUDA-12.0.0.eb
+++ b/easybuild/easyconfigs/u/UCC-CUDA/UCC-CUDA-1.2.0-GCCcore-13.2.0-CUDA-12.4.0.eb
@@ -1,7 +1,7 @@
 easyblock = 'ConfigureMake'
 
 name = 'UCC-CUDA'
-version = '1.1.0'
+version = '1.2.0'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://www.openucx.org/'
@@ -12,32 +12,30 @@ feature-rich for current and emerging programming models and runtimes.
 This module adds the UCC CUDA support.
 """
 
-toolchain = {'name': 'GCCcore', 'version': '12.2.0'}
+toolchain = {'name': 'GCCcore', 'version': '13.2.0'}
 toolchainopts = {'pic': True}
 
 source_urls = ['https://github.com/openucx/ucc/archive/refs/tags']
 sources = ['v%(version)s.tar.gz']
 patches = [
-    '%(name)s-1.0.0_link_against_existing_UCC_libs.patch',
-    '%(name)s-%(version)s_cuda_12_mem_ops.patch',
+    '%(name)s-%(version)s_link_against_existing_UCC_libs.patch',
 ]
 checksums = [
-    {'v1.1.0.tar.gz': '74c8ba75037b5bd88cb703e8c8ae55639af3fecfd4428912a433c010c97b4df7'},
-    {'UCC-CUDA-1.0.0_link_against_existing_UCC_libs.patch':
-     '9fa11cf6779174f4e9048df5812096e4261e1769d465cc7f34a6354398876856'},
-    {'UCC-CUDA-1.1.0_cuda_12_mem_ops.patch': 'fc3ea1487d29dc626db2363ef5a79e7f0906f6a7507a363fa6167a812b143eb6'},
+    {'v1.2.0.tar.gz': 'c1552797600835c0cf401b82dc89c4d27d5717f4fb805d41daca8e19f65e509d'},
+    {'UCC-CUDA-1.2.0_link_against_existing_UCC_libs.patch':
+     '84157be5eae96d2501df076bcf0598b104adf80abeca028a144c4fb098638207'},
 ]
 
 builddependencies = [
-    ('binutils', '2.39'),
+    ('binutils', '2.40'),
     ('Autotools', '20220317'),
 ]
 
 dependencies = [
-    ('UCC', '1.1.0'),
-    ('CUDA',  '12.0.0', '', SYSTEM),
-    ('UCX-CUDA', '1.13.1', '-CUDA-%(cudaver)s'),
-    ('NCCL', '2.16.2', '-CUDA-%(cudaver)s'),
+    ('UCC', version),
+    ('CUDA',  '12.4.0', '', SYSTEM),
+    ('UCX-CUDA', '1.15.0', '-CUDA-%(cudaver)s'),
+    ('NCCL', '2.20.5', '-CUDA-%(cudaver)s'),
 ]
 
 preconfigopts = "./autogen.sh && "

Copy link
Contributor

@bedroge bedroge left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me!

@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

Test report by @bedroge
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
bob-Latitude-5300 - Linux Ubuntu 24.04.1 LTS (Noble Numbat), x86_64, Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, Python 3.12.3
See https://gist.github.com/bedroge/c4231eacc37a15d1e955978124318e0e for a full test report.

@bedroge bedroge added this to the release after 4.9.4 milestone Nov 29, 2024
@bedroge
Copy link
Contributor

bedroge commented Nov 29, 2024

Going in, thanks @tanmoy1989!

@bedroge bedroge merged commit d107aff into easybuilders:develop Nov 29, 2024
10 checks passed
@tanmoy1989 tanmoy1989 deleted the 20241007044945_new_pr_UCC-CUDA120 branch November 29, 2024 12:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants