Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build error when trying to upgrade cmake version (to 3.30) for GPU builds (on pm-gpu at least) #6863

Open
ndkeen opened this issue Dec 19, 2024 · 2 comments · May be fixed by #6861
Open

Build error when trying to upgrade cmake version (to 3.30) for GPU builds (on pm-gpu at least) #6863

ndkeen opened this issue Dec 19, 2024 · 2 comments · May be fixed by #6861
Labels
EAMxx PRs focused on capabilities for EAMxx Machine Files pm-gpu Perlmutter machine at NERSC (GPU nodes)

Comments

@ndkeen
Copy link
Contributor

ndkeen commented Dec 19, 2024

Trying to update the cmake version on pm-cpu/pm-gpu. Cases seem ok on pm-cpu, but all builds fail on pm-gpu.

Can reproduce with SMS_D.ne4pg2_ne4pg2.F2010-SCREAMv1.pm-gpu_gnugpu

/pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/mf330/SMS_D.ne4pg2_ne4pg2.F2010-SCREAMv1.pm-gpu_gnugpu.cmake330

Not sure if this is the best past of build error:

6_64/23.9/cuda/12.2/targets/x86_64-linux/include -mcmodel=medium -fconvert=big-endian -ffree-line-length-none -ffixed-line-length-none -fallow-argument-mismatch -fopenmp  -O -O2  -ffree-form  -DUSE_CONTIGUOUS= -c /global/cfs/cdirs\
/e3sm/ndk/repos/ndk_mf_perlmutter-update-cmake-version/components/elm/src/external_models/fates/main/FatesSizeAgeTypeIndicesMod.F90 -o CMakeFiles/lnd.dir/__/__/elm/src/external_models/fates/main/FatesSizeAgeTypeIndicesMod.F90.o
make  -f eamxx/scripts/CMakeFiles/query-cf-database.dir/build.make eamxx/scripts/CMakeFiles/query-cf-database.dir/depend
make[2]: Entering directory '/pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/mf330/ERP_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.pm-gpu_gnugpu.egpu/bld/cmake-bld'
eamxx/scripts/CMakeFiles/query-cf-database.dir/build.make:102: *** target pattern contains no '%'.  Stop.
make[2]: Leaving directory '/pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/mf330/ERP_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.pm-gpu_gnugpu.egpu/bld/cmake-bld'
make[1]: *** [CMakeFiles/Makefile2:2844: eamxx/scripts/CMakeFiles/query-cf-database.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
@ndkeen ndkeen added EAMxx PRs focused on capabilities for EAMxx pm-gpu Perlmutter machine at NERSC (GPU nodes) Machine Files labels Dec 19, 2024
@ndkeen
Copy link
Contributor Author

ndkeen commented Dec 20, 2024

The PR does address this error, but build then hits a diff error.

CMake Error at /pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/bartgol_eamxx_do-not-build-scripts-dir-with-cmake/SMS_D_Ln9.ne4_ne4.F2010-SCREAMv1-noAero.pm-gpu_gnugpu.scream-output-preset-3.esv1/bld/gnugpu/mpich/debug/nothreads/lib64/cmake\
/Kokkos/KokkosConfig.cmake:41 (SET_TARGET_PROPERTIES):
  The link interface of target "CUDA::cudart" contains:

    CUDA::cudart_static_deps

  but the target was not found.  Possible reasons include:

    * There is a typo in the target name.
    * A find_package call is missing for an IMPORTED target.
    * An ALIAS target is missing.

Call Stack (most recent call first):
  cmake/find_dep_packages.cmake:31 (find_package)
  CMakeLists.txt:122 (include)

Two build logs below (one that uses current cmake, and one that uses newer cmake)

/pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/REP_Ld5.ne30pg2_ne30pg2.F2010-SCREAMv1.pm-gpu_gnugpu.scream-L128--scream-output-preset-6.esv1/bld/e3sm.bldlog.241220-092138.gz
/pscratch/sd/n/ndk/e3sm_scratch/pm-gpu/bartgol_eamxx_do-not-build-scripts-dir-with-cmake/REP_Ld5.ne30pg2_ne30pg2.F2010-SCREAMv1.pm-gpu_gnugpu.scream-L128--scream-output-preset-6.esv1/bld/e3sm.bldlog.241220-095956

@ndkeen
Copy link
Contributor Author

ndkeen commented Dec 20, 2024

Luca thinks even kokkos 4.3 has this issue, but 4.5 does not

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
EAMxx PRs focused on capabilities for EAMxx Machine Files pm-gpu Perlmutter machine at NERSC (GPU nodes)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant