Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build (container) doesn't work. #160

Open
zjuercz opened this issue Oct 28, 2024 · 3 comments
Open

Build (container) doesn't work. #160

zjuercz opened this issue Oct 28, 2024 · 3 comments

Comments

@zjuercz
Copy link

zjuercz commented Oct 28, 2024

My desktop has an NVIDIA GeForce RTX 4090 GPU, and the system is Ubuntu 22.04.

~$ nvidia-smi
Mon Oct 28 14:21:03 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03 Driver Version: 560.35.03 CUDA Version: 12.6 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 4090 Off | 00000000:01:00.0 On | Off |
| 0% 45C P8 34W / 450W | 510MiB / 24564MiB | 2% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 2409 G /usr/lib/xorg/Xorg 259MiB |
| 0 N/A N/A 2555 G /usr/bin/gnome-shell 79MiB |
| 0 N/A N/A 10907 G ...seed-version=20241027-180153.472000 138MiB |
| 0 N/A N/A 14838 G gnome-control-center 6MiB |
+-----------------------------------------------------------------------------------------+

I have installed Podman but cannot use Podman to complete the compile.

~$ podman --version
podman version 3.4.4

The error message is as follows,
~/SIMULATeQCD$ ./simulate_qcd.sh list
enabled
enabled
It looks like you are running podman as root. You will need to disable this before continuing with 'systemctl disable --now podman podman.socket'

@clarkedavida
Copy link
Collaborator

hi, sorry about the slow response.

what happens if you systemctl disable --now podman podman.socket?

@zjuercz
Copy link
Author

zjuercz commented Oct 31, 2024

Hi, Thank you for your explanation.

This is the output after running the commands in the response.

SIMULATeQCD$ ./simulate_qcd.sh list
disabled
disabled
/usr/bin/podman
Group ID: 1000
User ID: 1000
RHEL_VERSION=latest
CUDA_VERSION=latest
####################################

BUILD TARGETS

####################################
7linkprof
axpy
bulkIndexerTest
cgProf
checkConf
checkRand
colorElectricCorrTest
colorElectricMagneticCloverBenchmark
compressionTest
condensateTest
configConverter
confReadWriteTest
correlatorNorm
correlatorTest
cudaAwareMPITest
cudaIpcTest
dotProductTest
dslashImagmuTest
dslashMultiTest
dslashTest
fieldStrengthTensorBenchmark
forceProfiling
gaugeFixing
generalFunctorTest
generateQuenched
gfixplcTest
gfixTestMulti
gfixTestSingle
gradientFlow
gradientFlow_example
gradientFlowTest
halfPrecMathTest
haloTest
hbor_benchmark
hbor_multiple_test
hbor_single_test
hisqForce
hisqSmearingImagmuTest
hisqSmearingMulti
hisqSmearingMultiCreate
hisqSmearingTest
hypSmearingTest
inverterProf
inverterTest
linkPathTest
maximalCenterGaugeFixing
maximalCenterGaugeFixingTest
measureHadrons
measureHadronsTest
memManTest
mixedPrecInverterTest
multiRHSProf
multiRHSProf_half
plaquette
polSuscRenorm
polyakovLoop
pureGaugeHmcTest
rhmc
rhmcTest
rndMultipleTest
rndSingleTest
sampleTopology
spinorHaloTest
stackedSpinorTest
sublatticeUpdates
taylorMeasurementTest
triad
utimesUdaggerTest
weinbergTopTest
wilsonLinesCorrelatorMultiGPUStacked
wilsonLinesCorrelatorTest
wilsonLinesFields
wilsonLinesFieldsShared
wilsonloop

SIMULATeQCD$ chmod +x ./simulate_qcd.sh && ./simulate_qcd.sh build
disabled
disabled
/usr/bin/podman
Group ID: 1000
User ID: 1000
RHEL_VERSION=latest
CUDA_VERSION=latest
subuid and subgid entries already exist for zjuercz
./simulate_qcd.sh: 行 279: curl: 未找到命令
Using latest RHEL version:
./simulate_qcd.sh: 行 298: curl: 未找到命令
Using latest CUDA version
Please set the CUDA_VERSION environment variable.

SIMULATeQCD$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Thu_Sep_12_02:18:05_PDT_2024
Cuda compilation tools, release 12.6, V12.6.77
Build cuda_12.6.r12.6/compiler.34841621_0

@clarkedavida
Copy link
Collaborator

that superficially looks like it worked. did it indeed work?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants