-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(gpu): implement PBSs that use new Hopper's features #987
Conversation
e9813dc
to
4c6bb0f
Compare
3b6c448
to
268d2f0
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @pdroalves! Thanks a lot for this! I haven't looked in details at the tbc PBS code itself, would be way too long for me to do so, but here's a my review on the rest.
backends/tfhe-cuda-backend/cuda/include/programmable_bootstrap_multibit.h
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_cg_classic.cuh
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_classic.cu
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_classic.cu
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_classic.cu
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_classic.cu
Outdated
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_classic.cu
Outdated
Show resolved
Hide resolved
backends/tfhe-cuda-backend/cuda/src/pbs/programmable_bootstrap_multibit.cu
Outdated
Show resolved
Hide resolved
Since TBC PBS will only run (and be compiled) on H100s, I rebased this branch on top of |
@slab-ci gpu_test |
@slab-ci integer_gpu_full_bench |
@agnesLeroy Looks like this breaks the CI. Should I undo this rebase? |
@pdroalves you should target main instead of David's branch, let's run the H100 tests manually instead of in the CI exceptionally. |
Indeed. I rebased on |
601baaf
to
e59909c
Compare
b34e148
to
c757fb3
Compare
c757fb3
to
60f00d7
Compare
60f00d7
to
f08da6e
Compare
f08da6e
to
c80d5fc
Compare
cluster and distributed shared memory
e59909c
to
20e11ea
Compare
…cluster and distributed shared memory
@agnesLeroy This branch passes integer tests on H100. |
7c10dae
into
chre/gpu/integer_mult_gpu_params
https://github.com/zama-ai/tfhe-rs-internal/issues/450
https://github.com/zama-ai/concrete-internal/issues/345
closes: please link all relevant issues
PR content/description
Check-list: