Implemented hook for measuring GPU times #502

Merged
merged 1 commit into from
Nov 13, 2024

brownbaerchen
Contributor

I split the timings off from the default hook and implemented a version for GPU timings. It works with CUDA events; see the CuPy documentation for details on how to measure GPU timings.

Note that the time elapsed on the CPU is not, in general, correlated with the time it takes to run an operation on the GPU. Some operations, such as memory transfers between the two, implicitly synchronise device and host, which results in similar CPU and GPU timings in pySDC. In my experiments, I did not find a meaningful difference in average timing_step between the two hooks. Still, I think we need a hook that makes this reliable by synchronising explicitly.
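For readers unfamiliar with the pattern, here is a minimal sketch of timing with CUDA events and an explicit synchronisation. It is not the hook added in this PR: `StepTimer` and `work` are hypothetical names used only for illustration, and the sketch assumes it should fall back to the host wall clock when CuPy or a CUDA device is unavailable.

```python
import time

try:
    import cupy as cp

    HAVE_GPU = cp.cuda.runtime.getDeviceCount() > 0
except Exception:  # CuPy not installed or no usable CUDA device
    cp = None
    HAVE_GPU = False


class StepTimer:
    """Hypothetical helper for illustration, not the hook from this PR."""

    def __init__(self, use_gpu=HAVE_GPU):
        self.use_gpu = use_gpu
        self._start = None

    def start(self):
        if self.use_gpu:
            # Enqueue a start event on the current stream
            self._start = cp.cuda.Event()
            self._start.record()
        else:
            self._start = time.perf_counter()

    def stop(self):
        """Return the elapsed time in seconds since start()."""
        if self.use_gpu:
            end = cp.cuda.Event()
            end.record()
            # Explicit synchronisation: block the host until the GPU
            # has reached the end event, so the measurement is reliable
            end.synchronize()
            # get_elapsed_time returns milliseconds
            return cp.cuda.get_elapsed_time(self._start, end) / 1e3
        return time.perf_counter() - self._start


def work(n=200_000):
    """Stand-in workload (assumption): a sum of squares."""
    if HAVE_GPU:
        x = cp.arange(n, dtype=cp.float64)
        return (x * x).sum()  # kernels are launched asynchronously
    return sum(i * i for i in range(n))


timer = StepTimer()
timer.start()
work()
elapsed = timer.stop()
print(f"elapsed: {elapsed:.6f} s (CUDA events used: {timer.use_gpu})")
```

Without the `end.synchronize()` call, the host could read the timer before the GPU has finished, which is why host-side wall clock times only match GPU times when something else happens to synchronise device and host.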

@pancetta pancetta merged commit 31d83a2 into Parallel-in-Time:master Nov 13, 2024
90 checks passed
@brownbaerchen brownbaerchen deleted the timing_hook branch November 13, 2024 15:23
2 participants