-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: add tensorflow and pytorch CUDA version tests for GPU image build #452
Conversation
# Raise exception if CUDA is not detected | ||
if 'cuda' not in package_build: | ||
raise Exception("Pytorch is installed without CUDA support for GPU image build.") | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe also do a print here "Pytorch is built with CUDA support"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe also do a print here "Pytorch is built with CUDA support"
Good point, updated exception message
ARG SAGEMAKER_DISTRIBUTION_IMAGE | ||
FROM $SAGEMAKER_DISTRIBUTION_IMAGE | ||
|
||
ARG MAMBA_DOCKERFILE_ACTIVATE=1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is this ARG for?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
afaik, these 3 lines are used to activate test environment, we have it in other tests too: https://github.com/aws/sagemaker-distribution/blob/main/test/test_artifacts/v1/autogluon.test.Dockerfile#L1-L4
Issue #, if available:
In 1.8.0 GPU image, we installed tensorflow-cpu version by mistake
Description of changes:
To get rid of such issue, add a unit test to validate tensorflow and pytorch cuda version is installed in GPU image
Test:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.