Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CopulaGAN : Out of memory error #1382

Closed
harsh-sengar opened this issue Apr 19, 2023 · 2 comments
Closed

CopulaGAN : Out of memory error #1382

harsh-sengar opened this issue Apr 19, 2023 · 2 comments
Labels
bug Something isn't working resolution:WAI The software is working as intended

Comments

@harsh-sengar
Copy link

Environment Details

Please indicate the following details about the environment in which you found the bug:

  • SDV version: 1.0.0
  • Python version: 3.9
  • Operating System: Linux

Error Description

It is giving out of memory error even with very small dataset of singular table of 50k record even with very large AWS EC2 instance r5a.12xlarge and with different GPU machines. I am not able to use CUDA here.

File ~/miniconda3/envs/sdv_0172/lib/python3.9/site-packages/torch/_tensor.py:396, in Tensor.backward(self, gradient, retain_graph, create_graph, inputs)
387 if has_torch_function_unary(self):
388 return handle_torch_function(
389 Tensor.backward,
390 (self,),
(...)
394 create_graph=create_graph,
395 inputs=inputs)
--> 396 torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)

File ~/miniconda3/envs/sdv_0172/lib/python3.9/site-packages/torch/autograd/init.py:173, in backward(tensors, grad_tensors, retain_graph, create_graph, grad_variables, inputs)
168 retain_graph = create_graph
170 # The reason we repeat same the comment below is that
171 # some Python versions print out the first line of a multi-line function
172 # calls in the traceback and some print out the last line
--> 173 Variable.execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
174 tensors, grad_tensors
, retain_graph, create_graph, inputs,
175 allow_unreachable=True, accumulate_grad=True)

RuntimeError: [enforce fail at alloc_cpu.cpp:66] . DefaultCPUAllocator: can't allocate memory: you tried to allocate 3530813440 bytes. Error code 12 (Cannot allocate memory)

@harsh-sengar harsh-sengar added bug Something isn't working new Automatic label applied to new issues labels Apr 19, 2023
@npatki
Copy link
Contributor

npatki commented Apr 20, 2023

Hi @harsh-sengar, thanks for filing this issue. You mention there are 50K records in your dataset. Can you tell me how many columns it has? This will help us debug.

BTW a very similar issue was filed just a few minutes before you at #1381. My reply on that issue may also be relevant to you now: I wonder what your use case is and whether you can consider using one of the other SDV synthesizers for your use case. In particular I recommend the Gaussian Copula Synthesizer.

@npatki npatki added under discussion Issue is currently being discussed and removed new Automatic label applied to new issues labels Apr 20, 2023
@npatki
Copy link
Contributor

npatki commented May 3, 2023

Hi @harsh-sengar curious if you are still having an issue with this and perhaps try this method.

I'll close this issue off since it's been a few weeks. Please feel free to reply if there is more to discuss and I can reopen the issue for investigation.

@npatki npatki closed this as completed May 3, 2023
@npatki npatki added resolution:WAI The software is working as intended and removed under discussion Issue is currently being discussed labels May 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working resolution:WAI The software is working as intended
Projects
None yet
Development

No branches or pull requests

2 participants