Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ValueError: Length of values (757) does not match length of index (756) #29

Open
mahmoudibrahim98 opened this issue Nov 4, 2020 · 1 comment

Comments

@mahmoudibrahim98
Copy link

mahmoudibrahim98 commented Nov 4, 2020

  • DataSynthesizer version:
  • Python version:
  • Operating System:

Description

I'm trying to use the Data generator in correlated attribute mode.I tried with many datasets and everything works fine. However, for some datasets, I'm getting the following error when I run the DataGenerator:
ValueError: Length of values (757) does not match length of index (756)

Note that the DataDescriber works fine without raising an error. However, for the datasets that I weren't able to generate syntehtic data, I checked their description file, and in all of them, the number of attributes in the Bayesian network is less than the number of attributes in the whole datasets.

What I Did



![image](https://user-images.githubusercontent.com/73237782/98086146-fe8c7180-1e86-11eb-8313-d6d32b1b59f0.png)

@haoyueping
Copy link
Collaborator

Hi, I just reviewed all implementations of AbstractAttribute.generate_values_as_candidate_key(self, n), the number of generated values is set correctly. Can you give me more information about this error? For example, what is the data type of this column, is it integer, string, datetime, or SSN? Can you print out a few generated values?

BTW, is the DataSynthesizer the latest version?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants