Question regarding dataset #38

MinaBasem · 2024-04-21T01:45:09Z

Hello Philip,

I am working on a GUI mock data generation project that (as the name states) generates fake data such as first name, last name, countries, etc.

I was looking for a more realistic way to generate names that match their corresponding countries and came across your repository. I have tried tinkering with the API, but the execution time is too long for mass data generation.

My question is whether there is a way to request multiple names in a single API call. If not, I am considering using the original dataset to create my own algorithm without needing API calls. However, I wanted to check whether the 3.3GB file contains duplicate rows and, if so, what kind of duplicate data it has (I currently cannot download the dataset on my machine).

The point is that if there is a significant amount of duplicate data, I might try to shrink the file by removing as many duplicates as I can so that the algorithm can run locally, which would be much faster than waiting for API responses.
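A minimal sketch of that deduplication step, assuming the dataset is a plain-text file with one record per line (the actual layout of the 3.3GB file is not confirmed here). The function names `unique_lines` and `dedupe_file` are hypothetical, chosen only for illustration. The file is streamed line by line, and only a short digest of each record is kept in memory rather than the record itself:

```python
import hashlib
from typing import Iterable, Iterator


def unique_lines(lines: Iterable[str]) -> Iterator[str]:
    """Yield each distinct line once, in first-seen order."""
    seen = set()
    for line in lines:
        record = line.rstrip("\r\n")
        # Keep a 16-byte digest instead of the full text to limit memory use.
        digest = hashlib.blake2b(record.encode("utf-8"), digest_size=16).digest()
        if digest not in seen:
            seen.add(digest)
            yield record


def dedupe_file(src_path: str, dst_path: str) -> int:
    """Stream src_path into dst_path without exact duplicate lines.

    Returns the number of lines written. Assumes UTF-8 text with one
    record per line, which is an assumption about the dataset format.
    """
    kept = 0
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for record in unique_lines(src):
            dst.write(record + "\n")
            kept += 1
    return kept
```

This only removes rows that are identical character for character; rows that differ in casing or whitespace would need to be normalized first.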

Regards.
