Skip to content
This repository has been archived by the owner on May 18, 2023. It is now read-only.

Hoppity dataset generation #11

Open
msintaha opened this issue Oct 10, 2022 · 1 comment
Open

Hoppity dataset generation #11

msintaha opened this issue Oct 10, 2022 · 1 comment

Comments

@msintaha
Copy link

msintaha commented Oct 10, 2022

Hi,

I'm trying to run the data generation script using the cooked graphs from my dataset. However, i see in your code that you use something called hoppity_cg.tar.gz (https://github.com/google-research/plur/blob/main/plur/stage_1/hoppity_single_ast_diff_dataset.py#L61) to get some json files. What is this used for? This was not available in the hoppity repo - is this some pre-processing that you have done on your end?

@nashid
Copy link

nashid commented Oct 11, 2022

Can you please help us understand how you generated the hoppity_cg.tar.gz file? We dont find this file in the hoppity repository.

      'hoppity_cg.tar.gz': {
          'url': 'https://drive.google.com/u/0/uc?id=1JdXaehWO4UocjXqIXzWtUmVpRWWBtqmE&export=download',
          'sha1sum': '9f4a635408f86974a8e9739769d3ed2a52c2b907',
      }

How is this file generated? Can you please provide us with the script to generate intermediate files as part of the artefact?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants