Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can the Huggingface croissant API endpoint read croissant.json metadata created by this tool? #724

Open
cboettig opened this issue Aug 16, 2024 · 0 comments

Comments

@cboettig
Copy link

Is it possible to get the croissant metadata provided by the HuggingFace Datasets API, e.g. at enpoint https://huggingface.co/api/datasets/{USER}/{REPO}/croissant to reflect metadata from a croissant.json, such as might be generated from this tool?

I am unable to find documentation or examples for this either in this repo or on huggingface datasets documentation.
I have tried simply pushing a croissant.json file to a huggingface datasets repo, but so far the croissant metadata generated automatically by huggingface and served by its API seems only to reflect what is automatically extracted from the README. It's really great to see HuggingFace Datasets embrace Croissant metadata and make datasets discoverable in schema.org platforms like Google Dataset Search, but it quite defeats the point when that metadata does not reflect the more detailed metadata provided in a json-ld file / mlcroissant tool...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant