This repository has been archived by the owner on Jul 23, 2024. It is now read-only.

Extra arguments for hf dataset loading #32

Open
anmoisio opened this issue Aug 31, 2023 · 2 comments

@anmoisio

Hi,

I need to pass extra keyword arguments to the load_dataset function of the datasets library, like this:

from datasets import load_dataset
load_dataset("europarl_bilingual", lang1="en", lang2="fi")

But it seems this is not supported by the API. Is there a way to load this dataset using the API?

@kazemnejad

kazemnejad commented Aug 31, 2023

Thanks for bringing up this feature. We plan to add it in the final version. In the meantime, you can override the Task._load_data_source() method in your Task implementation to pass the extra arguments yourself.

https://github.com/GenBench/genbench_cbt/blob/caa63fbf37733c76b218db5fa08bb2e67ac07b6c/src/genbench/task.py#L466
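
A rough sketch of what such an override could look like, using the dataset and keyword arguments from the question. The Task class below is only a stand-in so the snippet runs on its own; in a real task you would subclass genbench's Task instead. The names EuroparlTask, extra_load_kwargs and loader are made up for illustration and are not part of the library, and the override is assumed to simply return whatever load_dataset returns.

```python
from typing import Any, Callable, Dict, Optional


class Task:
    """Stand-in for genbench's Task so this sketch is self-contained (assumption)."""

    def _load_data_source(self) -> Any:
        raise NotImplementedError


class EuroparlTask(Task):
    # Hypothetical attribute: extra keyword arguments forwarded to load_dataset.
    extra_load_kwargs: Dict[str, Any] = {"lang1": "en", "lang2": "fi"}

    # Hypothetical hook: lets you swap in another loader, e.g. for offline testing.
    loader: Optional[Callable[..., Any]] = None

    def _load_data_source(self) -> Any:
        loader = self.loader
        if loader is None:
            # Imported lazily so the class can be defined without `datasets` installed.
            from datasets import load_dataset

            loader = load_dataset
        # Same call as in the question, with the extra arguments passed through.
        return loader("europarl_bilingual", **self.extra_load_kwargs)
```

With the default loader this ends up calling load_dataset("europarl_bilingual", lang1="en", lang2="fi"). Anything else the base implementation does around loading is not reproduced here, so check the linked source before relying on it.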

@anmoisio

anmoisio commented Sep 2, 2023

Okay, thanks for the help!
