-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enhance Dataloader Configuration #60
Comments
Discussed with @john-zielke-snkeos:
dataloaders:
train:
batch_size: 4
num_workers: 8 and the rest was already set by default. If a user needs a completely different DataLoader, they can go ahead and define |
@ibro45 Agree with this. This would now bring us into the territory of templates where we set some default object for If we do this for data loaders and there is a default expected behaviour for it that our user can expect, should we not do this for other items in the config as well? For instance, |
We can also extend this templating and have several templates for different workflows. Say we want to have a classification workflow, we can set templates for a few different models and losses. We can set up a simple CLI interface for the user to generate a configuration that provides selection between these different templates and spits out a final config that they just have to configure their data for (These templates won't get assigned by default unlike the We can use something like Cookiecutter (https://github.com/cookiecutter/cookiecutter) to map user CLI to pre-set templates This will be a separate feature ofcourse and should go in a separate issue if we agree to do it but templating can provide us a lot of extra features without comprising on the dynamicism of the library |
Seems like Let's discuss the defaults in the future PR. |
🚀 Feature Request
A lot of dataloader arguments are mentioned in
system
parameters. For example,batch_size
,drop_last_batch
.Would be good to have a way to set other parameters of the dataloader such as
prefetch_factor
,persist_workers
and potentially other future additions to this.🛰 Alternatives
Maybe we can add a partial dataloader to the system config? and give it
dataset
andsampler
later?The text was updated successfully, but these errors were encountered: