Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to train inf-dit #28

Open
billfjj opened this issue Oct 29, 2024 · 9 comments
Open

How to train inf-dit #28

billfjj opened this issue Oct 29, 2024 · 9 comments

Comments

@billfjj
Copy link

billfjj commented Oct 29, 2024

I want to fine tune this model, but i am unable to run your training code. Could you please provide a detailed description of the dataset fromat used for your training?

@ivcylc
Copy link

ivcylc commented Nov 2, 2024

Have you succeeded in training it?
I am wondering where can i change the batch_size

@yzy-thu
Copy link
Contributor

yzy-thu commented Nov 2, 2024

I want to fine tune this model, but i am unable to run your training code. Could you please provide a detailed description of the dataset fromat used for your training?

webdataset with .jpg

@yzy-thu
Copy link
Contributor

yzy-thu commented Nov 2, 2024

Have you succeeded in training it? I am wondering where can i change the batch_size

scripts/ds_config_zero.json

@ivcylc
Copy link

ivcylc commented Nov 2, 2024

Have you succeeded in training it? I am wondering where can i change the batch_size

scripts/ds_config_zero.json

Thanks, and i found the loss when training with bs=2 is large than that with bs=1
Additionally, insterestly, when bs=2, the predicted seems excatedly half of that when bs=1, which cause error

@billfjj
Copy link
Author

billfjj commented Nov 4, 2024

I want to fine tune this model, but i am unable to run your training code. Could you please provide a detailed description of the dataset fromat used for your training?

webdataset with .jpg

I built a tar file with only jpg, but after changing the path of this tar file to the configuration file, the training will report an error that the dataset class does not have the len attribute.

@BrooksXiaoxi
Copy link

Have you succeeded in training it? I am wondering where can i change the batch_size

scripts/ds_config_zero.json

Thanks, and i found the loss when training with bs=2 is large than that with bs=1 Additionally, insterestly, when bs=2, the predicted seems excatedly half of that when bs=1, which cause error

Could you please tell me how you prepared your training data and how you configured the model to train? I am having trouble fine-tuning my own model on my dataset. I would appreciate your response, thanks!!!

@sunrccc
Copy link

sunrccc commented Dec 6, 2024

Have you succeeded in training it? I am wondering where can i change the batch_size

scripts/ds_config_zero.json

Thanks, and i found the loss when training with bs=2 is large than that with bs=1 Additionally, insterestly, when bs=2, the predicted seems excatedly half of that when bs=1, which cause error

Could you please tell me how you prepared your training data and how you configured the model to train? I am having trouble fine-tuning my own model on my dataset. I would appreciate your response, thanks!!!

May I ask if you have solved this problem? Could you give me some details?

@sunrccc
Copy link

sunrccc commented Dec 6, 2024

May I ask if you have solved this problem? Could you give me some details?

May I ask if you have solved this problem? Could you give me some details?

@sunrccc
Copy link

sunrccc commented Dec 6, 2024

I want to fine tune this model, but i am unable to run your training code. Could you please provide a detailed description of the dataset fromat used for your training?

webdataset with .jpg

Sorry, I don't quite understand your answer. Could you explain it in detail? Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants