Standard FT does not work #30

Open
YaNgZhAnG-V5 opened this issue Jan 25, 2024 · 4 comments

Comments

@YaNgZhAnG-V5

Hi, thanks for the great work!

When I tried to run your baseline evaluation script with:

TASK=SST-2 K=16 SEED=42 BS=8 LR=1e-5 MODEL=roberta-large bash finetune.sh

the script breaks during evaluation with this error message:
TypeError: repeat(): argument 'repeats' (position 1) must be tuple of ints, but found element of type NoneType at pos 0

Can you check the standard FT script to see if there is any issue?

@gaotianyu1350
Member

Hi, are you using multi-gpu setup? Also, can you share your pytorch/transformers versions?

@YaNgZhAnG-V5
Author

Thanks for getting back to me! I am using a single-GPU setup. For the environment, I am using torch 2.1.2+cu118 and transformers 4.37.1.

@gaotianyu1350
Member

Hi, can you try transformers==4.28.1? That is the version of transformers we used to test the codebase.

@aparna-aketi

I had the same issue with transformers==4.44.2. The problem is in the get_eval_dataloader function of the Trainer class: the returned eval_dataloader has its batch_size attribute set to None, while eval_dataloader.batch_sampler.batch_size holds the correct batch size. I fixed it by changing the batch_size variable to dataloader.batch_sampler.batch_size in the prediction_loop() function of Transformers' Trainer class. I am not sure if there is a better way to fix this bug without modifying the transformers library.
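
In case it helps others: here is a minimal, untested sketch of how the same fix could be applied without editing the installed transformers package, by subclassing Trainer and restoring batch_size on the eval dataloader. The class name is illustrative, and it assumes the failing code reads dataloader.batch_size as described above:

```python
from transformers import Trainer


class BatchSizeFixTrainer(Trainer):
    def get_eval_dataloader(self, eval_dataset=None):
        dataloader = super().get_eval_dataloader(eval_dataset)
        # In recent transformers versions the returned DataLoader can have
        # batch_size == None while batch_sampler.batch_size holds the real
        # value; prediction_loop() then passes None to tensor.repeat() and
        # raises the TypeError reported above.
        if dataloader.batch_size is None and dataloader.batch_sampler is not None:
            # PyTorch's DataLoader forbids assigning batch_size after
            # __init__, so bypass its __setattr__ guard. A hack, but it
            # keeps the fix out of the library source.
            object.__setattr__(
                dataloader, "batch_size", dataloader.batch_sampler.batch_size
            )
        return dataloader
```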
