Are you using an environment where you have installed the transformers library from either the "gradients" or "deployment" folder? I believe this issue is caused by a mismatch between the quantization code and the modified transformers library for gradient computation / deployment. At the moment, the quantization code isn't compatible with these environments, so to run simulated quantization you need to install transformers using pip into the kvquant conda environment.
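A minimal sketch of that suggested setup, assuming the conda environment is named kvquant and that no particular transformers version is required beyond what the repo's requirements specify (adjust names and versions to your local setup):

```bash
# Work inside the kvquant conda environment (name assumed here)
conda activate kvquant

# Remove any transformers build that was installed from the repo's
# gradients/ or deployment/ folders
pip uninstall -y transformers

# Install the stock transformers package from PyPI; pin the version the
# repo's requirements file expects, if one is listed there
pip install transformers
```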
When I try:
CUDA_VISIBLE_DEVICES=0 python llama_simquant.py --abits 4 --nsamples 16 --seqlen 2048 --nuq --fisher --quantize --include_sparse --sparsity-threshold 0.99 --quantizer_path quantizers.pickle ;
I get this error:
AttributeError: 'LlamaModel' object has no attribute 'split_gpus'
What is the problem?
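One way to narrow this down, given the mismatch described above, is to confirm which transformers build the environment is actually resolving. These are generic Python/pip checks, not commands from this repo:

```bash
# Print the version and the install path of the transformers module that
# Python imports; a path under site-packages suggests a stock pip install,
# while a path inside the repo suggests one of the modified builds
python -c "import transformers; print(transformers.__version__, transformers.__file__)"

# pip's record of the installed package, including its location
pip show transformers
```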