Quantization is an effective technique for compressing neural networks, but post-training quantization of vision-language models remains under-explored: these models are more sensitive to quantization because of their uneven activation distributions. Our work analyzes and implements post-training quantization for the vision-language model CLIP. The analysis covers a range of methods, from basic quantization of CLIP to twin uniform quantization and a Hessian-guided metric for selecting the per-layer scaling factors of activations and weights. We also use five vision-language tasks as benchmarks to further analyze and evaluate the quantized model. Our experiments show that 8-bit quantized CLIP achieves near-lossless prediction accuracy on the ImageNet classification task. A minimal sketch of the basic uniform quantization and scale search is shown below.
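
The sketch below illustrates the kind of per-layer post-training quantization described above: a tensor is mapped to low-bit integers with a uniform step size, and the step size (scaling factor) is chosen by a simple search. The function names and the MSE-based search criterion are illustrative assumptions, not the PTQ4CLIP code base; the actual method uses a Hessian-guided metric for the scale search and twin uniform quantization for skewed activations.

```python
# Illustrative sketch only: plain uniform quantization with an MSE-based scale
# search, standing in for the Hessian-guided search used in the actual method.
import torch

def uniform_quantize(x: torch.Tensor, scale: float, n_bits: int = 8) -> torch.Tensor:
    """Quantize x to signed n_bits integers with uniform step `scale`, then dequantize."""
    qmin, qmax = -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1
    q = torch.clamp(torch.round(x / scale), qmin, qmax)
    return q * scale

def search_scale(x: torch.Tensor, n_bits: int = 8, n_candidates: int = 100) -> float:
    """Pick the scale that minimizes the quantization error on x.
    (Plain MSE is used here to keep the sketch self-contained.)"""
    max_val = x.abs().max().item()
    qmax = 2 ** (n_bits - 1) - 1
    best_scale, best_err = max_val / qmax, float("inf")
    for i in range(1, n_candidates + 1):
        scale = (max_val * i / n_candidates) / qmax
        err = torch.mean((x - uniform_quantize(x, scale, n_bits)) ** 2).item()
        if err < best_err:
            best_scale, best_err = scale, err
    return best_scale

# Example: quantize the weight tensor of one layer to 8 bits.
w = torch.randn(512, 512)
w_q = uniform_quantize(w, search_scale(w, n_bits=8), n_bits=8)
```

In the full method, this per-layer scale search is applied to both weights and activations, with the Hessian-guided metric replacing MSE so that the reconstruction error is weighted by its effect on the task loss.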