Name		Name	Last commit message	Last commit date
parent directory ..
LICENSE		LICENSE
README.md		README.md
bpe_simple_vocab_16e6.txt.gz		bpe_simple_vocab_16e6.txt.gz
chelsea.png		chelsea.png
cifar100_classes.txt		cifar100_classes.txt
clip.py		clip.py
imagenet_classes.txt		imagenet_classes.txt
simple_tokenizer.py		simple_tokenizer.py

README.md

CLIP

Input

(Image from https://scikit-image.org/)

Output

Zero-Shot Prediction

### predicts the most likely top5 labels among input textual labels ###
    a cat: 98.40%
  a human: 1.35%
    a dog: 0.24%

Requirements

This model requires additional module.

pip3 install ftfy

Usage

Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.

For the sample image,

$ python3 clip.py

If you want to specify the input image, put the image path after the --input option.

$ python3 clip.py --input IMAGE_PATH

You can use --text option if you want to specify a subset of the texture labels to input into the model.
Default labels is "a human", "a dog" and "a cat".

$ python3 clip.py --text "a human" --text "a dog" --text "a cat"

If you want to load a subset of the texture labels you input into the model from a file, use the --desc_file option.

$ python3 clip.py --desc_file imagenet_classes.txt

By adding the --model_type option, you can specify model type which is selected from "ViTB32", "RN50". (default is ViTB32)

$ python3 clip.py --model_type ViTB32

Reference

CLIP

Framework

Pytorch

Model Format

ONNX opset=11

Netron

ViT-B32-encode_image.onnx.prototxt
ViT-B32-encode_text.onnx.prototxt
RN50-encode_image.onnx.prototxt
RN50-encode_text.onnx.prototxt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clip

clip

README.md

CLIP

Input

Output

Requirements

Usage

Reference

Framework

Model Format

Netron

Files

clip

Directory actions

More options

Directory actions

More options

Latest commit

History

clip

Folders and files

parent directory

README.md

CLIP

Input

Output

Requirements

Usage

Reference

Framework

Model Format

Netron