Is the trained projection head available? #199

lkshrsch · 2022-03-30T19:41:39Z

I am interested in downloading a pre-trained simCLR model with the projection head, to retrieve the latent features z, upon which the contrastive loss was applied.
Is this layer + pre-trained weights available somewhere?

chentingpc · 2022-04-02T23:14:04Z

yes, the projection head weights should also be included in gs://simclr-checkpoints/simclrv2/pretrained/...

lkshrsch · 2022-04-04T16:03:15Z

In the github README that link is under the description

"Pretrained SimCLRv2 models (with linear eval head):"

I assumed "with linear eval head" refers to the classification layer for ImageNet,

but downloading the model r50_1x_sk0 from:

https://console.cloud.google.com/storage/browser/simclr-checkpoints/simclrv2/pretrained/r50_1x_sk0?pageState=(%22StorageObjectListTable%22:(%22f%22:%22%255B%255D%22))&prefix=&forceOnObjectsSortingFiltering=false

the model output is of dimension 2048, which could be either the output from the resnet50, or the output of the projection head.

so to confirm:
are these the features from the projection head z = g(h) (as described in the simCLR paper, Figure 2)?
or from resNet50: h = f(x) (as described in the simCLR paper, Figure 2)?
or the linear evaluation head for classification (as described in the github README, which should be logits of dimension (1000) for ImageNet )?

Thanks!

chentingpc · 2022-04-04T19:07:44Z

Both projection head's and supervised linear head's weights are available in the checkpoints. I suppose you're using hub module? If so, you could choose output by providing signature that's available in module.get_output_info_dict(), I listed the results below. Note that the projection head's output is not included, so in order to get that, you may need to run the tf code with the checkpoint loaded to build a new graph.

{'block_group1': <hub.ParsedTensorInfo shape=(None, None, None, 256) dtype=float32 is_sparse=False>, 'block_group2': <hub.ParsedTensorInfo shape=(None, None, None, 512) dtype=float32 is_sparse=False>, 'block_group3': <hub.ParsedTensorInfo shape=(None, None, None, 1024) dtype=float32 is_sparse=False>, 'block_group4': <hub.ParsedTensorInfo shape=(None, None, None, 2048) dtype=float32 is_sparse=False>, 'default': <hub.ParsedTensorInfo shape=(None, 2048) dtype=float32 is_sparse=False>, 'final_avg_pool': <hub.ParsedTensorInfo shape=(None, 2048) dtype=float32 is_sparse=False>, 'initial_conv': <hub.ParsedTensorInfo shape=(None, None, None, 64) dtype=float32 is_sparse=False>, 'initial_max_pool': <hub.ParsedTensorInfo shape=(None, None, None, 64) dtype=float32 is_sparse=False>, 'logits_sup': <hub.ParsedTensorInfo shape=(None, 1000) dtype=float32 is_sparse=False>}

ilia10000 · 2022-04-22T03:46:37Z

I'm struggling to actually get the projection representations and still not quite certain what to do based on the previous comments in this thread. Does anyone have a minimal working example of loading the pre-trained model, pushing an input through, and getting the representation from the projection head?

collinskatie · 2023-04-03T23:40:28Z

Thanks for the great repo! I wanted to follow-up to explore whether this issue has been reconciled?

I'm also trying to access the projection representations. Specifically, I'd like to be able to pass in an image and get out just the representation (prior to the class-level logits). What layer should I use for this?

If I load a model as follows:

saved_model_path = 'gs://simclr-checkpoints-tf2/simclrv2/pretrained/r50_1x_sk0/saved_model/'
saved_model = tf.saved_model.load(saved_model_path)

The keys available when running inference on a new image as follows:

saved_model(image, trainable=False).keys()

dict_keys(['logits_sup', 'block_group3', 'block_group4', 'final_avg_pool', 'block_group2', 'block_group1', 'initial_max_pool', 'initial_conv'])

Which of these is the key associated with the representation? final_avg_pool?

Thank you for any insight @chentingpc or others!

chentingpc · 2023-04-04T19:56:05Z

Hi final_avg_pool is the output of the resnet which is used for linear probing. hope that helps

collinskatie · 2023-04-05T10:35:14Z

Thank you @chentingpc !! That's great to know!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is the trained projection head available? #199

Is the trained projection head available? #199

lkshrsch commented Mar 30, 2022

chentingpc commented Apr 2, 2022

lkshrsch commented Apr 4, 2022

chentingpc commented Apr 4, 2022

ilia10000 commented Apr 22, 2022

collinskatie commented Apr 3, 2023 •

edited

Loading

chentingpc commented Apr 4, 2023

collinskatie commented Apr 5, 2023

Is the trained projection head available? #199

Is the trained projection head available? #199

Comments

lkshrsch commented Mar 30, 2022

chentingpc commented Apr 2, 2022

lkshrsch commented Apr 4, 2022

chentingpc commented Apr 4, 2022

ilia10000 commented Apr 22, 2022

collinskatie commented Apr 3, 2023 • edited Loading

chentingpc commented Apr 4, 2023

collinskatie commented Apr 5, 2023

collinskatie commented Apr 3, 2023 •

edited

Loading