Skip to content

ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment

License

Notifications You must be signed in to change notification settings

robin-karlsson0/vice

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment

This repository contain the code to reproduce results in the paper "ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment". We provide instructions for installations and running experiments.

Paper link: https://arxiv.org/abs/2111.12460v3

Summary

  • Installation
    • VISSL
    • MMSegmentation
  • Dataset setup
    • COCO-Stuff164K (for training)
    • Coarse COCO-Stuff164K (for evaluation)
  • Download pretrained ViCE models
  • Run experiments
    • Example 1: COCO ViCE model training (1 GPU)
    • Example 2: COCO linear model training (w. pretrained ViCE model)
    • Example 3: COCO cluster evaluation model training (w. pretrained ViCE model)
    • Example 4: COCO ViCE model training (32 GPUs)

1. Requirements

Confirmed to work for Ubuntu 18.04 and 20.04.

  • Python 3.8
  • Pytorch 1.9.1
  • CUDA 11.1

2. Installation

The training and evaluation code is implemented within two the frameworks VISSL and MMSegmentation. Follow the instructions bellow to install both frameworks with included modifications for running ViCE using the code provided in this repository.

First initialize an environment of your choice using Python 3.8. For example

$ conda create -n vice python=3.8
or
$ pyenv virtualenv 3.8.10 vice
$ source .pyenv/versions/vice/bin/activate

2.1: Install VISSL

Starting from the vice/ directory root, run the following commands in order to install Pytorch, Apex, and VISSL along with required packages.

$ pip3 install torch==1.9.1+cu111 torchvision==0.10.1+cu111 torchaudio==0.9.1 -f https://download.pytorch.org/whl/torch_stable.html
$ pip install apex -f https://dl.fbaipublicfiles.com/vissl/packaging/apexwheels/py38_cu111_pyt191/download.html
$ cd vissl/
$ pip install --progress-bar off -r requirements.txt
$ pip install opencv-contrib-python
$ pip install classy-vision
$ pip install -e .[dev]

The installation is successful if running the bellow line generates no error messages.

$ python -c 'import vissl, apex, cv2'

Return to the root directory

$ cd ../

2.2: Install MMSegmentation

Starting from the vice/ directory root, run the following commands in order to install MMCV and MMSegmentation.

$ pip install mmcv-full==1.3.13 -f https://download.openmmlab.com/mmcv/dist/cu111/torch1.9.0/index.html
$ cd mmsegmentation/
$ pip install -e . 

Return to the root directory

$ cd ../

3: Dataset setup

We provide instructions for setting up the COCO-Stuff164K dataset for training and benchmark experiments, as well as the Cityscapes dataset for benchmark experiments.

3.1: COCO-Stuff164K

  1. Download the COCO-Stuff164K dataset and create a symbolic link inside vissl/datasets/ to the COCO images/ directory. For download instructions, refer to the MMSegmentation documentation. The additional unlabeled training images are downloaded from the official COCO dataset download site.

Expected vanilla dataset directory structure

coco_stuff164k/
    annotations/
        train2017/
            .png
        val2017/
            .png
    images/
        train2017/
            .jpg
        val2017/
            .jpg
        unlabeled2017/
            .jpg
  1. Inside the vissl/datasets directory, symbolically link a path to the COCO dataset images directory.
$ cd vissl/
$ mkdir datasets
$ cd datasets/
$ ln -s PATH-TO-YOUR-coco_stuff164k/images coco
$ cd ../
  1. Make sure that vissl/extra_scripts/datasets/dataset_config.yaml is setup to parse the COCO dataset (default configuration), with the entry for coco as bellow and all other entries set to False.
coco:
    root_path: "datasets/coco"
    use: True
  1. Create filelist_coco.npy which specifies paths to training images for VISSL. Used datasets are specified in the dataset_config.yaml file. Codes for parsing datasets are provided in the extra_scripts/datasets/dataset_parsers/ directory.
$ python extra_scripts/datasets/gen_dataset_filelist.py extra_scripts/datasets/dataset_config.yaml --out-filename filelist_coco.npy

Return to the ViCE-model/ directory.

cd ../

3.2: Coarse COCO-Stuff164K for evaluation

  1. Copy the directory named curated from the ViCE-model/mmsegmentation/tools/convert_datasets/curated directory into your COCO-Stuff164K directory root. The coarse label splits in curated were originally provided in the IIC paper's GitHub repository

  2. Run coco_stuff164k_coarse.py to create the coarse COCO-Stuff164K dataset corresponding to the samples specified in curated/train2017/Coco164kFull_Stuff_Coarse_7.txt and curated/val2017/Coco164kFull_Stuff_Coarse_7.txt. Set --nproc N to as many threads your CPU have.

$ cd mmsegmentation/
$ python tools/convert_datasets/coco_stuff164k_coarse.py PATH-TO-YOUR-coco_stuff164k --nproc 4

The above script first generates new coarse GT label maps with 27 classes and stored in the same annotation folder as the original GT label map. Finally two new directories annotations_coarse and images_coarse are created with symbolic links to the previously generated GT label maps and corresponding RGB images.

Upon completion the COCO-Stuff164K directory structure will look as follows.

coco_stuff164k/
    annotations/
    annotations_coarse/
        train2017/
            .png
        val2017/
            .png
    curated/
    images/
    images_coarse/
        train2017/
            .jpg
        val2017/
            .jpg

All COCO benchmark training runs are configured to read samples from annotations_coarse and images_coarse.

  1. Finally create a symbolic link to the COCO-Stuff164K dataset inside the mmsegmentation/data directory.
$ mkdir data
$ cd data/
$ ln -s PATH-TO-YOUR-coco_stuff164k/ coco_stuff164k_coarse

4. Pretrained models

We are unable to provide pretrained models due to supplementary size limitations.

In the publicly released code we plan to provide the following pretrained models for reproducing high- and low-resolution COCO and Cityscapes experiment results.

  • ViCE models
  • Cluster and linear evaluation models accompanying the ViCE models.

5. Usage

Example 1: COCO ViCE model training (1 GPU)

Run the following command from the ViCE-model/vissl/ directory.

$ python tools/run_distributed_engines.py config=pretrain/vice/vice_1gpu_resnet50_coco_demo.yaml

If you get the error AttributeError: module 'cv2' has no attribute 'ximgproc' try reinstalling OpenCV.

$ pip uninstall opencv-contrib-python opencv-python
$ pip install opencv-contrib-python

The single GPU experiment is provided for demonstration purposes only. The resulting model is not expected to result in a high-performance model. See the 32 GPU experiment example bellow for how to reproduce the provided pretrained ViCE models.

Example 2: COCO linear evaluation model training (w. pretrained ViCE model)

Run the following command from the ViCE-model/mmsegmentation/ directory to reproduce the COCO linear evaluation model using the provided pretrained ViCE model. Note that the following setup is configured to run on a node with 4x V100 32GB GPUs.

$ ./tools/dist_train.sh configs/fcn_linear/fcn_linear_coco-stuff164k_vissl_sc_exp127_ep16_12k.py 4 --work-dir fcn_linear_coco-stuff164k_vissl_sc_exp127_ep16_12k

For systems with fewer GPUs, memory, or different configuration, please modify the configuration to match your system by following the MMSegmentation multi-GPU training documentation

  • Reduce samples per GPU if your run out of memory in the following configuration file mmsegmentation/configs/_base_/datasets/coco-stuff164k_coarse.py.
data = dict(
    samples_per_gpu=16, # <-- Lower
    workers_per_gpu=4,
  • Command to train the linear model on using a single GPU
$ python tools/train.py configs/fcn_linear_coco-stuff164k_exp27.py --work-dir YOUR-EXPERIMENT_DIR_NAME

Example 3: COCO cluster evaluation model training (w. pretrained ViCE model)

Run the following command from the vice/mmsegmentation/ directory to reproduce the COCO cluster evaluation model using the provided pretrained ViCE model. Note that the following setup is configured to run on a node with 1x V100 32GB GPU.

$ ./tools/test.py configs/vice_cluster/cluster_coco-stuff164k_vissl_exp127.py --eval mIoU

Example 4: COCO ViCE model training (32 GPUs)

We train our models on a supercomputer using a job scheduling system. Please reference our setup while configuring the code to work on your system.

  1. Modify job.sh to activate the environment (See 'Setup'). Also specify username and paths.

  2. Similarly modify distributed_train.sh to activate the environment. Specify config file to run (COCO or road scene training).

  3. Launch the job.

$ pjsub job.sh

Results

Representation quality performance of our best model on coarse COCO-Stuff164K.

Cluster evaluation mIoU Acc.
IIC 13.26 51.49
Mod. DeepCluster 13.76 50.79
PiCIE 13.84 48.09
PiCIE+H 14.40 50.00
PiCIE (C 256) 12.42 66.02
ViCE (C 256) 17.98 54.92
ViCE (high-res, C 256) 21.77 64.75
Linear evaluation mIoU Acc.
PiCIE 14.77 54.75
ViCE 25.49 62.78
ViCE (high-res) 29.38 68.16

About

ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published