FPTrans: Feature-Proxy Transformer for Few-Shot Segmentation

Jian-Wei Zhang, Yifan Sun, Yi Yang, Wei Chen

This repository is the PyTorch Implementation. One can find the PaddlePaddle implementation from here.

Installation

Create a virtual environment and install the required packages.

conda create -n fptrans python=3.9.7
conda activate fptrans
conda install numpy=1.21.2
conda install pytorch==1.10.0 torchvision==0.11.1 cudatoolkit=11.3 -c pytorch
conda install tqdm scipy pyyaml
pip install git+https://github.com/IDSIA/[email protected]
pip install dropblock pycocotools opencv-python

Put following bash function in ~/.bashrc for simplifying the CUDA_VISIBLE_DEVICES.

function cuda()
{
    if [ "$#" -eq 0 ]; then
        return
    fi
    GPU_ID=$1
    shift 1
    CUDA_VISIBLE_DEVICES="$GPU_ID" $@
}

Now we can use cuda 0 python for single GPU and cuda 0,1 python for multiple GPUs.

Getting Started

See Preparing Datasets and Pretrained Backbones for FPTrans

Usage for inference with our pretrained models

Download the checkpoints of our pretrained FPTrans from GoogleDrive or BaiduDrive (Code: FPTr), and put the pretrained models (the numbered folders) into ./output/.

Datasets	Backbone	#Shots	Experiment ID (Split 0 - Split 3)
PASCAL-5i	ViT-B/16	1-shot	1,2,3,4
	DeiT-B/16	1-shot	5,6,7,8
	DeiT-S/16	1-shot	9,10,11,12
	DeiT-T/16	1-shot	13,14,15,16
	ViT-B/16	5-shot	17,18,19,20
	DeiT-B/16	5-shot	21,22,23,24
COCO-20i	ViT-B/16	1-shot	25,26,27,28
	DeiT-B/16	1-shot	29,30,31,32
	ViT-B/16	5-shot	33,34,35,36
	DeiT-B/16	5-shot	37,38,39,40

Run the test command:

# PASCAL ViT 1shot
cuda 0 python run.py test with configs/pascal_vit.yml exp_id=1 split=0

# PASCAL ViT 5shot
cuda 0 python run.py test with configs/pascal_vit.yml exp_id=17 split=0 shot=5

# COCO to PASCAL 1shot (cross domain, no need for training, just test)
# Load model trained from COCO, test on PASCAL 
# Notice: the code will use different splits from PASCAL-5i to avoid test 
#         classes (PASCAL) existed in training datasets (COCO).
cuda 0 python run.py test with configs/coco2pascal_vit.yml exp_id=29 split=0

Usage for training from scratch

Run the train command (adjust batch size bs for adapting the GPU memory):

# PASCAL 1shot
cuda 0 python run.py train with split=0 configs/pascal_vit.yml

# PASCAL 5shot
cuda 0,1 python run.py train with split=0 configs/pascal_vit.yml shot=5

# COCO 1shot
cuda 0,1 python run.py train with split=0 configs/coco_vit.yml

# COCO 5shot
cuda 0,1,2,3 python run.py train with split=0 configs/coco_vit.yml shot=5 bs=8

Optional arguments:

-i <Number>: Specify the experiment id. Default is incremental numbers in the ./output directory (or MongoDB if used).
-p: print configurations
-u: Run command without saving experiment details. (used for debug)

Please refer to Sacred Documentation for complete command line interface.

Performance

Results on PASCAL-5i

Backbone	Method	1-shot	5-shot
ResNet-50	HSNet	64.0	69.5
	BAM	67.8	70.9
ViT-B/16-384	FPTrans	64.7	73.7
DeiT-T/16	FPTrans	59.7	68.2
DeiT-S/16	FPTrans	65.3	74.2
DeiT-B/16-384	FPTrans	68.8	78.0

Results on COCO-20i

Backbone	Method	1-shot	5-shot
ResNet-50	HSNet	39.2	46.9
	BAM	46.2	51.2
ViT-B/16-384	FPTrans	42.0	53.8
DeiT-B/16-384	FPTrans	47.0	58.9

Notice that the results are obtained on NVIDIA A100/V100 platform. We find that the results may have a few fluctuation on NVIDIA GeForce 3090 with exactly the same model and environment.

Citing FPTrans

@inproceedings{zhang2022FPTrans,
  title={Feature-Proxy Transformer for Few-Shot Segmentation},
  author={Jian-Wei Zhang, Yifan Sun, Yi Yang, Wei Chen},
  journal={NeurIPS},
  year={2022}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
assets		assets
configs		configs
core		core
data		data
data_kits		data_kits
lists		lists
networks		networks
utils_		utils_
.gitignore		.gitignore
README.md		README.md
config.py		config.py
constants.py		constants.py
run.py		run.py
tools.py		tools.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FPTrans: Feature-Proxy Transformer for Few-Shot Segmentation

Installation

Getting Started

Usage for inference with our pretrained models

Usage for training from scratch

Performance

Citing FPTrans

About

Releases

Packages

Languages

Jarvis73/FPTrans

Folders and files

Latest commit

History

Repository files navigation

FPTrans: Feature-Proxy Transformer for Few-Shot Segmentation

Installation

Getting Started

Usage for inference with our pretrained models

Usage for training from scratch

Performance

Citing FPTrans

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages