UDASS

The official repo for the "Unified Domain Adaptive Semantic Segmentation". [paper] [demo]

Abstract

Unsupervised Domain Adaptive Semantic Segmentation (UDA-SS) aims to transfer the supervision from a labeled source domain to an unlabeled target domain. The majority of existing UDA-SS works typically consider images whilst recent attempts have extended further to tackle videos by modeling the temporal dimension. Although the two lines of research share the major challenges -- overcoming the underlying domain distribution shift, their studies are largely independent.

This causes several issues: (1) The insights gained from each line of research remain fragmented, leading to a lack of a holistic understanding of the problem and potential solutions. (2) Preventing the unification of methods, techniques, and best practices across the two domains, resulting in redundant efforts and missed opportunities for cross-pollination of ideas. (3) Without a unified approach, the knowledge and advancements made in one domain (images or videos) may not be effectively transferred to the other, leading to suboptimal performance and slower progress.

Under this observation, we advocate unifying the study of UDA-SS across video and image scenarios, enabling a more comprehensive understanding, synergistic advancements, and efficient knowledge sharing. To that end, we explore the unified UDA-SS from a general data augmentation perspective, serving as a unifying conceptual framework, enabling improved generalization, and potential for cross-pollination of ideas, ultimately contributing to the overall progress and practical impact of this field of research. Specifically, we propose a Quad-directional Mixup (QuadMix) method, characterized by tackling distinct point attributes and feature inconsistencies through four-directional paths for intra- and inter-domain mixing in a feature space. To deal with temporal shifts with videos, we incorporate optical flow-guided feature aggregation across spatial and temporal dimensions for fine-grained domain alignment.

Extensive experiments show that our method outperforms the state-of-the-art works by large margins on four challenging UDA-SS benchmarks.

Index Terms: Unified domain adaptation, semantic segmentation, QuadMix, flow-guided spatio-temporal aggregation.

Click for more qualitative results

The video demo is also avaliable at bilibili or google drive. Please select HD quality (1080p) for clearer display.

UDASS for Image Scenarios

You can find the source code to run image-UDASS on domain-adaptive IMAGE semantic segmentation in the subfolder /image_udass. For instructions how to set up the environment/datasets and how to train UDASS for image semantic segmentation UDA, please refer to seg/README.md.

UDASS for Video Scenarios

You can find the source code to run image-UDASS on domain-adaptive VIDEO semantic segmentation in the subfolder /video_udass. For instructions how to set up the environment/datasets and how to train UDASS for image semantic segmentation UDA, please refer to VIDEO/README.md.

Citation

If you find UDASS useful in your research, please consider citing:

@misc{zhang2024udass,
      title={Unified Domain Adaptive Semantic Segmentation}, 
      author={Zhe Zhang and Gaochang Wu and Jing Zhang and Chunhua Shen and Dacheng Tao and Tianyou Chai},
      year={2024},
      eprint={2311.13254},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
image_udass		image_udass
video_udass		video_udass
README.md		README.md
Unified-UDASS.jpg		Unified-UDASS.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UDASS

Abstract

Click for more qualitative results

UDASS for Image Scenarios

UDASS for Video Scenarios

Citation

About

Releases

Packages

Languages

ZHE-SAPI/UDASS

Folders and files

Latest commit

History

Repository files navigation

UDASS

Abstract

Click for more qualitative results

UDASS for Image Scenarios

UDASS for Video Scenarios

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages