CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

Project Page | Paper

Abstract: In this work, we introduce CC3D, a conditional generative model that synthesizes complex 3D scenes conditioned on 2D semantic scene layouts, trained using single-view images. Different from most existing 3D GANs that limit their applicability to aligned single objects, we focus on generating complex scenes with multiple objects, by modeling the compositional nature of 3D scenes. By devising a 2D layout-based approach for 3D synthesis and implementing a new 3D field representation with a stronger geometric inductive bias, we have created a 3D GAN that is both efficient and of high quality, while allowing for a more controllable generation process. Our evaluations on synthetic 3D-FRONT and real-world KITTI-360 datasets demonstrate that our model generates scenes of improved visual and geometric quality in comparison to previous works.

Requirements

The codebase has been tested with:

Python 3.9
PyTorch 1.12.1
4 NVIDIA GPUs (Tesla V100 32GB) with CUDA version 11.6

Install the additional Python libraries with:

pip install -r requirements.txt

Please refer to https://github.com/NVlabs/stylegan2-ada-pytorch for additional software/hardware requirements.
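
Before installing, it can help to compare the local environment with the tested versions listed above. The following is a minimal sketch using only the Python standard library; the helper name report_environment and the TESTED table are illustrative and are not part of this codebase.

```python
import sys
from importlib import metadata

# Versions this README reports as tested (illustrative table, not read from the repo).
TESTED = {"python": "3.9", "torch": "1.12.1"}


def report_environment():
    """Return (installed, tested) version pairs; installed is None if missing."""
    found = {"python": f"{sys.version_info.major}.{sys.version_info.minor}"}
    try:
        found["torch"] = metadata.version("torch")
    except metadata.PackageNotFoundError:
        found["torch"] = None
    return {name: (found[name], tested) for name, tested in TESTED.items()}


if __name__ == "__main__":
    for name, (installed, tested) in report_environment().items():
        print(f"{name}: installed={installed}, tested={tested}")
```

A mismatch does not necessarily mean the code will fail; it only shows where the local setup differs from the tested one.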

Dataset

Each dataset needs two subdirectories, images and labels, and each of them contains one folder per scene. The scene folders under images hold the renderings, and the scene folders under labels hold a boxes.npz file, which stores the layout parameters and the camera parameters. We use BlenderProc to render the 3D-FRONT dataset, and we provide pre-processing repositories. The expected directory layout is:

dataset/ 
        images/
                scene_x/[0.png, ..., max_num.png]
        labels/
                scene_x/boxes.npz
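
The layout above can be checked before training. The sketch below uses only the standard library; check_dataset and list_npz_arrays are hypothetical helper names, not functions from this repository. Because the array names stored in boxes.npz are not documented here, the second helper only lists whatever names the archive contains (an .npz file is a zip archive with one .npy member per array).

```python
import zipfile
from pathlib import Path


def check_dataset(root):
    """Return a list of problems found in a dataset directory (empty if none)."""
    root = Path(root)
    problems = []
    for sub in ("images", "labels"):
        if not (root / sub).is_dir():
            problems.append(f"missing subdirectory: {sub}/")
    if problems:
        return problems
    # Every scene folder under images/ needs renderings and a matching boxes.npz.
    for scene in sorted(p for p in (root / "images").iterdir() if p.is_dir()):
        if not any(scene.glob("*.png")):
            problems.append(f"no PNG renderings in images/{scene.name}/")
        if not (root / "labels" / scene.name / "boxes.npz").is_file():
            problems.append(f"missing labels/{scene.name}/boxes.npz")
    return problems


def list_npz_arrays(npz_path):
    """List the array names stored in an .npz file without loading the arrays."""
    with zipfile.ZipFile(npz_path) as archive:
        return [name[:-4] for name in archive.namelist() if name.endswith(".npy")]
```

For example, check_dataset("dataset") returns an empty list when every scene has at least one PNG rendering and a boxes.npz file.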

Pre-trained Checkpoints

You can download the pre-trained checkpoints used in our paper:

Dataset	Resolution	Download
3D-FRONT Bedrooms	256	Google Drive
3D-FRONT Living Rooms	256	Google Drive
KITTI-360	256	Google Drive

Train a new model

bash train.sh

Render scenes with a pre-trained model

bash generate.sh

Evaluate model

Use evaluate.sh to compute the FID metric for a trained model.
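
For reference, FID is the Fréchet distance between two Gaussians fitted to feature embeddings of real and generated images. The sketch below shows only that final distance computation with NumPy, assuming the means and covariances have already been estimated. It is not the code run by evaluate.sh, and the function name frechet_distance is illustrative.

```python
import numpy as np


def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between N(mu1, sigma1) and N(mu2, sigma2)."""
    mu1, mu2 = np.asarray(mu1, dtype=float), np.asarray(mu2, dtype=float)
    sigma1 = np.asarray(sigma1, dtype=float)
    sigma2 = np.asarray(sigma2, dtype=float)
    mean_term = np.sum((mu1 - mu2) ** 2)
    # Tr(sqrt(sigma1 @ sigma2)) equals the sum of the square roots of the
    # eigenvalues of sigma1 @ sigma2, which are real and non-negative when
    # both covariances are positive semi-definite. Clipping removes tiny
    # negative values caused by floating-point error.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    trace_sqrt = np.sum(np.sqrt(np.clip(eigvals.real, 0.0, None)))
    return float(mean_term + np.trace(sigma1) + np.trace(sigma2) - 2.0 * trace_sqrt)
```

Lower values indicate that the generated feature statistics are closer to the real ones; identical statistics give a distance of zero.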

License

Our code is based on EG3D and follows its license.

sherwinbahmani/cc3d
