Please cite the following work if you find this model useful for your research:
Ulrich, C., Isensee, F., Wald, T., Zenk, M., Baumgartner, M., & Maier-Hein, K.(2023).
MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation. arXiv preprint arXiv:2303.14444.
Please also cite the following work if you use this pipeline for training:
Isensee, F., Jaeger, P. F., Kohl, S. A., Petersen, J., & Maier-Hein, K. H. (2020).
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nature Methods, 1-9.
This repository is based on nnU-Net V1. Please get familiar with it first.
The repository can be cloned and installed using the following commands.
git clone https://github.com/MIC-DKFZ/MultiTalent.git MultiTalent
cd MultiTalent
pip install -U .
Set the Paths of your enviroment according to nnU-Net V1
Note: We are working on an update to nnU-NetV2.
In nnU-Net V1, the project was still very experimental. In V2 we will make the project more user-friendly.
You can download a trained Multitalent (U-Net and Residual U-Net) model here using the following command:
nnUNet_download_pretrained_model Task100_MultiTalent
Please use the command above, because it makes some modifications to your plan files!
After that, you could run the following script for inference:
CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --master_port=1224 --nproc_per_node=1 ./nnunet/inference/predict_MultiTalent.py -i inputpath -o outputpath -m path_to_model_folds
We recommend to preprocess your target dataset as we did for the MultiTalent dataset collection. We expect, that the dataset is already in the raw data folder as expected by nnU-Net V1.
nnUNet_plan_and_preprocess -t TASK_ID -pl3d ExperimentPlanner3D_v21_Pretrained --planner2d None -overwrite_plans Path_To_MultiTalent_Plan -overwrite_plans_identifier New_Plans_Name --verify_dataset_integrity
Path_To_MultiTalent_Plan should point to the plans you downloaded above.
You can fine-tune a MultiTalent model on a new task using the following command:
nnUNet_train 3d_fullres nnUNetTrainerV2_warmupsegheads TASK_ID Fold -p New_Plans_Name -pretrained_weights path_to_pretrained_model/model_final_checkpoint.model
For fine-tuning a residual encoder, use the trainer nnUNetTrainerV2_warmupsegheads_resenc.
Note, that these trainers only train the new segmentation heads for the first 10 epochs, followed by a warm-up for the whole network.
Dataset collection
In the following, we will give you an instruction how to get and preprocess our partially labelled dataset collection.
You need to download the raw data individually from each dataset origin:
Dataset 1-6: MSD (only CT datasets)
Dataset 7 & 8: BTCV
Dataset 9: BTCV2
Dataset 10: StructSeg
Dataset 11: SegThor
Dataset 12: NIH-Pan
Dataset 13: KiTS19
After downloading, you have to save the files as expected by nnU-Net in your nnUNet_raw_data_base
directory and to use the right folder structure (nnU-Net dataset conversion).
It is important that you choose the same folder names as we did:
'Task003_Liver'
'Task006_Lung'
'Task007_Pancreas'
'Task008_HepaticVessel'
'Task009_Spleen'
'Task010_Colon'
'Task017_AbdominalOrganSegmentation'
'Task046_AbdOrgSegm2'
'Task051_StructSeg2019_Task3_Thoracic_OAR'
'Task055_SegTHOR'
'Task062_NIHPancreas'
'Task064_KiTS_labelsFixed'
'Task018_PelvicOrganSegmentation'
Preprocessing for training
First, we need to generate our raw dataset for the multi-class training. Run the following script to copy all the images in the right folder, convert the labelmaps, generate the dataset.jason file and transpose the images if needed:
/nnunet/dataset_conversion/Task100_MultiTalent.py
It can take around 3 hours, depending on your system. If you want to extend or change the base dataset collection, you would need to adapt this file!
Now, we can use the nn-UNet preprocessing function with a specialized preprocessor:
nnUNet_plan_and_preprocess -t 100 -pl3d ExperimentPlanner3D_v21_MultiTalent -pl2d None -tf 16 --verify_dataset_integrity -overwrite_plans_identifier multitalent_bs4
Again, this takes some time.
This also generates a training plan file that we need for the following network training. By default, this plan generates a batchsize of 2. It is very easy to change the batchsize (see extending nnU-Net), but you could also use the plans that we provide.
Next, copy the ./splits_custom.pkl file in your preprocessed MultiTalent folder
We are almost done, but we need to add the information about the valid labels for each image to the .pkl files:
python /nnunet/dataset_conversion/Task100_MultiTalent_addregions.py
This takes only a few seconds.
Training of the Multi-Class network
First, you should take a look at the nnU-Net V1 distributed training instructions.
To train a MultiTalent network, run the following command:
CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --master_port=1234 --nproc_per_node=2 ./nnunet/run/run_training_DDP.py 3d_fullres MultiTalent_trainer_ddp 100 0 -p MultiTalent_bs4 --dbs