v1.2: DeepSpeed and CV Models
DeepSpeed
This release brings support for DeepSpeed. It is now possible to train bigger models on Gaudi with Optimum Habana!
- Add support for DeepSpeed #93
Check the documentation here to know how to use it.
Computer Vision Models
Two computer-vision models have been validated for performing image classification in both single- and multi-cards configurations:
- ViT #80
- Swin
You can see how to use them in this example.
SynapseAI 1.6.0
This release is fully compatible with SynapseAI 1.6.0.
- Update to SynapseAI 1.6.0 #91
It is recommended to use SynapseAI 1.6.0 for optimal performance.
Documentation
Optimum Habana now has a dedicated documentation. you can find it here.
It shows how to quickly make a Transformers-based script work with the library. It also contains guides explaining how to do distributed training, how to use DeepSpeed or how to make the most of HPUs to accelerate training.
Masked Language Modeling
A new example script has been added to perform masked language modeling. This is especially useful if you want to pretrain models such as BERT or RoBERTa.
- Add run_mlm.py in language-modeling examples #83