Release v1.2: DeepSpeed and CV Models · huggingface/optimum-habana

DeepSpeed

This release brings support for DeepSpeed. It is now possible to train bigger models on Gaudi with Optimum Habana!

Add support for DeepSpeed #93

Check the documentation here to know how to use it.

Computer Vision Models

Two computer-vision models have been validated for performing image classification in both single- and multi-cards configurations:

ViT #80
Swin

You can see how to use them in this example.

SynapseAI 1.6.0

This release is fully compatible with SynapseAI 1.6.0.

Update to SynapseAI 1.6.0 #91

It is recommended to use SynapseAI 1.6.0 for optimal performance.

Documentation

Optimum Habana now has a dedicated documentation. you can find it here.

It shows how to quickly make a Transformers-based script work with the library. It also contains guides explaining how to do distributed training, how to use DeepSpeed or how to make the most of HPUs to accelerate training.

Masked Language Modeling

A new example script has been added to perform masked language modeling. This is especially useful if you want to pretrain models such as BERT or RoBERTa.

Add run_mlm.py in language-modeling examples #83

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.2: DeepSpeed and CV Models

DeepSpeed

Computer Vision Models

SynapseAI 1.6.0

Documentation

Masked Language Modeling