Reference models for Intel(R) Gaudi(R) AI Accelerator

HabanaAI/Model-References

Intel® Gaudi® AI Accelerator Examples for Training and Inference

Model List and Performance Data

Please visit this page for performance information.

This repository is a collection of models that have been ported to run on the Intel Gaudi AI accelerator. They are intended as examples and are reasonably optimized for performance while remaining easy to read.

Computer Vision

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| ResNet50 | PyTorch | Training (compile) | Training (compile), Inference (compile) | Inference (compile) |
| ResNeXt101 | PyTorch | - | Training (compile) | Training (compile) |
| ResNet152 | PyTorch | Training | - | - |
| MobileNetV2 | PyTorch | Training | - | - |
| UNet2D | PyTorch Lightning | Training (compile), Inference (compile) | Training (compile), Inference (compile) | - |
| Unet3D | PyTorch Lightning | Training (compile), Inference (compile) | Training (compile), Inference (compile) | Training (compile)* |
| SSD | PyTorch | Training | Training | - |
| GoogLeNet | PyTorch | Training | - | - |
| Vision Transformer | PyTorch | Training | - | - |
| DINO | PyTorch | Training | - | - |
| YOLOX | PyTorch | Training | - | - |

*Disclaimer: only functional checks were performed.

Natural Language Processing

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| BERT Pretraining | PyTorch | Training (compile), Inference | Training (compile), Inference | - |
| BERT Finetuning | PyTorch | Training, Inference | Training, Inference (compile) | Inference (compile)* |
| DeepSpeed BERT-1.5B, BERT-5B | PyTorch | Training | Training (compile) | - |
| BART | PyTorch | Training | - | - |

*Disclaimer: bf16 only.

Audio

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| Wav2Vec2ForCTC | PyTorch | Inference | Inference | - |

Generative Models

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| Stable Diffusion | PyTorch Lightning | Training | Training | - |
| Stable Diffusion FineTuning | PyTorch | Training | Training | - |

MLPerf™ Training 4.0

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| GPT3 | PyTorch | - | Training | - |
| Llama 70B LoRA | PyTorch | - | Training | - |

MLPerf™ Inference 4.0

| Models | Framework | Validated on Gaudi | Validated on Gaudi 2 | Validated on Gaudi 3 |
|---|---|---|---|---|
| Llama 70B | PyTorch | - | Inference | - |
| Stable Diffusion XL | PyTorch | - | Inference | - |

MLPerf™ is a trademark and service mark of MLCommons Association in the United States and other countries. All rights reserved. Unauthorized use is strictly prohibited.

Reporting Bugs/Feature Requests

We welcome you to use the GitHub issue tracker to report bugs or suggest features.

When filing an issue, please check existing open and recently closed issues to make sure it has not already been reported. Include as much information as you can; details like these are especially useful:

  • A reproducible test case or series of steps
  • The version of our code being used
  • Any modifications you've made relevant to the bug
  • Anything unusual about your environment or deployment
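
To make the environment details above easier to gather, something like the following could be pasted into an issue. This is only a minimal sketch using the Python standard library, not a tool shipped with this repository: the function name `collect_environment_report` is hypothetical, and the default package list (`torch`) is an assumption you should adjust to the packages relevant to your setup.

```python
import platform
import sys
from importlib import metadata


def collect_environment_report(packages=("torch",)):
    """Return a plain-text environment summary suitable for a bug report.

    `packages` is a hypothetical default; pass the distribution names
    that matter for the model you are running.
    """
    lines = [
        f"Python: {sys.version.split()[0]}",
        f"Platform: {platform.platform()}",
    ]
    for name in packages:
        try:
            version = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Record missing packages too; that is often useful to know.
            version = "not installed"
        lines.append(f"{name}: {version}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(collect_environment_report())
```

The output covers only the interpreter, the platform, and package versions; the version of the repository code you are using and any local modifications still need to be described by hand.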

Community

Hugging Face

Megatron-DeepSpeed

Fairseq