Intel® Gaudi® AI Accelerator Examples for Training and Inference

Intel® Gaudi® AI Accelerator Examples for Training and Inference
Community

Model List and Performance Data

Please visit this page for performance information.

This repository is a collection of models that have been ported to run on Intel Gaudi AI accelerator. They are intended as examples, and will be reasonably optimized for performance while still being easy to read.

Computer Vision

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
ResNet50	PyTorch	Training (compile)	Training (compile), Inference (compile)	Inference (compile)
ResNeXt101	PyTorch	-	Training (compile)	Training (compile)
ResNet152	PyTorch	Training	-	-
MobileNetV2	PyTorch	Training	-	-
UNet2D	PyTorch Lightning	Training (compile), Inference (compile)	Training (compile), Inference (compile)	-
Unet3D	PyTorch Lightning	Training (compile), Inference (compile)	Training (compile), Inference (compile)	Training (compile)*
SSD	PyTorch	Training	Training	-
GoogLeNet	PyTorch	Training	-	-
Vision Transformer	PyTorch	Training	-	-
DINO	PyTorch	Training	-	-
YOLOX	PyTorch	Training	-	-

*Disclaimer: only functional checks done

Natural Language Processing

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
BERT Pretraining	PyTorch	Training (compile), Inference	Training (compile), Inference	-
BERT Finetuning	PyTorch	Training, Inference	Training, Inference (compile)	Inference (compile)*
DeepSpeed BERT-1.5B, BERT-5B	PyTorch	Training	Training (compile)	-
BART	PyTorch	Training	-	-

*Disclaimer: Only bf16

Audio

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
Wav2Vec2ForCTC	PyTorch	Inference	Inference	-

Generative Models

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
Stable Diffusion	PyTorch Lightning	Training	Training	-
Stable Diffusion FineTuning	PyTorch	Training	Training	-

MLPerf™ Training 4.0

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
GPT3	PyTorch	-	Training	-
Llama 70B LoRA	PyTorch	-	Training	-

MLPerf™ Inference 4.0

Models	Framework	Validated on Gaudi	Validated on Gaudi 2	Validated on Gaudi 3
Llama 70B	PyTorch	-	Inference	-
Stable Diffusion XL	PyTorch	-	Inference	-

Reporting Bugs/Feature Requests

We welcome you to use the GitHub issue tracker to report bugs or suggest features.

When filing an issue, please check existing open, or recently closed, issues to make sure somebody else hasn't already reported the issue. Please try to include as much information as you can. Details like these are incredibly useful:

A reproducible test case or series of steps
The version of our code being used
Any modifications you've made relevant to the bug
Anything unusual about your environment or deployment

Community

Hugging Face

All supported models are available in Optimum Habana project https://github.com/huggingface/optimum-habana/ and as model cards at https://huggingface.co/Habana.

Megatron-DeepSpeed

Megatron-DeepSpeed was moved to a new GitHub repository HabanaAI/Megatron-DeepSpeed.

Fairseq

Transformer

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.github		.github
MLPERF4.0		MLPERF4.0
PyTorch		PyTorch
.gitignore		.gitignore
.gitmodules		.gitmodules
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intel® Gaudi® AI Accelerator Examples for Training and Inference

Model List and Performance Data

Computer Vision

Natural Language Processing

Audio

Generative Models

MLPerf™ Training 4.0

MLPerf™ Inference 4.0

Reporting Bugs/Feature Requests

Community

Hugging Face

Megatron-DeepSpeed

Fairseq

About

Releases

Packages

Contributors 52

Languages

HabanaAI/Model-References

Folders and files

Latest commit

History

Repository files navigation

Intel® Gaudi® AI Accelerator Examples for Training and Inference

Model List and Performance Data

Computer Vision

Natural Language Processing

Audio

Generative Models

MLPerf™ Training 4.0

MLPerf™ Inference 4.0

Reporting Bugs/Feature Requests

Community

Hugging Face

Megatron-DeepSpeed

Fairseq

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 52

Languages

Packages