HPU version of graph transformer architectures
This repository provides a template for running graph transformer architectures on Intel Gaudi-v2 devices. Specifically, our goal is to provide the following:
- Implementations of graph transformer models that are compatible with Intel Gaudi-v2.
- Sparse matrix multiplication (SpMM) kernels developed at the TPC-C level for efficient computation.
The main difference between standard deep learning architectures and graph neural networks is the sparsity of the data: graph datasets are highly sparse. To handle this, many GNN frameworks (such as PyG and DGL) provide SpMM operations. Unfortunately, the current version of Intel Gaudi-v2 does not support sparse matrix multiplication (see the Intel Forum).
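Because SpMM is unavailable, the message-passing step A @ X currently has to go through a dense matmul. Below is a minimal sketch of this workaround; the helper `dense_propagate` is hypothetical and assumes a PyG-style `edge_index`:

```python
import torch
from torch_geometric.utils import to_dense_adj

def dense_propagate(edge_index, x):
    # Materialize the sparse adjacency as a dense [N, N] matrix, then use a
    # regular matmul, which runs on the HPU. Memory cost is O(N^2).
    adj = to_dense_adj(edge_index, max_num_nodes=x.size(0)).squeeze(0)
    return adj @ x
```

This dense fallback is why full-graph training on large datasets is memory-hungry (see the note on ogbn-arxiv below).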
Our implementations are adapted from the official SGFormer code and modified for HPU compatibility. The supported models are:
- SGFormer (NeurIPS 2023)
- GraphGPS
- Cobformer
- Nodeformer
docker run -it --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice --net=host --ipc=host -v /home/irteamsu:/root vault.habana.ai/gaudi-docker/1.17.1/ubuntu22.04/habanalabs/pytorch-installer-2.3.1:latest
pip install -r requirements_docker.txt
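After installation, a quick sanity check (assuming the habana_frameworks package shipped with the image above) is to move a tensor to the HPU:

```python
import torch
import habana_frameworks.torch.core as htcore  # registers the "hpu" device with PyTorch

x = torch.ones(2, 2).to("hpu")
print(x.device)  # expected: hpu:0
```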
Please refer to the commands in run.sh to run each model on the HPU.
For large datasets such as ogbn-arxiv and ogbn-proteins, we train with subgraph sampling due to memory constraints.
Since the current version of the code relies on dense matrix multiplication, full-graph training on ogbn-arxiv requires 100 GB of memory.
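As an illustration, subgraph sampling can be done with PyG's NeighborLoader; the sampler and hyperparameters below are placeholders, not necessarily what run.sh uses:

```python
from torch_geometric.loader import NeighborLoader

# data: a PyG Data object, e.g. ogbn-arxiv loaded via OGB
loader = NeighborLoader(
    data,
    num_neighbors=[15, 10],      # neighbors sampled per hop
    batch_size=1024,
    input_nodes=data.train_mask,
)

for batch in loader:
    batch = batch.to("hpu")      # each sampled subgraph fits in device memory
    # forward / backward on the subgraph ...
```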
We plan to support SpMM kernels at the TPC-C kernel level in a future release.
import habana_frameworks.torch.core as htcore  # registers the "hpu" device with PyTorch

# model = torch.compile(model, backend="hpu_backend")  # currently raises an error (see below)
device = "hpu"
model = model.to(device)
The current version of the code raises an error when torch.compile() is used with backend="hpu_backend". The issue appears to be related to dynamic tensor shapes not being handled when the model is moved to the HPU.
To obtain the expected speedup, this needs to be resolved in a future version of the code.
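One direction that may help, although we have not verified it on this codebase, is to ask the compiler to specialize to static input shapes:

```python
# Untested mitigation: disable dynamic-shape tracing so the compiled graph
# is specialized to fixed tensor sizes.
model = torch.compile(model, backend="hpu_backend", dynamic=False)
```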