Here we provide examples of cuSOLVERMp library API usage:
Distributed decompositions and linear system solutions:
  - Dense matrix LU factorization and linear system solve
  - Dense matrix Cholesky factorization and linear system solve
The examples are bootstrapped by MPI, which is used to set up the distributed data. They are intended only to show how the API is used, not for performance benchmarking. For the same reason, the process grid is hardcoded to 2x1 in the examples; you can, however, change it to other values in the following lines (a different grid is sketched after the snippet):
/* Define grid of processors */
const int numRowDevices = 2;
const int numColDevices = 1;
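For example, a 2x2 grid could be configured as follows (just a sketch; the only constraint is that the grid dimensions multiply to the number of MPI processes you launch, see the run commands below):
/* Define grid of processors: a 2x2 grid requires 4 MPI processes */
const int numRowDevices = 2;
const int numColDevices = 2;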
Based on your distributed setup, you can choose how your GPU devices are mapped to processes; change the following line in the example to suit your needs:
const int localDeviceId = getLocalRank();
In these samples each process uses the CUDA device ID equal to its local MPI rank.
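getLocalRank() is a helper defined in the samples. As an illustration only (not necessarily the samples' actual implementation), a minimal sketch could derive the node-local rank by splitting MPI_COMM_WORLD into per-node communicators:
#include <mpi.h>

/* Sketch of a node-local rank helper; assumes MPI_Init has already been called. */
static int getLocalRank(void)
{
    MPI_Comm localComm;
    int      localRank = 0;

    /* Group the ranks that share a node into one communicator. */
    MPI_Comm_split_type(MPI_COMM_WORLD, MPI_COMM_TYPE_SHARED, 0,
                        MPI_INFO_NULL, &localComm);

    /* The rank within the node is used as the local CUDA device ID. */
    MPI_Comm_rank(localComm, &localRank);
    MPI_Comm_free(&localComm);

    return localRank;
}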
The samples require Linux, the x86_64 architecture, and a C++11-compatible compiler.
cuSOLVERMp is distributed as part of the HPC SDK starting with version 21.11 and requires the HPC SDK to be installed on the system. You also need to set up the HPC-X environment, which is part of the HPC SDK, using one of the provided scripts before building and running the examples, e.g.:
HPCSDKVER=21.11
HPCSDKARCH=Linux_x86_64
HPCSDKPATH=/opt/nvidia/hpc_sdk
HPCSDKROOT=$HPCSDKPATH/$HPCSDKARCH/$HPCSDKVER
source $HPCSDKROOT/comm_libs/hpcx/latest/hpcx-init-ompi.sh
hpcx_load
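After hpcx_load, the HPC-X OpenMPI tools should be on your PATH; a quick sanity check (a suggestion, not part of the samples) is:
which mpirun
mpirun --version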
Build the examples using the make command:
make HPCSDKVER=21.11 CUDAVER=11.5 all
Run the examples with the mpirun command, using a number of processes that matches the process grid size (numRowDevices * numColDevices), e.g.:
mpirun -n 2 ./mp_getrf_getrs
mpirun -n 2 ./mp_potrf_potrs
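If you change the process grid, launch a matching number of ranks; for example, the 2x2 grid sketched above would be run as:
mpirun -n 4 ./mp_getrf_getrs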