Skip to content

cuda stream benchmark: based on work by Massimiliano Fatica@NVIDIA

Notifications You must be signed in to change notification settings

nicejunjie/CUDA-Cpp-STREAM

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

STREAM Benchmark in CUDA C++

Variant of the STREAM benchmark written in CUDA C++, based on work by Massimiliano Fatica (NVIDIA).

Three models are available:

  1. default: the original GPU STREAM using HBM.
  2. -DUSE_HOST: use managed memory pointer allocated on the host, cuda runtime may make automatic page migration.
  3. -DZERO_COPY: use coherent access in unified memory architecture, directly access host memory without data movement.

To Do: add output validation.

About

cuda stream benchmark: based on work by Massimiliano Fatica@NVIDIA

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Cuda 95.2%
  • Shell 2.8%
  • Makefile 2.0%