Skip to content
David Tanner edited this page Sep 11, 2015 · 2 revisions

Code layout

This provides a small description of the clBLAS directories:

clBLAS/

+---client                     Benchmarking program 
+---include                    Internal header files
+---samples                    Example programs that ship with SDK
+---scripts
|   \---perf                   Python performance scripts to generate performance graphs
+---library
|   +---blas                   Contains most of the per routine driver code
|   |   +---AutoGemm           AutoGemm python scripts and UserGemm kernels
|   |   +---generic            Code common to all routines
|   |   +---gens               Code related to generating kernels on the fly
|   |   |   +---clTemplates    Type neutral template files
|   |   |   +---legacy         Code not used anymore
|   |   |   |   \---tests
|   |   |   \---tests          
|   |   \---include            Private include files related to the library internals
|   +---common                 Data structures common across routines
|   |   +---gens
|   |   \---tests
|   \---tools
|       +---ktest              Tool to generate a c program to call a library routine with a given set of parameters
|       |   +---naive
|       |   +---scripts
|       |   \---steps
|       +---tplgen             Source for the 
|       \---tune               Source for the tuning program
\---tests
    +---correctness            Googletest based monolithic test program that tests all functions, can take a long time to run
    +---functional             Test program to help test multiple queues and threading
    +---include 
    \---performance            Command line driven performance program, no graphing; compares GPU vs CPU
Clone this wiki locally