Skip to content

Parallelization

kaskr edited this page Jun 25, 2016 · 5 revisions

BLAS

TMB uses the following BLAS kernels when calculating function value and derivatives

Function Gradient
dgemm dgemm
dsyrk dsymm
dtrsm dtrsm
dpotrf dpotri

If your model spends a significant amount of time in these BLAS operations you may benefit from an optimized BLAS library e.g. MKL for CPU or nvblas for GPU. It's critical that

Clone this wiki locally