These are the materials for the blog post: https://dirtyhandscoding.github.io/posts/vectorizing-small-fixed-size-sort.html
It compares several implementations of a sorting algorithm for 32-bit integer keys on very small arrays (mainly N = 32). Here is the table with performance measurements:
See the blog post for more information.