v3.3.6
This is a patch release containing the following changes relative to v3.3.5:
- Fixed crash during platform detection on some AArch64-based systems (3e0e69b)
- Improved inner product performance with Arm Compute Library (ACL) (e7abee2, 214fb9e, 8aacc8f)
- Fixed incorrect results in int8 depthwise convolution with post-ops on processors with Intel AVX2 instruction set support (0c922e0)
- Fixed performance regression in fp32 convolution on processors with Intel AVX2 instruction set support (4efc0ad)