CERE v0.3 release
- memory tracer: Fixed multiple race-conditions in ptrace functions attaching new threads.
- memory_tracer: Fixed a synchronization bug in send_to_tracer function
- Tested and validated CERE extraction on Lulesh Lawrence Livermore proto-application and on BWA gene alignment application
- Compatibility with LLVM 3.9
- Improved thread remapping formula for NUMA domains
- Support for ARMv8 complete (tested on Jetson, Juno and Thunder)