v0.6.0
Changes
- added grammar sampling for llama models; put .gbnf files in the grammars directory
- llama.cpp updated to b1256
- rwkv updated to 8db73b1
- gpt-2 updated
- rwkv_eval_sequence is 20% faster
- GGML_ASSERT is now handled
- fixed many errors
- new llama2 and saiga templates
** Due to a ScrollViewReader error, autoscroll is disabled on iOS <16.4