v0.6.0

guinmoon released this 26 Sep 19:02
· 204 commits to main since this release

Changes

  • added grammar sampling for llama models: put your .gbnf files in the grammars directory
  • llama.cpp updated to b1256
  • rwkv updated to 8db73b1
  • gpt-2 updated
  • rwkv_eval_sequence is 20% faster
  • added handling for GGML_ASSERT
  • fixed many errors
  • new llama2 and saiga templates
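
For the grammar sampling item above, here is a minimal sketch of what a .gbnf file might contain. It uses the GBNF syntax from llama.cpp and restricts the model's output to a single "yes" or "no". The file name yes-no.gbnf is only an illustration, not a file shipped with this release.

```gbnf
# yes-no.gbnf (hypothetical file name, not part of the release)
# The root rule is where generation starts; sampling can only
# produce text that matches it.
root ::= "yes" | "no"
```

A grammar like this is useful when the answer has to be machine-readable, since tokens that would break the rule are excluded during sampling.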

** Due to a ScrollViewReader error, autoscroll is disabled on iOS versions earlier than 16.4