Skip to content

v0.4.5

Compare
Choose a tag to compare
@guinmoon guinmoon released this 26 Aug 16:02
· 248 commits to main since this release

Changes

  • llama.cpp updated to dadbed9, A noticeable increase in the speed of Metal on iOS. Now 7B qK_3 model works fine on iphone 12
  • Add models management
  • Add template for run LLaMA 2 on iPhone
  • Fix template set context size

Now you can install LLMFarm on iOS devices with TestFlight

** llmfarm_core has been moved to a separate repository. To build LLMFarm, you need to clone this repository recursively:

git clone --recurse-submodules https://github.com/guinmoon/LLMFarm