v0.4.5
Changes
- llama.cpp updated to dadbed9, A noticeable increase in the speed of Metal on iOS. Now 7B qK_3 model works fine on iphone 12
- Add models management
- Add template for run LLaMA 2 on iPhone
- Fix template set context size
Now you can install LLMFarm
on iOS devices with TestFlight
** llmfarm_core
has been moved to a separate repository. To build LLMFarm
, you need to clone this repository recursively:
git clone --recurse-submodules https://github.com/guinmoon/LLMFarm