feat: add lora fine tuning for llama 3.2 #958

Merged: 11 commits merged into main from llama_fine_tuning on Dec 19, 2024

Conversation

Collaborator

@jfrery jfrery commented Dec 10, 2024

No description provided.

@cla-bot cla-bot bot added the cla-signed label Dec 10, 2024
@jfrery jfrery marked this pull request as ready for review December 11, 2024 11:13
@jfrery jfrery requested a review from a team as a code owner December 11, 2024 11:13
Collaborator

@kcelia kcelia left a comment

Thanks for your PR.

Some comments:

  • If we go with LoRA, maybe we should add it to the forbidden list (I stopped spamming you with my LoRA comments, lol).
  • The new LoRA API is very cool.
  • The GPT-2 and Llama notebooks follow the same logic and share the same utility functions; maybe we could create a utils file for them.
  • In the GPT-2 notebook, I don't think you use the full potential of the new LoRA API, or maybe you wanted to highlight what happens behind the scenes and I missed that.
  • In the three notebooks, I think it's not clear to the reader whether FHE is used only for inference or for the adapters as well; maybe you should state it explicitly in the introduction or the conclusion.

Collaborator Author

jfrery commented Dec 16, 2024

The GPT-2 and Llama notebooks follow the same logic and share the same utility functions; maybe we could create a utils file for them.

I think they already share a few functions through the utils file. GPT-2 uses the previous API version without LoraTrainer, so it is a bit more complicated but also more flexible.

In the GPT-2 notebook, I don't think you use the full potential of the new LoRA API, or maybe you wanted to highlight what happens behind the scenes and I missed that.

Yes, I kept GPT-2 without LoraTrainer to show that one can use their own training method, but it means defining the hybrid model, the remote layers, and so on.
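
To make the two routes concrete, here is a rough sketch. Only the PyTorch, PEFT, and transformers calls below are real APIs; the Concrete ML imports and calls are left commented out with assumed names, not the exact signatures from this PR.

import torch
from torch.utils.data import DataLoader, TensorDataset
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Attach LoRA adapters to GPT-2 with PEFT; the base weights are frozen and
# only the adapter parameters are trainable.
model = AutoModelForCausalLM.from_pretrained("gpt2")
peft_model = get_peft_model(
    model,
    LoraConfig(r=8, lora_alpha=32, target_modules=["c_attn"], task_type="CAUSAL_LM"),
)
optimizer = torch.optim.AdamW(peft_model.parameters(), lr=2e-4)

# Tiny placeholder batch of random token ids, only so the loop below runs.
input_ids = torch.randint(0, model.config.vocab_size, (8, 32))
loader = DataLoader(TensorDataset(input_ids), batch_size=4)

# Route 1 (new API, Llama notebook): hand everything to LoraTrainer.
# The constructor and method names here are assumptions for illustration:
# from concrete.ml.torch.lora import LoraTrainer
# trainer = LoraTrainer(peft_model, optimizer=optimizer)
# trainer.compile(calibration_data)
# trainer.train(loader)

# Route 2 (GPT-2 notebook): define the hybrid model / remote layers yourself
# and keep full control of the training loop. The HybridFHEModel usage is
# likewise an assumption:
# from concrete.ml.torch.hybrid_model import HybridFHEModel
# hybrid = HybridFHEModel(peft_model, module_names=["transformer.h.0.attn.c_attn"])
for (batch,) in loader:
    optimizer.zero_grad()
    out = peft_model(input_ids=batch, labels=batch)  # causal LM loss
    out.loss.backward()   # gradients only reach the LoRA adapter weights
    optimizer.step()

This is only to illustrate the trade-off mentioned above: the first route is shorter, while the second exposes the hybrid-model plumbing.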

In the three notebooks, I think it's not clear to the reader whether FHE is used only for inference or for the adapters as well; maybe you should state it explicitly in the introduction or the conclusion.

I will add a sentence at the beginning to make sure what we do here is clear.

Collaborator

@kcelia kcelia left a comment

Thanks for the changes.

It would be nice to specify whether the weights are encrypted too.

⚠️ Known flaky tests have been rerun ⚠️

One or several tests initially failed but were identified as known flaky tests. They have therefore been rerun and passed. See below for more details.

Failed tests details

Known flaky tests that initially failed:

  • tests/torch/test_compile_torch.py::test_compile_torch_or_onnx_conv_networks[True-True-CNN_conv1d-relu]
  • tests/torch/test_compile_torch.py::test_compile_torch_or_onnx_conv_networks[False-True-CNN_grouped-relu]

kcelia previously approved these changes Dec 18, 2024
Collaborator

@andrei-stoian-zama andrei-stoian-zama left a comment

Please fix the GPT-2 notebook convergence.

- Fix wrong unpacking of inputs in LoraTraining and add a check (see the sketch after this list)
- Add an optimizer step in the GPT-2 notebook
- Fix a typo in the Llama notebook
- Update the version in requirements
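
For context on the first item, a hypothetical sketch of what an unpacking check can look like (illustrative only, not the actual LoraTraining code from this PR):

def unpack_batch(batch):
    # Expect an (inputs, labels) pair; fail loudly instead of silently
    # mis-assigning values when the DataLoader yields something else.
    if not isinstance(batch, (tuple, list)) or len(batch) != 2:
        raise ValueError(
            f"Expected an (inputs, labels) pair, got {type(batch).__name__}"
        )
    inputs, labels = batch
    return inputs, labels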

Coverage passed ✅

Coverage details

---------- coverage: platform linux, python 3.8.18-final-0 -----------
Name    Stmts   Miss  Cover   Missing
-------------------------------------
TOTAL    8490      0   100%

63 files skipped due to complete coverage.

@jfrery jfrery merged commit ef602d9 into main Dec 19, 2024
17 checks passed
@jfrery jfrery deleted the llama_fine_tuning branch December 19, 2024 15:33