QoQ-g128 Llama3-70B-Instruct Results #7

Open
ethxnp opened this issue Jun 3, 2024 · 0 comments
ethxnp commented Jun 3, 2024

I ran python -m lmquant.llm.run projects/llm/configs/llm.yaml projects/llm/configs/qoq/g128.yaml --model-name llama3-instruct-70b --model-path meta-llama/Meta-Llama-3-70B-Instruct --smooth-xw-alpha 0.3 --smooth-xw-beta 0.7 --smooth-yx-strategy GridSearch --smooth-yx-beta " -2" --save-model

These were the results:

24-06-03 19:43:57 | I |         |  Task  |Version|    Metric     |Value |   |Stderr|
24-06-03 19:43:57 | I |         |--------|------:|---------------|-----:|---|-----:|
24-06-03 19:43:57 | I |         |wikitext|      1|word_perplexity|5.5105|±  |5.5105|

The perplexity seems rather high; do you have recommendations for different parameters to try? I will upload the model to HF later and share the link in case it's helpful.
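
In the meantime, here is a minimal sketch of the sanity check I have in mind once the checkpoint is on the Hub, using plain transformers rather than lmquant's evaluation path. The repo id is a placeholder, and token-level perplexity will not exactly match lm-eval's word-normalized word_perplexity, but it should be in the same ballpark:

```python
# Sketch: token-level perplexity of the uploaded model on wikitext-2.
# NOTE: the repo id below is a placeholder, not an actual upload, and
# this token-level number will not match lm-eval's word_perplexity
# exactly (that metric normalizes by word count, not token count).
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "<your-username>/llama3-70b-instruct-qoq-g128"  # placeholder
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Concatenate the wikitext-2 test split and tokenize it once.
text = "\n\n".join(load_dataset("wikitext", "wikitext-2-raw-v1", split="test")["text"])
ids = tok(text, return_tensors="pt").input_ids

seq_len = 2048
nlls, n_tokens = [], 0
for i in range(0, ids.size(1) - 1, seq_len):
    chunk = ids[:, i : i + seq_len].to(model.device)
    with torch.no_grad():
        out = model(chunk, labels=chunk)  # loss = mean NLL over shifted targets
    nlls.append(out.loss.float() * (chunk.size(1) - 1))  # back to summed NLL
    n_tokens += chunk.size(1) - 1

ppl = torch.exp(torch.stack(nlls).sum() / n_tokens)
print(f"token-level perplexity: {ppl.item():.4f}")
```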
