[Good First Issue]: Verify chatglm3-6b with GenAI text_generation #268

Open
p-wysocki opened this issue Mar 1, 2024 · 75 comments · May be fixed by #1119
Labels: good first issue (Good for newcomers)

Comments

@p-wysocki
Collaborator

Context

This task concerns enabling tests for chatglm3-6b. You can find more details in the openvino_notebooks LLM chatbot README.md.

Please ask general questions in the main issue at #259

What needs to be done?

Described in the main Discussion issue at: #259

Example Pull Requests

Described in the main Discussion issue at: #259

Resources

Contact points

Described in the main Discussion issue at: #259

Ticket

No response

@Utkarsh-2002

.take


github-actions bot commented Mar 5, 2024

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@p-wysocki p-wysocki moved this from Contributors Needed to Assigned in Good first issues Mar 6, 2024
@p-wysocki
Collaborator Author

Hello @Utkarsh-2002, are you still working on this? Is there anything we could help you with?

@Utkarsh-2002

Utkarsh-2002 commented Mar 12, 2024

Yes, I am working on this. There is some issue with the compilation part, but I am working on it and will let you know if I need any help.

@p-wysocki
Collaborator Author

Hello @Utkarsh-2002, please let me know if you're still working on this, for now I'm unassigning you due to long inactivity.

@p-wysocki p-wysocki moved this from Assigned to Contributors Needed in Good first issues Apr 3, 2024
@HikaruSadashi

.take


github-actions bot commented Apr 4, 2024

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@p-wysocki p-wysocki moved this from Contributors Needed to Assigned in Good first issues Apr 5, 2024
@p-wysocki p-wysocki moved this from Assigned to Contributors Needed in Good first issues May 6, 2024
@duydl

duydl commented May 14, 2024

.take

github-actions bot commented May 14, 2024

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@p-wysocki p-wysocki moved this from Contributors Needed to Assigned in Good first issues May 15, 2024
@duydl duydl removed their assignment May 15, 2024
@duydl

duydl commented May 15, 2024

Sorry, I cannot access my lab PC until next week, and my laptop is a little short on resources for the task. So I'm leaving it open for others in the meantime.

@p-wysocki
Collaborator Author

No worries, come back anytime you feel like contributing. You're always welcome :)

@p-wysocki p-wysocki moved this from Assigned to Contributors Needed in Good first issues May 15, 2024
@Jessielovecodings

#WLB
.take


github-actions bot commented Jun 3, 2024

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

@Wovchena
Collaborator

#259 (point 5) is all we have about extending the tests.
optimum-cli offers command-line arguments for quantization to reduce the model size. This may help fit the model into memory at runtime, although you still need a larger host to run the quantization itself; I'm not sure if that helps in your case.
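As a minimal sketch of that export-time compression (assuming a recent optimum-intel; the int4 setting and output directory name are illustrative, not from this thread):

# Hypothetical export with 4-bit weight compression; --weight-format also accepts int8.
optimum-cli export openvino --trust-remote-code --task text-generation-with-past \
    --model THUDM/chatglm3-6b --weight-format int4 chatglm3-6b-int4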

@Aniruddha521

@Wovchena Thank you for your guidance.

@Aniruddha521

Aniruddha521 commented Oct 18, 2024

Hey @Wovchena, I have built and configured CMake (Release), but when I execute ./beam_search_causal_lm TinyLlama-1.1B-Chat-v1.0 "Why is the Sun yellow?" I get the error below:

Image

@Aniruddha521

Aniruddha521 commented Oct 18, 2024

@Wovchena
1) beam_search_causal_lm : Working fine
2) benchmark_genai: Working fine
3) chat_sample: Working fine
4) continuous_batching_accuracy: Image
5) continuous_batching_benchmark: Image
6) greedy_causal_lm: Working fine
7) lora_greedy_causal_lm: Image
8) multinomial_causal_lm: Working fine
9) prompt_lookup_decoding_lm: Working fine
10) speculative_decoding_lm: Working fine

As you can see, all samples work fine with chatglm3-6b except continuous_batching_accuracy, continuous_batching_benchmark, and lora_greedy_causal_lm, which raise errors.
How can I fix the errors for 4, 5, and 7?

@Wovchena
Collaborator

Can you share the build commands?

I see in your screenshot that the error comes from openvino_genai..., which is usually the name of a prebuilt GenAI package. It also states that the version used is 24.4. While that's the latest released version, it's outdated from a development point of view. I encourage you to compile the whole project on your own following https://github.com/openvinotoolkit/openvino.genai/blob/master/src/docs/BUILD.md

@Aniruddha521

@Wovchena
I built openvino with openvino genai using the following commands sequentially:

git clone --recursive https://github.com/openvinotoolkit/openvino.git
git clone --recursive https://github.com/openvinotoolkit/openvino.genai.git
cd openvino
sudo ./install_build_dependencies.sh
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release ..
cmake --build . --parallel 14
cd --
cmake --install openvino/build --prefix openvino_install
source openvino_install/setupvars.sh
cd openvino.genai
cmake -DCMAKE_BUILD_TYPE=Release -S ./ -B ./build/
cmake --build ./build/ --config Release --parallel 14
cmake --install ./build/ --config Release --prefix openvino_install
cd openvino_install/samples/cpp
./build_samples.sh
cd --
cd openvino_cpp_samples_build/intel64/Release/

1) beam_search_causal_lm: ./beam_search_causal_lm /home/roy/chatglm3-6b-with-past "Why sun is yellow?"
Image

2) benchmark_genai: ./benchmark_genai -m /home/roy/chatglm3-6b-with-past working fine.

3) chat_sample: ./chat_sample /home/roy/chatglm3-6b-with-past working fine.

4) greedy_causal_lm: ./greedy_causal_lm /home/roy/chatglm3-6b-with-past "Why sun is yellow?" working fine.

5) lora_greedy_causal_lm: ./lora_greedy_causal_lm /home/roy/chatglm3-6b-with-past /home/roy/.cache/huggingface/hub/models--THUDM--chatglm3-6b/snapshots/91a0561caa089280e94bf26a9fc3530482f0fe60/model-00001-of-00007.safetensors "Why sun is yellow?" working fine.

6) multinomial_causal_lm: ./multinomial_causal_lm /home/roy/chatglm3-6b-with-past "Why sun is yellow?" working fine.

7) prompt_lookup_decoding_lm: ./prompt_lookup_decoding_lm /home/roy/chatglm3-6b-with-past "return 0;" working fine.

8) speculative_decoding_lm: ./speculative_decoding_lm /home/roy/chatglm3-6b-with-past /home/roy/Llama-2-7b-chat-hf "Why sun is yellow?".
Image

After completing the build and install (i.e., openvino_install), I noticed that openvino_install/samples/cpp is missing the speculative_decoding_lm, prompt_lookup_decoding_lm, and lora_greedy_causal_lm folders. So I manually added these three folders to openvino_install/samples/cpp and executed ./build_samples.sh, which generated openvino_cpp_samples_build containing executables for all the sample folders present in openvino_install/samples/cpp. Is that fine, or was I expected to use another approach, or did I miss anything?

@Wovchena
Collaborator

1) is likely explained by swapped dimensions for that model. When adding the model to the supported list, please mark it with (no beam search).
8) Ensure /home/roy/Llama-2-7b-chat-hf/openvino_model.xml exists after the optimum-cli export for Llama-2-7b-chat-hf.
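For reference, a hedged sketch of that check (the Hugging Face model id is an assumption; meta-llama/Llama-2-7b-chat-hf is gated and needs access approval):

# Hypothetical re-export of the draft model, then verify the IR file exists.
optimum-cli export openvino --task text-generation-with-past --model meta-llama/Llama-2-7b-chat-hf /home/roy/Llama-2-7b-chat-hf
ls /home/roy/Llama-2-7b-chat-hf/openvino_model.xml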

@Aniruddha521

@Wovchena
1) As you suggested, I checked /home/roy/Llama-2-7b-chat-hf/openvino_model.xml and fixed that part, but even after fixing it I get the following error: Image
2) Also, I manually added /openvino.genai/tools/continuous_batching to openvino_install/samples/cpp and compiled it with a few extra lines in its CMake:

find_package(OpenVINOGenAI REQUIRED
    HINTS
        "${CMAKE_BINARY_DIR}"  # Reuse the package from the build.
        ${OpenVINO_DIR}  # GenAI may be installed alongside OpenVINO.
    NO_CMAKE_FIND_ROOT_PATH
)

for both directories (i.e., accuracy and benchmark), but at the end I get the following errors: Image
Note: I used optimum-cli export openvino --trust-remote-code --model THUDM/chatglm3-6b chatglm3-6b-with-past --task text-generation-with-past to download chatglm3-6b.

@Wovchena
Collaborator

You can use chatglm as draft and main models for speculative_decoding_lm. That excludes Llama-2-7b-chat-hf from the list of problems.

Missing .xml files are strange. Every sample requires them to exist, and some samples already passed for you. Double-check the folder contents.

Undeclared beam_idx is also strange because every sample relies on it.

I forgot to mention that I've updated the main issue with a description of how to install the samples. But since you've already figured that out, no action is required, although your solution is different.

@Aniruddha521

Aniruddha521 commented Oct 25, 2024

@Wovchena
I re-executed optimum-cli export openvino --trust-remote-code --model THUDM/chatglm3-6b chatglm3-6b-with-past --task text-generation-with-past, which takes the model already downloaded to my default cache and compresses it (below are images of the whole process):
Image
Image

The image below shows the contents of the chatglm3-6b-with-past directory:
Image

And the image below shows the error I am still getting for speculative_decoding_lm, continuous_batching_benchmark, continuous_batching_accuracy, and continuous_batching_speculative_decoding:
Image

As you can see, the error is related to beam_idx. Can you guide me on what may have gone wrong or where I should check?

@Wovchena
Collaborator

I'm unable to reproduce the speculative_decoding_lm, continuous_batching_benchmark, and continuous_batching_accuracy issues. We can still try to investigate them with you as a background task. Meanwhile, you can proceed assuming that they work.

You can check the openvino_model.xml content. There should be a layer named beam_idx. Example:

		<layer id="0" name="beam_idx" type="Parameter" version="opset1">
			<data shape="?" element_type="i32" />
			<output>
				<port id="0" precision="I32" names="beam_idx">
					<dim>-1</dim>
				</port>
			</output>
		</layer>

continuous_batching_speculative_decoding requires -m and -a named args, not just paths. @iefode, is it possible to add validation for the cmd args and make them required?
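For a quick check without opening the file (a sketch using the model path from this thread):

# A non-zero count means the beam_idx Parameter is present in the IR.
grep -c 'name="beam_idx"' /home/roy/chatglm3-6b-with-past/openvino_model.xml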

@Aniruddha521

@Wovchena
Did you mean I should create a pull request while assuming speculative_decoding_lm, continuous_batching_benchmark, and continuous_batching_accuracy work? Also, I checked openvino_model.xml earlier and was able to locate the suggested portion:
Image

@Wovchena
Collaborator

Did you mean I should create a pull request while assuming speculative_decoding_lm, continuous_batching_benchmark, and continuous_batching_accuracy work?

Yes.

@ilya-lavrenov, maybe you can suggest something about the failing speculative_decoding_lm, continuous_batching_benchmark, and continuous_batching_accuracy?

@ilya-lavrenov
Contributor

ilya-lavrenov commented Oct 25, 2024

@Aniruddha521, which OpenVINO version are you using for inference?

It looks like the PA (PagedAttention) transformation has not worked correctly for ChatGLM3-6B.

@iefode
Contributor

iefode commented Oct 25, 2024

(Quoting @Wovchena's comment above about checking beam_idx and the continuous_batching_speculative_decoding args.)

I agree with you, @Wovchena, about making the speculative decoding args required. But I can say that the original problem reproduces in all CB samples, and I totally agree with @ilya-lavrenov that the PA transformation looks to have not worked correctly for ChatGLM3-6B.

@CuriousPanCake

(Quoting @Aniruddha521's beam_idx error report above.)

@Aniruddha521, please make sure you're using the latest version of OpenVINO. I've just run the model and inferred with it successfully.
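One way to confirm which OpenVINO build is actually picked up at runtime (a sketch; assumes the Python bindings are importable in the current environment):

# Prints a version string such as 2024.4.0-16579-...; it should match the build you expect.
python3 -c "import openvino; print(openvino.get_version())"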

@Aniruddha521

Aniruddha521 commented Oct 25, 2024

(Quoting my earlier build steps and per-sample results from above.)

I re-cloned openvino and openvino.genai and proceeded as mentioned in the steps above; the openvino version in my conda environment is 2024.4.0-16579-c3152d32c9c-releases/2024/4.
Could you please share the scripts or code snippets responsible for implementing the PA transformation and beam indexing? I'd like to explore them to deepen my understanding.
@ilya-lavrenov @Wovchena @iefode @CuriousPanCake

@CuriousPanCake

(Quoting @Aniruddha521's previous comment.)

I think the fix for your issue may not be in 2024.4.0, but it is present on the current master.

@Aniruddha521

Aniruddha521 commented Oct 25, 2024

@CuriousPanCake
I executed the commands below sequentially to build openvino with openvino genai:

git clone --recursive https://github.com/openvinotoolkit/openvino.git
git clone --recursive https://github.com/openvinotoolkit/openvino.genai.git
cd openvino
sudo ./install_build_dependencies.sh
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release ..
cmake --build . --parallel 14
cd --
cmake --install openvino/build --prefix openvino_install
source openvino_install/setupvars.sh
cd openvino.genai
cmake -DCMAKE_BUILD_TYPE=Release -S ./ -B ./build/
cmake --build ./build/ --config Release --parallel 14
cd ..
cmake --install openvino.genai/build/ --config Release --prefix openvino_install
cd openvino_install/samples/cpp
./build_samples.sh
cd --

If there is anything I have missed, please let me know.
You mentioned that this issue can be resolved by using the current master; can you provide more clarity on that? I also tried export PYTHONPATH=Path_to_cloned_directory, but the result remained the same.
Also, can you share the build commands you used?

@Aniruddha521

Image
Image
Image

Can anyone help me with this? I am getting this error while running the tests from https://github.com/openvinotoolkit/openvino.genai/tree/master/tests/python_tests#customise-tests-run.
Also, the version of openvino_genai is 2024.5.0.0 in the build prefix (openvino_install), whereas in my conda environment it is 2024.4.0.0, and when using pip install openvino-genai==2024.5.0.0 it shows:

ERROR: Could not find a version that satisfies the requirement openvino-genai==2024.5.0.0 (from versions: 2024.2.0.0, 2024.3.0.0, 2024.4.0.0, 2024.4.1.0.dev20240926)
ERROR: No matching distribution found for openvino-genai==2024.5.0.0

which I think is because 2024.5.0.0 is not released yet.

@ilya-lavrenov @Wovchena @iefode @CuriousPanCake

@ilya-lavrenov
Contributor

and when using pip install openvino-genai==2024.5.0.0 it shows:

OpenVINO 2024.5.0 is not released yet. It's available as a pre-release package and should be installed with the extra options --pre --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly

See https://docs.openvino.ai/2024/get-started/install-openvino.html?PACKAGE=OPENVINO_BASE&VERSION=NIGHTLY&OP_SYSTEM=WINDOWS&DISTRIBUTION=PIP
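For example (a sketch following those docs; the exact package set is an assumption, trim it to what you need):

# Pre-release wheels come from the nightly index, not PyPI.
python3 -m pip install --pre openvino openvino-tokenizers openvino-genai --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly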

@Wovchena
Collaborator

You mentioned that this issue can be resolved by using the current master; can you provide more clarity on that?

I was able to run speculative_decoding_lm from docker: sudo docker run -it ubuntu:20.04 /bin/bash. You can try the same to verify it works for you. If it passes, you need to find which part of your steps diverged.

cd ~
apt update
apt install git python3.9 -y
apt install python3.9-dev -y
git clone --recursive https://github.com/openvinotoolkit/openvino.git
git clone --recursive https://github.com/openvinotoolkit/openvino.genai.git
cd openvino
./install_build_dependencies.sh
mkdir build && cd build
cmake -DENABLE_PYTHON=ON -DPython3_EXECUTABLE=/usr/bin/python3.9 -DCMAKE_BUILD_TYPE=Release ..
cmake --build . --parallel 14
cd --
cmake --install openvino/build --prefix openvino_install
source openvino_install/setupvars.sh
cd openvino.genai
cmake -DCMAKE_BUILD_TYPE=Release -S ./ -B ./build/
cmake --build ./build/ --config Release --parallel 14
cd ..
cmake --install openvino.genai/build/ --config Release --prefix openvino_install
cd openvino_install/samples/cpp
./build_samples.sh
cd --
python3.9 -m pip install -r ~/openvino.genai/samples/requirements.txt
export PYTHONPATH=/root/openvino_install/python/
python3.9 -m pip install openvino.genai/thirdparty/openvino_tokenizers/ --pre --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly
optimum-cli export openvino --trust-remote-code --task text-generation-with-past --model THUDM/chatglm3-6b chatglm3-6b
./openvino_cpp_samples_build/intel64/Release/speculative_decoding_lm chatglm3-6b/ chatglm3-6b/ "Why is the Sun yellow?"

@Aniruddha521

Aniruddha521 commented Oct 29, 2024

(Quoting @Wovchena's docker instructions above.)

@Wovchena

I too needed to proceed with almost the same sequence of commands, but I have Ubuntu 24 and Python 3.11, and the extra line python3.9 -m pip install openvino.genai/thirdparty/openvino_tokenizers/ --pre --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly was the one I was missing. But I also noticed a problem: if I skip the export PYTHONPATH=/root/openvino_install/python/ line, I get the following error:
Image
Image

while if I don't skip it, then:
Image
Image

@Wovchena
Collaborator

Were you able to reproduce it in docker?

@Aniruddha521

Were you able to reproduce it in docker?

Probably yes; after executing ./openvino_cpp_samples_build/intel64/Release/speculative_decoding_lm chatglm3-6b/ chatglm3-6b/ "Why is the Sun yellow?", my laptop would lag and remain non-responsive for a few minutes, after which it printed "Killed", maybe due to the high computational resource requirements?

@Aniruddha521

@Wovchena

I proceeded as mentioned in the task #259 with the following changes:
1) Extended nightly_models in the file openvino.genai/tests/python_tests/ov_genai_test_utils.py:

nightly_models = [
        "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
        "facebook/opt-125m",
        "microsoft/phi-1_5",
        "microsoft/phi-2",
        "THUDM/chatglm2-6b",
        "THUDM/chatglm3-6b", # no beam_search
        "Qwen/Qwen2-0.5B-Instruct",
        "Qwen/Qwen-7B-Chat",
        "Qwen/Qwen1.5-7B-Chat",
        "argilla/notus-7b-v1",
        "HuggingFaceH4/zephyr-7b-beta",
        "ikala/redpajama-3b-chat",
        "mistralai/Mistral-7B-v0.1",

2) Added the model to cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu-Chatglm3-6b.

If I missed anything or any modification is needed, please let me know; I will be glad to make the changes. I appreciate any help.
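For reference, a hypothetical local run of the extended nightly tests filtered to the new model (exact paths, markers, and requirements file are assumptions; see the python_tests README linked earlier):

# Run only the chatglm-related Python tests from the openvino.genai checkout.
python3 -m pip install -r tests/python_tests/requirements.txt
python3 -m pytest tests/python_tests/ -k chatglm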

@Wovchena
Collaborator

You also need to extend the supported models list. Add a note that beam_search_causal_lm isn't supported. Where can I find the pull request?

@Aniruddha521

@Wovchena
You can find the pull request here.

@mlukasze mlukasze linked a pull request Nov 4, 2024 that will close this issue
@mlukasze mlukasze moved this from Assigned to In Review in Good first issues Nov 4, 2024