From 543bcf6f2d1273bf027236e34eb047f35f56bb0c Mon Sep 17 00:00:00 2001 From: Tsai-Shien Chen Date: Sat, 27 Apr 2024 23:57:35 -0700 Subject: [PATCH] Update README.md --- captioning/README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/captioning/README.md b/captioning/README.md index 5645aea..ee41f8f 100755 --- a/captioning/README.md +++ b/captioning/README.md @@ -1,5 +1,7 @@ # 🐼 Panda-70M: Video Captioning +**[Note] To use our captioning code, please make sure you follow [the guideline](https://github.com/lm-sys/FastChat/blob/main/docs/vicuna_weights_version.md#how-to-apply-delta-weights-only-needed-for-weights-v0) and correctly prepare vicuna-7b-v0 weight. Basically, you need to first download the original weights and then apply delta weights. Improper weights preparation will lead to meaningless outputs.** + ## Introduction We propose a video captioning model to generate a caption for a short video clip. The model includes vision (green) and textual (blue) branches to benefit video captioning by both video and text inputs.