diff --git a/captioning/README.md b/captioning/README.md index 5645aea..ee41f8f 100755 --- a/captioning/README.md +++ b/captioning/README.md @@ -1,5 +1,7 @@ # 🐼 Panda-70M: Video Captioning +**[Note] To use our captioning code, please make sure you follow [the guideline](https://github.com/lm-sys/FastChat/blob/main/docs/vicuna_weights_version.md#how-to-apply-delta-weights-only-needed-for-weights-v0) and correctly prepare vicuna-7b-v0 weight. Basically, you need to first download the original weights and then apply delta weights. Improper weights preparation will lead to meaningless outputs.** + ## Introduction We propose a video captioning model to generate a caption for a short video clip. The model includes vision (green) and textual (blue) branches to benefit video captioning by both video and text inputs.