diff --git a/captioning/README.md b/captioning/README.md
index 5645aea..ee41f8f 100755
--- a/captioning/README.md
+++ b/captioning/README.md
@@ -1,5 +1,7 @@
 # 🐼 Panda-70M: Video Captioning
 
+**[Note] To use our captioning code, please make sure you follow [the guideline](https://github.com/lm-sys/FastChat/blob/main/docs/vicuna_weights_version.md#how-to-apply-delta-weights-only-needed-for-weights-v0) and correctly prepare vicuna-7b-v0 weight. Basically, you need to first download the original weights and then apply delta weights. Improper weights preparation will lead to meaningless outputs.**
+
 ## Introduction
 We propose a video captioning model to generate a caption for a short video clip.
 The model includes vision (green) and textual (blue) branches to benefit video captioning by both video and text inputs.