Question about the retriever training #30

szhang42 · 2023-06-07T00:02:00Z

Hello,

I have a question about retriever training. I first used the provided NQ reader (nq_reader_base) to generate the cross-attention score JSON file for the Natural Questions training set, using the command below.

python test_reader.py \
  --model_path ./pretrained_models/nq_reader_base \
  --eval_data ./nq_train.json \
  --per_gpu_batch_size 1 \
  --n_context 100 \
  --name likelihood_debug_4 \
  --checkpoint_dir checkpoint \
  --write_crossattention_scores
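
Before training on this file, it may be worth confirming that the scores were actually written for every passage. The following is only a minimal sketch: it assumes the output is a JSON list of examples, each with a `ctxs` list whose passages carry a `score` field. The function name `summarize_scores`, the field names, and the tiny synthetic file are assumptions for illustration, not taken from the repository.

```python
import json
import os
import tempfile


def summarize_scores(path, ctx_key="ctxs", score_key="score"):
    """Count passages with missing scores and examples whose scores are all equal.

    Assumes (hypothetically) a JSON list of examples, each holding a list of
    passages under `ctx_key`, with an optional numeric `score_key` per passage.
    """
    with open(path) as f:
        examples = json.load(f)

    passages = 0
    missing = 0
    constant = 0
    for example in examples:
        ctxs = example.get(ctx_key, [])
        scores = [float(c[score_key]) for c in ctxs if score_key in c]
        passages += len(ctxs)
        missing += len(ctxs) - len(scores)
        # Identical scores across passages give the retriever no ranking signal.
        if len(scores) > 1 and len(set(scores)) == 1:
            constant += 1

    return {
        "examples": len(examples),
        "passages": passages,
        "missing_scores": missing,
        "constant_score_examples": constant,
    }


# Synthetic stand-in for the generated file; replace with the real path.
demo = [
    {"question": "q1", "ctxs": [{"text": "a", "score": 0.7}, {"text": "b", "score": 0.1}]},
    {"question": "q2", "ctxs": [{"text": "c", "score": "0.5"}, {"text": "d"}]},
]
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as tmp:
    json.dump(demo, tmp)

stats = summarize_scores(tmp.name)
os.remove(tmp.name)
print(stats)
```

If `missing_scores` or `constant_score_examples` is nonzero on the real file, the training data itself would be a plausible place to look before tuning anything else.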

Then, using the generated cross-attention score JSON file as the training data, I loaded the provided retriever checkpoint for NQ and continued training on the Natural Questions training set with the command below. Performance keeps decreasing the longer I fine-tune. Could you please share your thoughts on what might cause this? Thanks very much!

python train_retriever.py \
  --lr 1e-4 \
  --optim adamw \
  --scheduler linear \
  --train_data nqtrain_data_wtih_crossattention.json \
  --eval_data nqdev_data.json \
  --n_context 100 \
  --total_steps 20000 \
  --scheduler_steps 30000

xionghao132 · 2023-11-16T12:04:30Z

@szhang42 Hi, did you solve this problem with the retriever training?
