Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about the retriever training #30

Open
szhang42 opened this issue Jun 7, 2023 · 1 comment
Open

Question about the retriever training #30

szhang42 opened this issue Jun 7, 2023 · 1 comment

Comments

@szhang42
Copy link

szhang42 commented Jun 7, 2023

Hello,

I have questions for the retriever training. I first use the provided nq reader base to generate the cross attention score json file for Natural Question training json using the below code.

python test_reader.py
--model_path ./pretrained_models/nq_reader_base
--eval_data ./nq_train.json
--per_gpu_batch_size 1
--n_context 100
--name likelihood_debug_4
--checkpoint_dir checkpoint
--write_crossattention_scores \

Then, with the above generated cross-attention score json file as the training file, I loaded provided retriever checkpoint for NQ and continued the training on the Natural Question training dataset using the provided code below. The performance continues to decrease as I finetune this longer. If possible, could you please let me know your thoughts on this? Thanks very much!

python train_retriever.py
--lr 1e-4
--optim adamw
--scheduler linear
--train_data nqtrain_data_wtih_crossattention.json
--eval_data nqdev_data.json
--n_context 100
--total_steps 20000
--scheduler_steps 30000 \

@xionghao132
Copy link

@szhang42 Hi,do you solve this problem for the retriever training?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants