This repository releases the code of paper DIGAT: Modeling News Recommendation with Dual-Graph Interaction (EMNLP-2022 Findings).
The experiments are conducted on MIND-small and MIND-large. Our code tries to download and extract MIND-small and MIND-large to the directories ../MIND-small
and ../MIND-large
. Since MIND is quite large, if our code cannot download it due to network issues, please manually download MIND with the script download_MIND.sh
.
Dependencies are needed to be installed by
bash install_dependencies.sh
Our experiments require python>=3.7, torch==1.12.0, and torch_scatter==2.0.9. The torch_scatter package is necessary. If the dependency installation fails, please follow https://github.com/rusty1s/pytorch_scatter to install the package manually.
Training DIGAT
python main.py --graph_encoder=DIGAT
# Use two GPUs to train DIGAT with DDP
python -m torch.distributed.launch --nproc_per_node=2 main.py --graph_encoder=DIGAT
Experiments in Section 4.4
python main.py --graph_encoder=wo_SA
python main.py --graph_encoder=Seq_SA
Experiments in Section 4.5
python main.py --graph_encoder=wo_interaction
python main.py --graph_encoder=news_graph_wo_inter
python main.py --graph_encoder=user_graph_wo_inter
Experiments in Section 4.6
python main.py --graph_encoder=DIGAT --SAG_neighbors=1 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=2 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=3 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=4 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=5 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=6 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=7 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=8 --SAG_hops=2
python main.py --graph_encoder=DIGAT --SAG_neighbors=3 --SAG_hops=1
python main.py --graph_encoder=DIGAT --SAG_neighbors=3 --SAG_hops=3
python main.py --graph_encoder=DIGAT --SAG_neighbors=3 --SAG_hops=4
Experiments in Section 4.7
python main.py --graph_encoder=DIGAT --graph_depth=1
python main.py --graph_encoder=DIGAT --graph_depth=2
python main.py --graph_encoder=DIGAT --graph_depth=3
python main.py --graph_encoder=DIGAT --graph_depth=4
python main.py --graph_encoder=DIGAT --graph_depth=5
python main.py --graph_encoder=DIGAT --graph_depth=6
python main.py --graph_encoder=DIGAT --graph_depth=7
The experiment dataset can be specified by the config parameter --dataset=[MIND-small,MIND-large] (default MIND-small)
.
python main.py --dataset=MIND-small
python main.py --dataset=MIND-large
For MIND-large, please submit the model prediction file to MIND leaderboard for performance evaluation. For example, having finished training model #1, the model prediction file is at prediction/MIND-large/MSA-DIGAT/#1/prediction.zip
. If the prediction zip file is not found, please find the raw prediction file at test/MIND-large/res/MSA-DIGAT/best_model_MIND-large_MSA-DIGAT_#1_MSA-DIGAT/MSA-DIGAT.txt
.
We had benchmarked and found that the most computation overhead comes from Eq. (8) in the paper. Hence, we perform partial quantization in computing Eq. (8). For the faster inference code, please checkout to the branch faster-inference
:
git checkout faster-inference
In faster-inference
mode, the inference time on MIND-small reduces from around 600s to 400s (benchmarked on Nvidia 3090), where we observe no AUC/MRR/nDCG performance degradation (accurate to 1e-4).
It is also worth noting that 1) quantization is only partially performed in Eq. (8), the other parts of DIGAT are still computed in fp32. 2) Do NOT perform quantization in training DIGAT, which will degrade the performance. 3) Concretely, the computation overhead lies within the huge broadcast-add K3 + K1 + K2. This broadcast-add may be optimized by tailored efficient CUDA operator in the future.
- DIGAT + PLM news encoder
- FP16 training
P.S. As the majority of this work is done in CUHK, we do not have enough GPUs to train PLM news encoder on MIND-large (due to millions of user logs). We hope to experiment DIGAT with PLM news encoder in the future. For anyone interested in training DIGAT with PLM news encoder, please feel free to contact us via [email protected], and we are delighted to share the code.
@inproceedings{mao-etal-2022-digat,
title = "{DIGAT}: Modeling News Recommendation with Dual-Graph Interaction",
author = "Mao, Zhiming and
Li, Jian and
Wang, Hongru and
Zeng, Xingshan and
Wong, Kam-Fai",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022",
month = dec,
year = "2022",
address = "Abu Dhabi, United Arab Emirates",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.findings-emnlp.491",
pages = "6595--6607"
}
The paper was first submitted to ARR 2021 November [link].