This repository contains the implementation of the paper:
"Retrieval-Augmented Neural Team Formation"
This work introduces a Retrieval-Augmented Generation (RAG) model that integrates historical collaboration data with required skill sets, enabling the selection of expert teams with both the necessary competencies and a proven track record of effective teamwork.
- Encoder: Custom T5-based encoder optimized with contrastive learning to generate semantic embeddings for skill sets.
- Retriever: Efficient FAISS-based retriever to extract relevant historical team formations.
- Generator: Transformer-based generator tailored to propose expert teams with cohesive collaboration dynamics.
- Datasets: Includes DBLP and Dota2 datasets for team formation experiments.
- Evaluation: Implements metrics such as Recall, MAP, and NDCG.
rag-team-formation/
├── datasets/ # Sample datasets for experiments
├── src/ # Source code
│ ├── encoder/ # Encoder training and modules
│ ├── generator/ # Generator training and modules
│ ├── model/ # RAG inference and utilities
│ ├── tokenizer/ # Tokenizer-related scripts
│ └── utils/ # Helper functions
├── figures/ # Model diagrams and result visualizations
├── LICENSE # License information
├── README.md # Project documentation
└── requirements.txt # Python dependencies
-
Clone the repository:
git clone https://github.com/MohammadDara/Retrieval-Augmented-Neural-Team-Formation.git cd Retrieval-Augmented-Neural-Team-Formation
-
Install dependencies:
pip install -r requirements.txt
Train the encoder with contrastive learning:
python src/encoder/train_encoder.py
Pretrain the generator module:
python src/generator/pretrain_generator.py
Fine-tune the generator with RAG:
python src/generator/train_generator.py
Generate expert teams using the trained RAG model:
python src/model/rag.py
If you use this code in your research, please cite the paper:
@inproceedings{RAG_Teamformation,
title={Retrieval-Augmented Neural Team Formation},
author={},
booktitle={},
year={2025}
}