Skip to content

Commit

Permalink
Merge pull request FlagOpen#1245 from hanhainebula/master
Browse files Browse the repository at this point in the history
release training data for bge-multilingual-gemma2
  • Loading branch information
hanhainebula authored Nov 19, 2024
2 parents 2efb004 + a547347 commit 3fdf27c
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions dataset/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@ This will point to the training data we use for training various models.
| [bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data) | Fine-tuning data used by [bge-m3](https://huggingface.co/BAAI/bge-m3) |
| [public-data](https://huggingface.co/datasets/cfli/bge-e5data) | Public data identical to [e5-mistral](https://huggingface.co/intfloat/e5-mistral-7b-instruct) |
| [full-data](https://huggingface.co/datasets/cfli/bge-full-data) | The full dataset we used for training [bge-en-icl](https://huggingface.co/BAAI/bge-en-icl) |
| [bge-multilingual-gemma2-data](https://huggingface.co/datasets/hanhainebula/bge-multilingual-gemma2-data) | The full multilingual dataset we used for training [bge-multilingual-gemma2](https://huggingface.co/BAAI/bge-multilingual-gemma2) |
| [reranker-data](Shitao/bge-reranker-data) | a mixture of multilingual datasets |


0 comments on commit 3fdf27c

Please sign in to comment.