You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi~
May I ask if the data in ./examples/sample_data/pre provided by the repository is not the complete pretraining dataset?
If so, do you know which dataset DNABERT used? Did the author mention it in the paper?
I was just wondering if you have released the exact dataset that you used for pretraining DNABERT1 and DNABERT2?
I would be interested in doing some ablation studies using this dataset.
Thank you,
LeAnn
The text was updated successfully, but these errors were encountered: