Scripts for fine-tuning pretrained language models on custom datasets, e.g.:
- for fine-tuning SciBERT on the COVID-19 Open Research Dataset (CORD-19) using the Hugging Face Transformers library (a fine-tuning sketch follows this list)
- for fine-tuning BERT on the ACL Anthology Reference Corpus
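
As an illustration of what the fine-tuning step looks like, here is a minimal masked-language-modeling sketch using the Transformers `Trainer` API. The checkpoint name `allenai/scibert_scivocab_uncased` is the public SciBERT release; the file path and hyperparameters are placeholders, not necessarily the settings used by the scripts in this repo.

```python
# Minimal sketch: masked-LM fine-tuning of SciBERT with Hugging Face Transformers.
# cord19_abstracts.txt and the hyperparameters below are placeholders.
from transformers import (
    AutoTokenizer,
    AutoModelForMaskedLM,
    DataCollatorForLanguageModeling,
    LineByLineTextDataset,
    Trainer,
    TrainingArguments,
)

model_name = "allenai/scibert_scivocab_uncased"  # public SciBERT checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

# One document (e.g. one CORD-19 abstract) per line in a plain-text file.
dataset = LineByLineTextDataset(
    tokenizer=tokenizer,
    file_path="cord19_abstracts.txt",  # hypothetical path
    block_size=128,
)

# Randomly mask 15% of tokens: the standard BERT MLM objective.
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15
)

training_args = TrainingArguments(
    output_dir="scibert-cord19",
    num_train_epochs=1,
    per_device_train_batch_size=8,
)

trainer = Trainer(
    model=model,
    args=training_args,
    data_collator=collator,
    train_dataset=dataset,
)
trainer.train()
trainer.save_model("scibert-cord19")
```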
Examples are provided for using the models (SciBERT, fine-tuned SciBERT, and the original BERT) for extractive summarization, and GPT-2 for text generation (see the sketch below).
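
A minimal sketch of the two downstream uses, assuming the `bert-extractive-summarizer` package (Derek Miller's library) for summarization and the Transformers `pipeline` API for GPT-2 generation; the input file and prompt are placeholders:

```python
from summarizer import Summarizer
from transformers import pipeline

text = open("paper.txt").read()  # hypothetical input document

# Extractive summarization: BERT embeds each sentence, and clustering selects
# the most representative ones. Summarizer() uses its default BERT model;
# a fine-tuned checkpoint can be supplied via custom_model/custom_tokenizer.
bert_model = Summarizer()
print(bert_model(text, ratio=0.2))  # keep ~20% of the sentences

# Text generation with GPT-2 via the Transformers pipeline API.
generator = pipeline("text-generation", model="gpt2")
print(generator("Recent COVID-19 research shows", max_length=50)[0]["generated_text"])
```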
Copyright for the papers belongs to the ACL. The notebooks adapt originals by Chris Callison-Burch and Derek Miller.