ESCF-CFT

Download training data

You can get the training data from this website.
This project uses the following datasets:

empathetic_dialogues
eSNL
gigaword
eli5
wiki_auto

Prepare training data

Place the data in the corresponding folders, following the directory structure shown in the diagram.

Run the Python script for preprocessing as follows:

cd dataset/rawdata/scripts/
python prepare_trainingdata.py

Train model with 5 datasets

Change to the root directory of the project, then run the script. Note: all bash scripts must be run from the project root.

bash scripts/train-on-5-instruction-task.sh
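
Because every bash script is expected to run from the project root, a small wrapper can make that explicit instead of relying on the caller's current directory. The following is a minimal sketch, not part of the repository: the helper name `run_from_root` is hypothetical, and it only assumes that the script path is given relative to the project root.

```python
import subprocess
from pathlib import Path


def run_from_root(script: str, root: Path) -> subprocess.CompletedProcess:
    """Run a bash script with the project root as the working directory.

    `script` is a path relative to `root`, e.g.
    "scripts/train-on-5-instruction-task.sh".
    (Hypothetical helper; not provided by the project.)
    """
    root = Path(root).resolve()
    script_path = root / script
    if not script_path.is_file():
        raise FileNotFoundError(f"{script} not found under {root}")
    # cwd=root makes relative paths inside the script resolve against the root.
    # check=True raises CalledProcessError if the script exits non-zero.
    return subprocess.run(["bash", script], cwd=root, check=True)


# Example call (assumes the repository is checked out at ./ESCF-CFT):
# run_from_root("scripts/train-on-5-instruction-task.sh", Path("ESCF-CFT"))
```

The same helper would work for the other scripts mentioned in this README, since they share the requirement of being launched from the root directory.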

Evaluate model on instruction tasks

Generate answers with the base model and the tuned model:

bash scripts/generate-pred-result.sh

Evaluate scores

bash scripts/evaluate-on-instruction-tasks.sh

Evaluate model on general tasks

Use lm-evaluation-harness to evaluate scores on general tasks.

Install

To install the lm-eval package from the GitHub repository, run:

git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .

Evaluate

The general tasks comprise 3 sub-tasks:

Domain Knowledge
Reasoning
Reading Comprehension

Set the sub-task you want to evaluate in the script, then launch it:

bash scripts/evaluate-on-general-tasks.sh
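
To illustrate what selecting a sub-task might look like, the sketch below builds an `lm_eval` command line for one sub-task. It is an assumption-laden example, not the contents of `scripts/evaluate-on-general-tasks.sh`: the mapping from sub-tasks to harness task names (`mmlu`, `bbh`, `race`) is a placeholder chosen for illustration, and the model path and function name are hypothetical. The flags `--model`, `--model_args`, `--tasks`, `--batch_size`, and `--output_path` are those of the lm-evaluation-harness CLI.

```python
import shlex

# Placeholder mapping (assumption): the README does not say which harness
# tasks back each sub-task, so adjust these to match the project's script.
SUBTASK_TO_TASKS = {
    "domain_knowledge": ["mmlu"],
    "reasoning": ["bbh"],
    "reading_comprehension": ["race"],
}


def build_lm_eval_command(model_path, subtask, batch_size=8, output_path="results"):
    """Return the lm_eval argument list for one sub-task (hypothetical helper)."""
    tasks = SUBTASK_TO_TASKS[subtask]  # raises KeyError for an unknown sub-task
    return [
        "lm_eval",
        "--model", "hf",
        "--model_args", f"pretrained={model_path}",
        "--tasks", ",".join(tasks),
        "--batch_size", str(batch_size),
        "--output_path", output_path,
    ]


# "finetuned_models/my-model" is a hypothetical checkpoint path.
cmd = build_lm_eval_command("finetuned_models/my-model", "reasoning")
print(shlex.join(cmd))
```

The printed command can be pasted into a shell or passed to `subprocess.run`; keeping the mapping in one dictionary makes it easy to switch sub-tasks without editing the command itself.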

Plot results

Plot the results with the Python notebook plot_results.ipynb in the root directory of this repo.

zhaoningyuan/ESCF-CFT
