A project about construction of spatial-temporal knowledge graph, then analysis, tracking, visuzalization of the potential infection paths on the knowledge graph.
The main contributions of this paper are as follows:
• I design and implement an end-to-end automated analysis framework AVESA , which solves the inefficiency of traditional manual epidemiological analysis.
• I design and implement a knowledge extraction model based on the combination of horizontal fusion, vertical fusion and deep learning, which solves the problem of low performance of directly transfer models trained in other fields.
• I design and implement a complete set of the automated construction process of the spatial-temporal knowledge graph with 550 nodes and 1385 edges based on existing case reports text.
• I establish this complete reasoning rules with different dimensions based on the epidemiological knowledge graph and present visualized results. These results provide an essential basis for epidemiological survey workers to select the next epidemic control strategies.
• I design a series of comparative experiments for applications, conduct experiments, compare results, and finally demonstrate the feasibility and advantages of the AVESA framework.
root: sourceCode/nlpProcess/bertBased(foolNer)/model
main.py: Model training
eval.py: Model evaluation
cdc_ner.py: NER inference
Trained models will be saved in sourceCode/nlpProcess/bertBased(fullNer)/res/[model_name]
Settings
--batch-size 12
--epochs 5
--lr 3e-5
--resume False
Training
python main.py --policy [model_name,such as Bert_MLP]
Evaluation results will be saved in sourceCode/nlpProcess/bertBased(fullNer)/res/eval.log
Single model evaluation
python eval.py --policy [model_name,such as Bert_MLP]
Ensemble model evaluation
python eval.py --ensemble [split with ",",such as Bert_MLP,BERT_CRF]
Outputs will be saved in 在sourceCode/nlpRes
Single model
python cdc_ner.py --policy [model_name,such as Bert_MLP]
Ensemble model
python cdc_ner.py --ensemble [split with ",",suchc as Bert_MLP,BERT_CRF]
cd sourceCode/nlpRes
python merge.py
Final results will be saved in mergeRes.txt,the content is as following:
{"text": "...", "entities": [[4, 7, "name", "..."], [8, 11, "name", "..."], [14, 17, "Age", "..."], [21, 23, "position", "..."], [70, 84, "ResidencePlace", "..."]}
...
root: sourceCode/nlpToKG
python generate.py
Refer to files in sourceCode/nlpToKG
Refer to pipeline.bat