Chinese_Humor_MultiLabeled

Human-labeled Chinese jokes and the Python code used to validate them.

1. Description of Files:

    Chinese_Humor_Multi-Labeled.xlsx: the human-labeled dataset.
    Excel2txt.py: the Python script that extracts texts and labels from the above Excel file for experiments.
    mlabel_corpora: the folder containing the extracted texts and labels for experiments.
    tools: the Jupyter Notebook and Python code for humor comprehension experiments.
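
The extraction step described above (Excel file to plain-text texts and labels) might look roughly like the sketch below. It is not the contents of Excel2txt.py: the column names `text` and `label`, the tab-separated output layout, and the helper names `rows_to_lines` and `excel_to_txt` are all assumptions made for illustration, so check the actual spreadsheet headers and the files in mlabel_corpora before relying on it.

```python
from pathlib import Path


def rows_to_lines(rows, text_col="text", label_cols=("label",)):
    """Turn row dicts into 'label1<TAB>label2<TAB>text' lines.

    The column names are hypothetical; the real spreadsheet may differ.
    """
    lines = []
    for row in rows:
        # Keep each joke on one line so that one line equals one sample.
        text = str(row[text_col]).replace("\n", " ").replace("\t", " ").strip()
        labels = [str(row[col]) for col in label_cols]
        lines.append("\t".join(labels + [text]))
    return lines


def excel_to_txt(xlsx_path, out_path, text_col="text", label_cols=("label",)):
    """Read the workbook with pandas and write the extracted lines as UTF-8."""
    import pandas as pd  # imported lazily; needs pandas plus openpyxl for .xlsx

    frame = pd.read_excel(xlsx_path)
    lines = rows_to_lines(frame.to_dict(orient="records"), text_col, label_cols)
    Path(out_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)
```

With the assumed column names, a call such as `excel_to_txt("Chinese_Humor_Multi-Labeled.xlsx", "jokes.txt")` would write one joke per line; `jokes.txt` is a placeholder output name.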

2. To cite this dataset, source code, or experimental results:

Yuen-Hsien Tseng, Wun-Syuan Wu, Chia-Yueh Chang, Hsueh-Chih Chen and Wei-Lun Hsu, "Development and Validation of a Corpus for Machine Humor Comprehension", 12th Language Resources and Evaluation Conference (LREC 2020), Marseille, France, May 11-16, 2020.