The Turnip Triple Scorer for entity-relation ranking This project is under active development for my undergraduate thesis. See WSDM cup for more details.
To run the cnnrank for classifying single person profession pair:
python cnnrank.py -i data/ -o output/ -pro
where -i specifies the directory for input data. Currently train files are hardcoded. The format for train file is:
<PersonName><Tab><ProfessionName><Tab><TextLength><Tab><TrainText>
The last column could be further tab separated for different text sources.
- cnnrank.py is for experiments with CNNs and LSTMs and exploring their usability
For a comprehensive overview of the project head over to the blog post as well as the poster