[DESCRIPTION]
A tool for preprocessing of text data in Japanese for further machine learning. It uses MeCab for tokenization and part-of-speech tagging and Cabocha for shallow and deep parsing.
usage:
bash main.sh input_file.exe
[DEPENDENCIES]
MeCab, MeCab Perl binding, Cabocha.