Skip to content

Latest commit

 

History

History

tos_dictionary

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

Part 2: Dictionary (tos_dictionary/)

Merge scraped data, append zh-tw language, and export a lookup table. Implemented by pandas.

The language mapping file is exported from project Tos-Translator.

Usage:

  1. Prepare latest language mapping (langmap) files into langmap. The langmap files could be exported from Tos-Translator by function matcher.export('tw', 'map', output_path / 'langmap').

  2. python langmap_merge.py: to merge langmap files into single file langmap/merged.tsv.

  3. python dictionary_gen.py: export lookup table onto output/*.json.