Taiwanese Hokkien Transliterator and Tokeniser
App to customise transliteration and retrieve any necessary information about Taiwanese Hokkien pronunciation.
Table of Contents
Taibun is a comprehensive tool for transliterating and tokenising Taiwanese Hokkien. Developed as a demo for the Taibun library, this web application showcases its capabilities by allowing users to customise the transliteration of Chinese characters and retrieve detailed pronunciation information.
The application is designed to streamline the process of transliterating and tokenising Taiwanese Hokkien text, providing users with a user-friendly interface and a variety of customisation options that are supported by Taibun.
Run the app locally:
$ npm install
$ npm run dev
The website can be accessed through the URL http://localhost:5173/
.
Transliterator Page allows users to convert Chinese characters to the chosen transliteration system with customisable parameters such as system, dialect, tone format, delimiter, sandhi rules, and punctuation style.
Tokeniser Page performs tokenisation of Taiwanese Hokkien sentences, breaking down the text into individual words or tokens.
Set Converter Page converts text between Simplified and Traditional Chinese characters for Taiwanese Hokkien.
Users can switch between English, Taiwanese, and Mandarin from the language selector in the Navbar.
Because Taibun is MIT-licensed, any developer can essentially do whatever they want with it as long as they include the original copyright and licence notice in any copies of the source code.