GitHub - gogabr/ml-decode: Speech decoder written in Standard ML

This is a speech decoder written in Standard ML. It is compiled with MLton and uses the Kaldi library (http://kaldi.sourceforge.net) to convert .wav files into MFCC features. It also uses Kaldi's acoustic model files (transition model + diagonal GMM) and OpenFST's WFST format (const FST). The algorithm is state-equivalent to Kaldi's faster-decoder and, in my tests, runs about 20% slower on a large-vocabulary task (Russian geographical queries).
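For readers unfamiliar with the "diagonal GMM" part of the acoustic model, here is a minimal sketch of how the log-likelihood of one feature vector under a diagonal-covariance Gaussian mixture can be computed. It is not taken from the ml-decode sources: the `component` record and the names `logGauss`, `logSumExp`, and `logGmm` are hypothetical, and the actual code may organize this computation quite differently.

```sml
(* Illustrative only: these names and this data layout are assumptions,
   not ml-decode's actual acoustic-model interface. *)

(* One mixture component: weight, per-dimension means and variances. *)
type component = { weight : real, mean : real vector, variance : real vector }

(* Log-density of a diagonal-covariance Gaussian at feature vector x.
   Because the covariance is diagonal, the density factors per dimension. *)
fun logGauss (mean : real vector, variance : real vector) (x : real vector) : real =
  let
    fun term (i, xi, acc) =
      let
        val v = Vector.sub (variance, i)
        val d = xi - Vector.sub (mean, i)
      in
        acc + Math.ln (2.0 * Math.pi * v) + d * d / v
      end
  in
    ~0.5 * Vector.foldli term 0.0 x
  end

(* Numerically stable log (sum_i exp a_i); assumes a non-empty list
   containing at least one finite value. *)
fun logSumExp (xs : real list) : real =
  let
    val m = List.foldl Real.max Real.negInf xs
  in
    m + Math.ln (List.foldl (fn (a, s) => s + Math.exp (a - m)) 0.0 xs)
  end

(* Log-likelihood of x under the mixture: log sum_m w_m * N(x; mean_m, var_m). *)
fun logGmm (comps : component list) (x : real vector) : real =
  logSumExp
    (List.map
       (fn {weight, mean, variance} =>
          Math.ln weight + logGauss (mean, variance) x)
       comps)
```

With a single component of weight 1.0, mean 0.0 and variance 1.0, `logGmm` evaluated at 0.0 gives -0.5 * ln(2π), roughly -0.919. Working in the log domain with `logSumExp` avoids underflow when the per-component densities are very small, which is common with high-dimensional MFCC features.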

The reason for writing this was my frustration with having to work with C++ code.

I tried reimplementing the decoder in Haskell (modifying the excellent Husky software by Takahiro Shinozaki, http://sourceforge.net/projects/skyhusky/), but could not get reasonable performance. MLton proved easier to tame than GHC.

You are free to use and modify ml-decode in any way you like. Patches are welcome as well.

Repository files (39 commits):

.gitignore
Makefile
README.md
acoustic-model.sig
acoustic-model.sml
config.txt
ctl
decoder.sig
decoder.sml
fst.sig
fst.sml
kaldi-funs.sig
kaldi-funs.sml
kaldi-input.sig
kaldi-input.sml
kaldi-interface.cc
mfc.sig
mfc.sml
mono-partition-fun.sml
mono-partition.sig
read-bin.sig
read-bin.sml
test.mlb
test.sml
util.sig
util.sml
