- vector (integer)
- Vector (float, double)
- Matrix (float, double)
- Posterior (posteriors, nnet1 training targets, confusion networks, ...)
import kaldi_io
for key,mat in kaldi_io.read_mat_scp(file):
...
import kaldi_io
with open(ark_file,'wb') as f:
for key,mat in dict.iteritems():
kaldi_io.write_mat(f, mat, key=key)
import kaldi_io
ark_scp_output='ark:| copy-feats --compress=true ark:- ark,scp:data/feats2.ark,data/feats2.scp'
with kaldi_io.open_or_fd(ark_scp_output,'wb') as f:
for key,mat in dict.iteritems():
kaldi_io.write_mat(f, mat, key=key)
- from pypi:
pip install kaldi_io
- from sources:
git clone https://github.com/vesis84/kaldi-io-for-python.git <kaldi-io-dir>`
pip install -r requirements.txt
pip install --editable .
Note: it is recommended to set export KALDI_ROOT=<some_kaldi_dir>
environment variable.
The I/O based on pipes can then contain kaldi binaries.
(note: these are not included in pypi package)
Unit tests are started this way:
./run_tests.sh
or by:
python3 -m unittest discover -s tests -t .
python2 -m unittest discover -s tests -t .
Apache License, Version 2.0 ('LICENSE-2.0.txt')
- accepting pull requests with extensions on GitHub
- accepting feedback via GitHub 'Issues' in the repo