Software Open Access

julius-speech/julius: Release 4.5

Akinobu Lee; Tatsuya Kawahara

What's new in Julius-4.5 Improved VAD

A WebRTC-based voice activity detection (VAD) was introduced to improve voice detection: https://github.com/dpirch/libfvad

Now Julius has two VAD modules: old module (input level and zero-cross based) and the new module (libfvad = model based). Both of them runs concurrently in parallel for an audio input. When the new module is enabled, Julius will detect speech trigger only if both of them outputs speech flag.

Use -fvad option to enable. See Release.txt or julius/Options.md how to use it.

DNN-HMM CPU multi-threading

DNN-HMM computation now supports CPU multi-threading. Multi-threading may speed up DNN-HMM computation to some extent. The number of threads can be specified by num_threads in dnnconf. This feature is enabled by default, and the default number of threads is 2.

XML special character escape on module mode output

Notice for module-mode users: in order to make module output XML-compliant, the XML special characters are now escaped, ex. '>' to '>'. A backward-compat option -noxmlescape is available to inhibit the new escaping feature and go back to the same behavior as previous versions.

Moving to UTF-8

We have moved to UTF-8. The codes and documents in master branch has been unified to UTF-8 encoding.

For backward compatibility and make log tracking easy, the recent release codes with old encoding is kept at branch "master-4.5-legacy". If you want to see the code before the release of 4.5 (2019/1/2) in old encoding, please checkout the kept branch.

Markdown

We are moving from text file to markdown for easier update. Currently available ones are:

Others will come soon.

Files (10.2 MB)
Name Size
julius-speech/julius-v4.5.zip
md5:dba862fe2f3a9b4444656916a592ead6
10.2 MB Download
1,724
407
views
downloads
All versions This version
Views 1,7241,724
Downloads 407407
Data volume 4.1 GB4.1 GB
Unique views 1,5671,567
Unique downloads 369369

Share

Cite as