Published May 11, 2022 | Version 1.2
Software Open

Lahjoita puhetta baseline Kaldi ASR model

  • 1. Aalto University

Description

Baseline speech recognition model for the Lahjoita puhetta corpus, built with the Kaldi toolkit and trained on 1600 hours of Finnish speech. The model is described in the paper "Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks". For more details and usage instructions, see the GitHub page.

This upload includes the acoustic model in "am_lp_1600h.zip", the i-vector extractor in "extractor.zip", the word-based decoding graph in "graph_word_lp_web_dsp.zip", and the subword-based decoding graph in "graph_morfessor_lp_web_dsp.zip".

Files

Files (1.7 GB)

MD5 checksum                        Size
6b7fa435ae78d4777de73bf619a4977d    64.2 MB
9ca8a1d50a5ff2ab6df7bee88c8440f1    18.4 MB
b1ec09aa2879ec5acfdeba3c495ae48e    829.1 MB
3957fe88b13974ea55f52bb6e72d5c77    812.6 MB
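
The MD5 checksums listed above can be used to confirm that the archives downloaded intact. The following is a minimal Python sketch, not part of the original release. It assumes the four archives named in the description have been saved to a single directory. The listing does not show which checksum belongs to which archive, so the sketch only checks that each file's digest is one of the four listed values. The helper names `md5_of_file` and `check_downloads` are hypothetical.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing above.
LISTED_MD5 = {
    "6b7fa435ae78d4777de73bf619a4977d",
    "9ca8a1d50a5ff2ab6df7bee88c8440f1",
    "b1ec09aa2879ec5acfdeba3c495ae48e",
    "3957fe88b13974ea55f52bb6e72d5c77",
}

# Archive names taken from the description of this upload.
ARCHIVES = [
    "am_lp_1600h.zip",
    "extractor.zip",
    "graph_word_lp_web_dsp.zip",
    "graph_morfessor_lp_web_dsp.zip",
]


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_downloads(directory="."):
    """Map each expected archive name to 'ok', 'missing', or 'checksum mismatch'.

    Assumption: the name-to-checksum pairing is unknown here, so a file
    counts as 'ok' when its digest matches any of the listed values.
    """
    results = {}
    for name in ARCHIVES:
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) in LISTED_MD5:
            results[name] = "ok"
        else:
            results[name] = "checksum mismatch"
    return results


if __name__ == "__main__":
    for name, status in check_downloads().items():
        print(f"{name}: {status}")
```

Run the script from the download directory. Any archive reported as "checksum mismatch" is probably truncated or corrupted and should be downloaded again before unpacking.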