Winedays

@Winedays

Joined on Jul 1, 2019

  • 紀錄各種 Speech/Text Corpus 的相關資訊,以中英文相關 data 為主,包含實驗室已有的 Corpus 及已知公開免費或授權的 Corpus,收費 Corpus 暫且不作紀錄 ASR Corpus 適合 ASR 用途的 Corpus Name: corpus name Duration: total duration of corpus, '-' meant unknow Info.: information of corpus Data Perpare: data perpare code using in Kaldi if any Link: link to corpus main or download page, and the machine name if already download
     Like  Bookmark
  • http://www.speech.sri.com/projects/srilm/manpages/ngram-class.1.html NAME ngram-class - induce word classes from N-gram statistics SYNOPSIS ngram-class [ -help ] option ... DESCRIPTION ngram-class induces word classes from distributional statistics, so as to minimize perplexity of a class-based N-gram model given the provided word N-gram counts. Presently, ==only bigram statistics are used==, i.e., the induced classes are best suited for a class-bigram language model.
     Like  Bookmark