# Our (漢字) (char) decode: decode_asr_branchformer.yaml train: 1. (hhw) asr2/train_asr_conformer_e12_amp_smil.yaml 2. (tty) asr2/train_asr_branchformer.yaml 3. (hhw) asr2/train_asr_ebranchformer_small.yaml 4. (tty) 9espnet/whisper/medium (3 epoch) 5. (hhw) asr4/train_asr_conformer7_wavlm_large_smil.yaml 6. (tty) asr4/train_asr_conformer7_xlsr_large.yaml 7. (hhw) asr4/train_asr_conformer7_hubert_large_smil.yaml 8. (tty) asr4/train_asr_conformer7_chinesehubert_large.yaml 9. (tty) asr4/train_asr_ebranchformer_xlsr_large.yaml ## test | test | yaml | features | LM & decode | CER | note | |:-------------:|:----:|:----------------:|:------------------:|:----------:|:----:| | conformer | 1 | fbank | w/o LM + acc.ave | 4.11 (5.3) | | | branchformer | 2 | fbank | w/o LM + acc.ave | 4.63 (6.1) | | | ebranchformer | 3 | fbank | w/o LM + acc.ave | 4.07 | | | whisper | 4 | spectrogram | w/o LM + acc.ave | 2.96 | | | conformer | 5 | wavlm_large | w/o LM + acc.ave | 2.06 | | | conformer | 6 | xlsr2_960m_1000k | w/o LM + acc.ave | 1.95 | | | conformer | 6 | xlsr2_960m_1000k | Full_LM + acc.ave | 1.77 | | | conformer | 6 | xlsr2_960m_1000k | Four_LM + acc.ave | 1.69 | | | conformer | 7 | hubert_large | w/o LM + acc.ave | 2.15 | | | conformer | 8 | ch_hubert_large | w/o LM + acc.ave | 2.00 | | | ebranchformer | 9 | xlsr2_960m_1000k | w/o LM + acc.ave | 2.12 | | ## XYH-8-X | XYH-8-X | yaml | features | LM & decode | note | |:---------:|:----:|:----------------:|:----------------:|:----:| | conformer | 6 | xlsr2_960m_1000k | w/o LM + acc.ave | done | # Our (Pinyin) (word) 1. asr1/train_asr_branchformer.yaml ## test | test | yaml | LM | decode | WER | CER | |:----------------------------:|:----:|:---:|:-------------:|:---:|:---:| | branchformer | 1 | X | valid.acc.ave | 5.3 | 1.9 | | conformer + xlsr2_960m_1000k | | X | valid.acc.ave | 4.5 | 1.4 | | whisper (3 epoch) | | X | valid.acc.ave | 4.3 | 1.4 | ## XYH-8-X | test | yaml | LM | decode | note | |:------------:|:----:|:---:|:-------------:|:----:| | branchformer | 1 | X | valid.acc.ave | | # Hakka 漢字 Baseline (char) 1. asr2/train_asr_branchformer.yaml 2. asr2/train_asr_conformer_e12_amp.yaml 3. asr4/train_asr_conformer+wavlm.yaml & train_lm_transformer2.yaml 4. asr4/train_asr_conformer7_wavlm_large.yaml ## test | test | yaml | LM | decode | WER | CER | - | |:------------:|:-----:|:---:|:--------------:|:----:|:---:|:----:| | branchformer | 1 | X | valid.acc.ave | 17.8 | 6.1 | | | conformer | 2 | X | valid.acc.ave | | 5.3 | home | | conformer | **3** | V | valid.acc.ave | 50.0 | 6.7 | | | conformer | **4** | X | valid.acc.best | | 3.3 | home | ## dev | dev | yaml | LM | decode | WER | CER | |:------------:|:-----:|:---:|:--------------:|:----:|:---:| | branchformer | 1 | X | valid.acc.ave | 20.3 | 7.0 | | conformer | 2 | X | valid.acc.ave | - | - | | conformer | **3** | V | valid.acc.ave | 46.3 | 6.4 | | conformer | **4** | X | valid.acc.best | - | - | # Hakka Pinyin Baseline (BPE) 1. asr1/train_asr_branchformer.yaml 2. asr1/train_asr_conformer_e12_amp.yaml 3. asr3/train_asr_conformer+wavlm.yaml & asr3/train_lm_transformer2.yaml 4. asr3/train_asr_conformer7_wavlm_large.yaml ## test | test | yaml | LM | decode | WER | CER | TER | - | |:------------:|:--------:|:---:|:--------------:|:---:|:---:|:---:|:----:| | branchformer | 1 | X | valid.acc.ave | 5.1 | 1.7 | 3.1 | | | conformer | 2 | X | valid.acc.best | 8.8 | 3.7 | - | home | | conformer | **3** | V | valid.loss.ave | 5.5 | 2.0 | 3.5 | | | conformer | **3_v2** | V | valid.loss.ave | 4.7 | 1.6 | 2.9 | | | conformer | **4** | X | valid.acc.ave | 3.9 | 1.2 | - | home | | conformer | **4** | X | valid.acc.best | 4.7 | 1.5 | - | home | ## dev | dev | yaml | LM | decode | WER | CER | TER | |:------------:|:--------:|:---:|:--------------:|:---:|:---:|:---:| | branchformer | 1 | X | valid.acc.ave | 5.1 | 1.6 | 3.5 | | conformer | 2 | X | valid.acc.best | - | - | - | | conformer | **3** | V | valid.loss.ave | 5.5 | 1.9 | 3.8 | | conformer | **3_v2** | V | valid.loss.ave | 4.6 | 1.4 | 3.2 | | conformer | **4** | X | valid.acc.ave | - | - | - | | conformer | **4** | X | valid.acc.best | - | - | - |