In the following section, denotes a random variable of a word, while denotes a state.
Moverover, means a tuple of . If you have ever write python or matlab, you would be familar to this notation.
For example, denotes the first word and means the first word is this word(ex. , …)
To derive bigram part of Viterbi algorithm, define:
where:
For , initialize first timestep like:
To derive trigram, define:
Then, by same method, we can get:
For and , initialize first timestep like: